pdfsandwich

Generate sandwich OCR PDFs from scanned file

http://www.tobias-elze.de/pdfsandwich/

/api/formula/pdfsandwich.json (JSON API)

Formula code on GitHub

Current versions:

stable 0.1.7
head ⚡️ HEAD
bottle 🍾 mojave, high_sierra, sierra, el_capitan

Depends on:

exact-image 1.0.1 Image processing library
ghostscript 9.25 Interpreter for PostScript and PDF
imagemagick 7.0.8-12 Tools and libraries to manipulate images in many formats
poppler 0.69.0 PDF rendering library (based on the xpdf-3.0 code base)
tesseract 3.05.02 OCR (Optical Character Recognition) engine
unpaper 6.1 Post-processing for scanned/photocopied books

Depends on when building from source:

gawk 4.2.1 GNU awk utility
ocaml 4.07.0 General purpose programming language in the ML family

Analytics:

Installs (30 days)
pdfsandwich 123
Installs on Request (30 days)
pdfsandwich 124
Build Errors (30 days)
pdfsandwich 0
Installs (90 days)
pdfsandwich 140
Installs on Request (90 days)
pdfsandwich 140
Installs (365 days)
pdfsandwich 431
Installs on Request (365 days)
pdfsandwich 430
Fork me on GitHub