Generate sandwich OCR PDFs from scanned file


/api/formula/pdfsandwich.json (JSON API)

Formula code on GitHub

Current versions:

stable 0.1.7
head ⚡️ HEAD
bottle 🍾 mojave, high_sierra, sierra, el_capitan

Depends on:

exact-image 1.0.2 Image processing library
ghostscript 9.27 Interpreter for PostScript and PDF
imagemagick 7.0.8-53 Tools and libraries to manipulate images in many formats
poppler 0.79.0 PDF rendering library (based on the xpdf-3.0 code base)
tesseract 4.0.0 OCR (Optical Character Recognition) engine
unpaper 6.1 Post-processing for scanned/photocopied books

Depends on when building from source:

gawk 5.0.1 GNU awk utility
ocaml 4.07.1 General purpose programming language in the ML family


Installs (30 days)
pdfsandwich 70
Installs on Request (30 days)
pdfsandwich 70
Build Errors (30 days)
pdfsandwich 0
Installs (90 days)
pdfsandwich 164
Installs on Request (90 days)
pdfsandwich 164
Installs (365 days)
pdfsandwich 898
Installs on Request (365 days)
pdfsandwich 897
Fork me on GitHub