Generate sandwich OCR PDFs from scanned file


/api/formula/pdfsandwich.json (JSON API)

Formula code on GitHub

Current versions:

stable 0.1.7
head ⚡️ HEAD
bottle 🍾 mojave, high_sierra, sierra, el_capitan

Depends on:

exact-image 1.0.1 Image processing library
ghostscript 9.26 Interpreter for PostScript and PDF
imagemagick 7.0.8-23 Tools and libraries to manipulate images in many formats
poppler 0.73.0 PDF rendering library (based on the xpdf-3.0 code base)
tesseract 4.0.0 OCR (Optical Character Recognition) engine
unpaper 6.1 Post-processing for scanned/photocopied books

Depends on when building from source:

gawk 4.2.1 GNU awk utility
ocaml 4.07.1 General purpose programming language in the ML family


Installs (30 days)
pdfsandwich 92
Installs on Request (30 days)
pdfsandwich 92
Build Errors (30 days)
pdfsandwich 0
Installs (90 days)
pdfsandwich 263
Installs on Request (90 days)
pdfsandwich 263
Installs (365 days)
pdfsandwich 697
Installs on Request (365 days)
pdfsandwich 697
Fork me on GitHub