Generate sandwich OCR PDFs from scanned file


/api/formula/pdfsandwich.json (JSON API)

Formula code on GitHub

Current versions:

stable 0.1.7
head ⚡️ HEAD
bottle 🍾 mojave, high_sierra, sierra, el_capitan

Depends on:

exact-image 1.0.2 Image processing library
ghostscript 9.26 Interpreter for PostScript and PDF
imagemagick 7.0.8-46 Tools and libraries to manipulate images in many formats
poppler 0.76.1 PDF rendering library (based on the xpdf-3.0 code base)
tesseract 4.0.0 OCR (Optical Character Recognition) engine
unpaper 6.1 Post-processing for scanned/photocopied books

Depends on when building from source:

gawk 5.0.0 GNU awk utility
ocaml 4.07.1 General purpose programming language in the ML family


Installs (30 days)
pdfsandwich 46
Installs on Request (30 days)
pdfsandwich 46
Build Errors (30 days)
pdfsandwich 0
Installs (90 days)
pdfsandwich 194
Installs on Request (90 days)
pdfsandwich 194
Installs (365 days)
pdfsandwich 824
Installs on Request (365 days)
pdfsandwich 823
Fork me on GitHub