Generate sandwich OCR PDFs from scanned file


/api/formula/pdfsandwich.json (JSON API)

Formula code on GitHub

Current versions:

stable 0.1.7
head ⚡️ HEAD
bottle 🍾 mojave, high_sierra, sierra, el_capitan

Depends on:

exact-image 1.0.2 Image processing library
ghostscript 9.52 Interpreter for PostScript and PDF
imagemagick 7.0.10-0 Tools and libraries to manipulate images in many formats
poppler 0.86.1 PDF rendering library (based on the xpdf-3.0 code base)
tesseract 4.1.1 OCR (Optical Character Recognition) engine
unpaper 6.1 Post-processing for scanned/photocopied books

Depends on when building from source:

gawk 5.0.1 GNU awk utility
ocaml 4.09.0 General purpose programming language in the ML family


Installs (30 days)
pdfsandwich 39
Installs on Request (30 days)
pdfsandwich 39
Build Errors (30 days)
pdfsandwich 0
Installs (90 days)
pdfsandwich 205
Installs on Request (90 days)
pdfsandwich 205
Installs (365 days)
pdfsandwich 730
Installs on Request (365 days)
pdfsandwich 730
Fork me on GitHub