ocrmypdf

Adds an OCR text layer to scanned PDF files

https://github.com/jbarlow83/OCRmyPDF

License: MPL-2.0

/api/formula/ocrmypdf.json (JSON API)

Formula code on GitHub

Current versions:

stable 11.1.1
bottle 🍾 catalina, mojave, high_sierra

Depends on:

freetype 2.10.2 Software library to render fonts
ghostscript 9.53.2 Interpreter for PostScript and PDF
jbig2enc 0.29 JBIG2 encoder (for monochrome documents)
jpeg 9d Image manipulation library
leptonica 1.80.0 Image processing and image analysis library
libpng 1.6.37 Library for manipulating PNG images
pngquant 2.12.5 PNG image optimizing utility
pybind11 2.5.0 Seamless operability between C++11 and Python
python@3.8 3.8.5 Interpreted, interactive, object-oriented programming language
qpdf 10.0.1 Tools for and transforming and inspecting PDF files
tesseract 4.1.1 OCR (Optical Character Recognition) engine
unpaper 6.1 Post-processing for scanned/photocopied books

Depends on when building from source:

pkg-config 0.29.2 Manage compile and link flags for libraries

Analytics:

Installs (30 days)
ocrmypdf 2,131
Installs on Request (30 days)
ocrmypdf 2,132
Build Errors (30 days)
ocrmypdf 0
Installs (90 days)
ocrmypdf 6,386
Installs on Request (90 days)
ocrmypdf 6,383
Installs (365 days)
ocrmypdf 19,149
Installs on Request (365 days)
ocrmypdf 19,130
Fork me on GitHub