Adds an OCR text layer to scanned PDF files


/api/formula-linux/ocrmypdf.json (JSON API)
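
The JSON API path above can be read programmatically. The following is a minimal sketch, not a definitive client. It assumes the caller supplies the host (this page does not state one, so `https://example.invalid` is a placeholder) and that the payload uses keys named `name`, `versions`, `dependencies`, and `build_dependencies`. Those key names are assumptions, which is why every lookup falls back to a default. `SAMPLE` is hand-written from values listed on this page and is not a real API response.

```python
import json
from urllib.parse import urljoin

# Path copied from this page; the host is NOT given here, so callers pass it in.
API_PATH = "/api/formula-linux/ocrmypdf.json"


def api_url(base_url: str) -> str:
    """Join a caller-supplied base URL with the API path shown on this page."""
    return urljoin(base_url, API_PATH)


def summarize(payload: str) -> dict:
    """Extract a few fields from a formula JSON document.

    The key names used below are assumptions about the payload layout,
    so each lookup has a default instead of raising KeyError.
    """
    data = json.loads(payload)
    return {
        "name": data.get("name"),
        "stable": data.get("versions", {}).get("stable"),
        "dependencies": data.get("dependencies", []),
        "build_dependencies": data.get("build_dependencies", []),
    }


# Hypothetical sample built from values on this page (not a real response).
SAMPLE = json.dumps({
    "name": "ocrmypdf",
    "versions": {"stable": "9.6.1"},
    "dependencies": ["ghostscript", "qpdf", "tesseract"],
    "build_dependencies": ["pkg-config"],
})

if __name__ == "__main__":
    print(api_url("https://example.invalid"))
    print(summarize(SAMPLE))
```

To use it against a live service, fetch the text at `api_url(<your host>)` with any HTTP client and pass the response body to `summarize`. Keeping the parsing separate from the network call makes the sketch testable offline.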

Linux formula code on GitHub

Current versions:

stable 9.6.1
bottle 🍾 catalina, mojave, high_sierra, x86_64_linux

Depends on:

freetype 2.10.1 Software library to render fonts
ghostscript 9.52 Interpreter for PostScript and PDF
jbig2enc 0.29 JBIG2 encoder (for monochrome documents)
jpeg 9d Image manipulation library
leptonica 1.79.0 Image processing and image analysis library
libpng 1.6.37 Library for manipulating PNG images
libxml2 2.9.10 GNOME XML library
pngquant 2.12.5 PNG image optimizing utility
pybind11 2.4.3 Seamless operability between C++11 and Python
python 3.7.7 Interpreted, interactive, object-oriented programming language
qpdf 9.1.1 Tools for transforming and inspecting PDF files
tesseract 4.1.1 OCR (Optical Character Recognition) engine
unpaper 6.1 Post-processing for scanned/photocopied books
libffi 3.2.1 Portable Foreign Function Interface library
libxslt 1.1.34 C XSLT library for GNOME
zlib 1.2.11 General-purpose lossless data-compression library

Depends on when building from source:

pkg-config 0.29.2 Manage compile and link flags for libraries


Installs (30 days): ocrmypdf 22
Installs on Request (30 days): ocrmypdf 22
Build Errors (30 days): ocrmypdf 0
Installs (90 days): ocrmypdf 62
Installs on Request (90 days): ocrmypdf 62
Installs (365 days): ocrmypdf 66
Installs on Request (365 days): ocrmypdf 66