ocrmypdf

Install command:
brew install ocrmypdf

Adds an OCR text layer to scanned PDF files

https://ocrmypdf.readthedocs.io/en/latest/

License: MPL-2.0

Formula JSON API: /api/formula/ocrmypdf.json

Formula code: ocrmypdf.rb on GitHub

Bottle (binary package) installation support provided for:

Apple Silicon sonoma
ventura
monterey
Intel sonoma
ventura
monterey
64-bit linux

Current versions:

stable 16.5.0

Depends on:

cryptography 43.0.1 Cryptographic recipes and primitives for Python
freetype 2.13.3 Software library to render fonts
ghostscript 10.03.1 Interpreter for PostScript and PDF
img2pdf 0.5.1 Convert images to PDF via direct JPEG inclusion
jbig2enc 0.29 JBIG2 encoder (for monochrome documents)
libheif 1.18.2 ISO/IEC 23008-12:2017 HEIF file format decoder and encoder
libpng 1.6.43 Library for manipulating PNG images
pillow 10.4.0 Friendly PIL fork (Python Imaging Library)
pngquant 3.0.3 PNG image optimizing utility
pybind11 2.13.5 Seamless operability between C++11 and Python
python@3.12 3.12.5 Interpreted, interactive, object-oriented programming language
qpdf 11.9.1 Tools for and transforming and inspecting PDF files
tesseract 5.4.1 OCR (Optical Character Recognition) engine
unpaper 7.0.0 Post-processing for scanned/photocopied books

Depends on when building from source:

pkg-config 0.29.2 Manage compile and link flags for libraries

Analytics:

Installs (30 days)
ocrmypdf 2,408
ocrmypdf --HEAD 5
Installs on Request (30 days)
ocrmypdf 2,408
ocrmypdf --HEAD 5
Build Errors (30 days)
ocrmypdf 0
Installs (90 days)
ocrmypdf 8,306
ocrmypdf --HEAD 18
Installs on Request (90 days)
ocrmypdf 8,306
ocrmypdf --HEAD 18
Installs (365 days)
ocrmypdf 34,313
ocrmypdf --HEAD 58
Installs on Request (365 days)
ocrmypdf 34,310
ocrmypdf --HEAD 58
Fork me on GitHub