ocrmypdf

Install command:
brew install ocrmypdf

Adds an OCR text layer to scanned PDF files

https://ocrmypdf.readthedocs.io/en/latest/
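
Once installed, the tool is typically invoked as `ocrmypdf input.pdf output.pdf`. The following is a minimal Python sketch that assembles such a command and runs it only when the executable is found on PATH. The helper names `build_ocr_command` and `run_ocr` and the file names are hypothetical. The `-l` (language) and `--deskew` options belong to the ocrmypdf command line, but it is worth confirming them with `ocrmypdf --help` for the installed version.

```python
import shutil
import subprocess


def build_ocr_command(input_pdf, output_pdf, language="eng", deskew=False):
    """Return the argument list for an ocrmypdf run (hypothetical helper)."""
    cmd = ["ocrmypdf", "-l", language]
    if deskew:
        # Straighten crooked scans before OCR.
        cmd.append("--deskew")
    cmd += [input_pdf, output_pdf]
    return cmd


def run_ocr(input_pdf, output_pdf, **options):
    """Run ocrmypdf if it is installed.

    Returns False when the executable is missing, True when the run succeeds,
    and raises subprocess.CalledProcessError when ocrmypdf exits with an error.
    """
    if shutil.which("ocrmypdf") is None:
        return False
    subprocess.run(build_ocr_command(input_pdf, output_pdf, **options), check=True)
    return True
```

For example, `run_ocr("scan.pdf", "scan-ocr.pdf", language="deu", deskew=True)` would process a German-language scan, assuming the matching Tesseract language data is installed.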

License: MPL-2.0

Formula JSON API: /api/formula/ocrmypdf.json
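
The JSON document at this path can be read programmatically. The sketch below parses a trimmed, hand-written sample instead of fetching anything over the network. The host `https://formulae.brew.sh` and the field names `versions.stable`, `dependencies` and `build_dependencies` are assumptions based on the public Homebrew formula API and should be checked against a live response. The helper names are hypothetical.

```python
import json

# Assumed host for the relative API path shown above.
API_BASE = "https://formulae.brew.sh"


def formula_api_url(name):
    """Build the full URL of a formula's JSON document (assumed layout)."""
    return f"{API_BASE}/api/formula/{name}.json"


def summarize_formula(payload):
    """Extract a few fields from a formula JSON string (assumed field names)."""
    data = json.loads(payload)
    return {
        "name": data["name"],
        "stable": data["versions"]["stable"],
        "dependencies": data.get("dependencies", []),
        "build_dependencies": data.get("build_dependencies", []),
    }


# Trimmed sample using values listed on this page; not a real API response.
SAMPLE = json.dumps({
    "name": "ocrmypdf",
    "versions": {"stable": "16.4.0"},
    "dependencies": ["ghostscript", "qpdf", "tesseract"],
    "build_dependencies": ["pkg-config"],
})

summary = summarize_formula(SAMPLE)
print(formula_api_url("ocrmypdf"))
print(summary["stable"], summary["dependencies"])
```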

Formula code: ocrmypdf.rb on GitHub

Bottle (binary package) installation support provided for:

Apple Silicon: sonoma, ventura, monterey
Intel: sonoma, ventura, monterey
64-bit linux

Current versions:

stable 16.4.0

Depends on:

cryptography 42.0.8 Cryptographic recipes and primitives for Python
freetype 2.13.2 Software library to render fonts
ghostscript 10.03.1 Interpreter for PostScript and PDF
img2pdf 0.5.1 Convert images to PDF via direct JPEG inclusion
jbig2enc 0.29 JBIG2 encoder (for monochrome documents)
libheif 1.17.6 ISO/IEC 23008-12:2017 HEIF file format decoder and encoder
libpng 1.6.43 Library for manipulating PNG images
pillow 10.3.0 Friendly PIL fork (Python Imaging Library)
pngquant 3.0.3 PNG image optimizing utility
pybind11 2.12.0 Seamless operability between C++11 and Python
python@3.12 3.12.4 Interpreted, interactive, object-oriented programming language
qpdf 11.9.1 Tools for transforming and inspecting PDF files
tesseract 5.4.1 OCR (Optical Character Recognition) engine
unpaper 7.0.0 Post-processing for scanned/photocopied books

Depends on when building from source:

pkg-config 0.29.2 Manage compile and link flags for libraries

Analytics:

Period     Metric                ocrmypdf   ocrmypdf --HEAD
30 days    Installs              2,311      6
30 days    Installs on Request   2,312      6
30 days    Build Errors          0
90 days    Installs              6,055      16
90 days    Installs on Request   6,054      16
365 days   Installs              32,361     51
365 days   Installs on Request   32,358     51