ocrmypdf

Install command:
brew install ocrmypdf

Adds an OCR text layer to scanned PDF files

https://ocrmypdf.readthedocs.io/en/latest/
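
As a rough illustration of what the installed tool does, the sketch below builds a basic `ocrmypdf` command line and runs it only if the executable is on the PATH. The helper names `build_ocrmypdf_command` and `run_ocr`, and the file names `scan.pdf` and `scan_ocr.pdf`, are hypothetical and chosen for this example; `--language` and `--deskew` are existing ocrmypdf options, but the linked documentation remains the authority on available flags.

```python
import shutil
import subprocess


def build_ocrmypdf_command(input_pdf, output_pdf, language="eng", deskew=False):
    """Return the argument list for a basic ocrmypdf run (hypothetical helper)."""
    cmd = ["ocrmypdf", "--language", language]
    if deskew:
        cmd.append("--deskew")  # straighten crooked scans before OCR
    cmd += [input_pdf, output_pdf]
    return cmd


def run_ocr(input_pdf, output_pdf, **options):
    """Run ocrmypdf if it is installed; raise if it is missing or exits with an error."""
    if shutil.which("ocrmypdf") is None:
        raise FileNotFoundError("ocrmypdf not found; try: brew install ocrmypdf")
    subprocess.run(build_ocrmypdf_command(input_pdf, output_pdf, **options), check=True)


if __name__ == "__main__":
    # "scan.pdf" and "scan_ocr.pdf" are hypothetical file names.
    print(" ".join(build_ocrmypdf_command("scan.pdf", "scan_ocr.pdf", deskew=True)))
```

Keeping command construction separate from execution makes the argument list easy to inspect before anything is run.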

License: MPL-2.0

Formula JSON API: /api/formula/ocrmypdf.json
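
The API path above is relative. The sketch below shows one way it might be resolved and its contents summarized, under two assumptions that this page does not confirm: that the path is served from `https://formulae.brew.sh`, and that the JSON document exposes `name`, `versions.stable`, `dependencies` and `build_dependencies` fields. The sample payload is hand-written from values listed on this page (abbreviated), not a real API response, and the function names are hypothetical.

```python
import json
from urllib.parse import urljoin

# Assumption: the relative API path on this page is served from this host.
BASE_URL = "https://formulae.brew.sh"


def formula_api_url(name, base_url=BASE_URL):
    """Build the absolute URL of a formula's JSON document."""
    return urljoin(base_url, f"/api/formula/{name}.json")


def summarize_formula(payload):
    """Pick a few fields from a formula JSON document (field names are assumed)."""
    return {
        "name": payload.get("name"),
        "stable": payload.get("versions", {}).get("stable"),
        "dependencies": payload.get("dependencies", []),
        "build_dependencies": payload.get("build_dependencies", []),
    }


# Hand-written sample in the assumed shape; values copied from this page, abbreviated.
SAMPLE = json.loads("""
{
  "name": "ocrmypdf",
  "versions": {"stable": "16.6.2"},
  "dependencies": ["ghostscript", "qpdf", "tesseract", "unpaper"],
  "build_dependencies": ["pkgconf"]
}
""")

if __name__ == "__main__":
    print(formula_api_url("ocrmypdf"))
    print(summarize_formula(SAMPLE))
```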

Formula code: ocrmypdf.rb on GitHub

Bottle (binary package) installation support provided for:

Apple Silicon: sequoia, sonoma, ventura
Intel: sonoma, ventura
64-bit linux

Current versions:

stable 16.6.2

Depends on:

cryptography 43.0.3 Cryptographic recipes and primitives for Python
freetype 2.13.3 Software library to render fonts
ghostscript 10.04.0 Interpreter for PostScript and PDF
img2pdf 0.5.1 Convert images to PDF via direct JPEG inclusion
jbig2enc 0.29 JBIG2 encoder (for monochrome documents)
libheif 1.19.5 ISO/IEC 23008-12:2017 HEIF file format decoder and encoder
libpng 1.6.44 Library for manipulating PNG images
pillow 11.0.0 Friendly PIL fork (Python Imaging Library)
pngquant 3.0.3 PNG image optimizing utility
pybind11 2.13.6 Seamless operability between C++11 and Python
python@3.13 3.13.0 Interpreted, interactive, object-oriented programming language
qpdf 11.9.1 Tools for transforming and inspecting PDF files
tesseract 5.5.0 OCR (Optical Character Recognition) engine
unpaper 7.0.0 Post-processing for scanned/photocopied books

Depends on when building from source:

pkgconf 2.3.0 Package compiler and linker metadata toolkit

Analytics:

Installs (30 days)
ocrmypdf 3,282
ocrmypdf --HEAD 6
Installs on Request (30 days)
ocrmypdf 3,282
ocrmypdf --HEAD 6
Build Errors (30 days)
ocrmypdf 7
Installs (90 days)
ocrmypdf 6,866
ocrmypdf --HEAD 10
Installs on Request (90 days)
ocrmypdf 6,867
ocrmypdf --HEAD 10
Installs (365 days)
ocrmypdf 30,490
ocrmypdf --HEAD 56
Installs on Request (365 days)
ocrmypdf 30,488
ocrmypdf --HEAD 56