ocrmypdf

Install command:
brew install ocrmypdf

Adds an OCR text layer to scanned PDF files

https://ocrmypdf.readthedocs.io/en/latest/
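
Once the formula is installed, the tool can be driven from a script. The following is a minimal sketch, not the project's own recommended wrapper: the helper names `build_ocr_command` and `run_ocr` are hypothetical, and it assumes the `ocrmypdf` executable is on PATH and accepts the `--language` and `--deskew` options followed by the input and output paths.

```python
import shutil
import subprocess
from pathlib import Path


def build_ocr_command(src, dst, language="eng", deskew=False):
    """Return the argument list for one ocrmypdf run (nothing is executed)."""
    cmd = ["ocrmypdf", "--language", language]
    if deskew:
        # Optional: straighten crooked scans before OCR.
        cmd.append("--deskew")
    cmd += [str(src), str(dst)]
    return cmd


def run_ocr(src, dst, **options):
    """Run ocrmypdf on src and write dst; raise if the tool is missing or fails."""
    if shutil.which("ocrmypdf") is None:
        raise FileNotFoundError(
            "ocrmypdf not found on PATH; try `brew install ocrmypdf`"
        )
    subprocess.run(build_ocr_command(src, dst, **options), check=True)
    return Path(dst)
```

Keeping the command construction separate from execution makes the arguments easy to inspect or log before any file is touched; `run_ocr("scan.pdf", "scan-ocr.pdf", deskew=True)` would then perform the actual conversion.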

License: MPL-2.0

Formula JSON API: /api/formula/ocrmypdf.json
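
The JSON document can be read programmatically to check the stable version or the dependency list. This is a rough sketch under stated assumptions: the relative path above is assumed to be served from `formulae.brew.sh`, the field names (`name`, `versions.stable`, `license`, `dependencies`, `build_dependencies`) are assumed from the usual shape of the formula JSON, and `summarize_formula` and `fetch_formula` are hypothetical helper names.

```python
import json
import urllib.request

# Assumption: the relative API path is hosted on formulae.brew.sh.
API_URL = "https://formulae.brew.sh/api/formula/ocrmypdf.json"


def summarize_formula(data):
    """Pick a few fields out of a decoded formula JSON document.

    Missing keys yield None or an empty list instead of raising.
    """
    return {
        "name": data.get("name"),
        "stable": data.get("versions", {}).get("stable"),
        "license": data.get("license"),
        "dependencies": sorted(data.get("dependencies", [])),
        "build_dependencies": sorted(data.get("build_dependencies", [])),
    }


def fetch_formula(url=API_URL, timeout=10):
    """Download and decode the formula JSON (requires network access)."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)
```

Calling `summarize_formula(fetch_formula())` would return a small dictionary that can be compared against the versions and dependencies listed on this page.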

Formula code: ocrmypdf.rb on GitHub

Bottle (binary package) installation support provided for:

Apple Silicon: sequoia, sonoma, ventura
Intel: sonoma, ventura
64-bit linux

Current versions:

stable 16.7.0

Depends on:

cryptography 44.0.0 Cryptographic recipes and primitives for Python
freetype 2.13.3 Software library to render fonts
ghostscript 10.04.0 Interpreter for PostScript and PDF
img2pdf 0.5.1 Convert images to PDF via direct JPEG inclusion
jbig2enc 0.29 JBIG2 encoder (for monochrome documents)
libheif 1.19.5 ISO/IEC 23008-12:2017 HEIF file format decoder and encoder
libpng 1.6.44 Library for manipulating PNG images
pillow 11.0.0 Friendly PIL fork (Python Imaging Library)
pngquant 3.0.3 PNG image optimizing utility
pybind11 2.13.6 Seamless operability between C++11 and Python
python@3.13 3.13.1 Interpreted, interactive, object-oriented programming language
qpdf 11.9.1 Tools for transforming and inspecting PDF files
tesseract 5.5.0 OCR (Optical Character Recognition) engine
unpaper 7.0.0 Post-processing for scanned/photocopied books

Depends on when building from source:

pkgconf 2.3.0 Package compiler and linker metadata toolkit

Analytics:

Installs (30 days)
ocrmypdf 2,494
ocrmypdf --HEAD 3
Installs on Request (30 days)
ocrmypdf 2,493
ocrmypdf --HEAD 3
Build Errors (30 days)
ocrmypdf 2
Installs (90 days)
ocrmypdf 7,033
ocrmypdf --HEAD 10
Installs on Request (90 days)
ocrmypdf 7,033
ocrmypdf --HEAD 10
Installs (365 days)
ocrmypdf 29,852
ocrmypdf --HEAD 55
Installs on Request (365 days)
ocrmypdf 29,851
ocrmypdf --HEAD 55