Adds an OCR text layer to scanned PDF files
https://ocrmypdf.readthedocs.io/en/latest/
License: MPL-2.0
Formula JSON API: /api/formula/ocrmypdf.json
Formula code: ocrmypdf.rb on GitHub
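The formula metadata listed on this page is also published as JSON at the API path above. As a minimal sketch, the snippet below parses a hand-written sample shaped like such a document and prints a short summary. The field names (`name`, `versions.stable`, `license`, `dependencies`, `build_dependencies`) are assumptions about the JSON layout, the sample is trimmed to a few of the dependencies listed on this page, and `summarize_formula` is a hypothetical helper, not part of Homebrew.

```python
import json

# Hand-written sample, NOT a real API response. The field names are assumed;
# the values are copied from this page and the dependency list is trimmed.
SAMPLE_FORMULA_JSON = """
{
  "name": "ocrmypdf",
  "desc": "Adds an OCR text layer to scanned PDF files",
  "license": "MPL-2.0",
  "homepage": "https://ocrmypdf.readthedocs.io/en/latest/",
  "versions": {"stable": "16.7.0"},
  "dependencies": ["ghostscript", "qpdf", "tesseract", "unpaper"],
  "build_dependencies": ["pkgconf"]
}
"""


def summarize_formula(raw: str) -> dict:
    """Extract a few fields from a formula JSON string (hypothetical helper)."""
    data = json.loads(raw)
    return {
        "name": data.get("name"),
        "stable": data.get("versions", {}).get("stable"),
        "license": data.get("license"),
        # Missing keys fall back to empty lists so partial documents still parse.
        "runtime_deps": sorted(data.get("dependencies", [])),
        "build_deps": sorted(data.get("build_dependencies", [])),
    }


summary = summarize_formula(SAMPLE_FORMULA_JSON)
print(f"{summary['name']} {summary['stable']} ({summary['license']})")
print("runtime:", ", ".join(summary["runtime_deps"]))
print("build:", ", ".join(summary["build_deps"]))
```

To run this against live data, the sample string could be replaced with the body fetched from the JSON API path on the Homebrew formulae site. The field names should be checked against the actual response first.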
Bottle (binary package) installation support provided for:
Platform | OS version | Bottle |
---|---|---|
Apple Silicon | sequoia | ✅ |
Apple Silicon | sonoma | ✅ |
Apple Silicon | ventura | ✅ |
Intel | sonoma | ✅ |
Intel | ventura | ✅ |
64-bit linux | | ✅ |
Current versions:
stable | ✅ | 16.7.0 |
Depends on:
cryptography | 44.0.0 | Cryptographic recipes and primitives for Python |
freetype | 2.13.3 | Software library to render fonts |
ghostscript | 10.04.0 | Interpreter for PostScript and PDF |
img2pdf | 0.5.1 | Convert images to PDF via direct JPEG inclusion |
jbig2enc | 0.29 | JBIG2 encoder (for monochrome documents) |
libheif | 1.19.5 | ISO/IEC 23008-12:2017 HEIF file format decoder and encoder |
libpng | 1.6.44 | Library for manipulating PNG images |
pillow | 11.0.0 | Friendly PIL fork (Python Imaging Library) |
pngquant | 3.0.3 | PNG image optimizing utility |
pybind11 | 2.13.6 | Seamless operability between C++11 and Python |
python@3.13 | 3.13.1 | Interpreted, interactive, object-oriented programming language |
qpdf | 11.9.1 | Tools for transforming and inspecting PDF files |
tesseract | 5.5.0 | OCR (Optical Character Recognition) engine |
unpaper | 7.0.0 | Post-processing for scanned/photocopied books |
Depends on when building from source:
pkgconf | 2.3.0 | Package compiler and linker metadata toolkit |
Analytics:
Installs (30 days) | |
---|---|
ocrmypdf | 2,494 |
ocrmypdf --HEAD | 3 |
Installs on Request (30 days) | |
ocrmypdf | 2,493 |
ocrmypdf --HEAD | 3 |
Build Errors (30 days) | |
ocrmypdf | 2 |
Installs (90 days) | |
ocrmypdf | 7,033 |
ocrmypdf --HEAD | 10 |
Installs on Request (90 days) | |
ocrmypdf | 7,033 |
ocrmypdf --HEAD | 10 |
Installs (365 days) | |
ocrmypdf | 29,852 |
ocrmypdf --HEAD | 55 |
Installs on Request (365 days) | |
ocrmypdf | 29,851 |
ocrmypdf --HEAD | 55 |