Adds an OCR text layer to scanned PDF files
https://ocrmypdf.readthedocs.io/en/latest/
License: MPL-2.0
Formula JSON API: /api/formula/ocrmypdf.json
Formula code: ocrmypdf.rb on GitHub
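The JSON API path listed above can be read programmatically. The following is a minimal sketch, not an official client. It assumes the path is served from `formulae.brew.sh` (the host is not given on this page) and that the response carries the keys `name`, `desc`, `license`, `versions.stable`, `dependencies`, and `build_dependencies`. The helper names `formula_url`, `summarize`, and `fetch_formula` are hypothetical.

```python
import json
from urllib.request import urlopen

# Assumption: the API path shown on this page is served from this host.
BASE_URL = "https://formulae.brew.sh"


def formula_url(name: str) -> str:
    """Build the JSON API URL for a formula name."""
    return f"{BASE_URL}/api/formula/{name}.json"


def summarize(formula: dict) -> dict:
    """Pick out the fields that this page displays from a parsed response."""
    return {
        "name": formula.get("name"),
        "desc": formula.get("desc"),
        "license": formula.get("license"),
        "stable": formula.get("versions", {}).get("stable"),
        "dependencies": formula.get("dependencies", []),
        "build_dependencies": formula.get("build_dependencies", []),
    }


def fetch_formula(name: str) -> dict:
    """Download and parse the formula JSON (requires network access)."""
    with urlopen(formula_url(name), timeout=10) as response:
        return json.load(response)


# Example (performs a network request):
# print(summarize(fetch_formula("ocrmypdf")))
```

Keeping `summarize` separate from `fetch_formula` lets the field extraction be exercised on a saved response without network access.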
Bottle (binary package) installation support provided for:
Platform | OS | Bottle |
---|---|---|
Apple Silicon | sonoma | ✅ |
Apple Silicon | ventura | ✅ |
Apple Silicon | monterey | ✅ |
Intel | sonoma | ✅ |
Intel | ventura | ✅ |
Intel | monterey | ✅ |
64-bit linux | | ✅ |
Current versions:
stable | ✅ | 16.5.0 |
Depends on:
cryptography | 43.0.1 | Cryptographic recipes and primitives for Python |
freetype | 2.13.3 | Software library to render fonts |
ghostscript | 10.03.1 | Interpreter for PostScript and PDF |
img2pdf | 0.5.1 | Convert images to PDF via direct JPEG inclusion |
jbig2enc | 0.29 | JBIG2 encoder (for monochrome documents) |
libheif | 1.18.2 | ISO/IEC 23008-12:2017 HEIF file format decoder and encoder |
libpng | 1.6.43 | Library for manipulating PNG images |
pillow | 10.4.0 | Friendly PIL fork (Python Imaging Library) |
pngquant | 3.0.3 | PNG image optimizing utility |
pybind11 | 2.13.5 | Seamless operability between C++11 and Python |
python@3.12 | 3.12.5 | Interpreted, interactive, object-oriented programming language |
qpdf | 11.9.1 | Tools for transforming and inspecting PDF files |
tesseract | 5.4.1 | OCR (Optical Character Recognition) engine |
unpaper | 7.0.0 | Post-processing for scanned/photocopied books |
Depends on when building from source:
pkg-config | 0.29.2 | Manage compile and link flags for libraries |
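One way to confirm that the command-line dependencies listed above are available after installation is to look for their executables on `PATH`. This is a sketch under assumptions: the mapping from formula names to executable names (for example `ghostscript` to `gs` and `jbig2enc` to `jbig2`) is not stated on this page, and the helper names `find_tools` and `missing_tools` are hypothetical. Library-only dependencies such as `pillow` or `libpng` are not covered.

```python
import shutil

# Assumption: executable names commonly provided by each formula.
EXECUTABLES = {
    "ghostscript": "gs",
    "jbig2enc": "jbig2",
    "pngquant": "pngquant",
    "qpdf": "qpdf",
    "tesseract": "tesseract",
    "unpaper": "unpaper",
}


def find_tools(executables=EXECUTABLES):
    """Map each formula name to the resolved executable path, or None."""
    return {formula: shutil.which(exe) for formula, exe in executables.items()}


def missing_tools(found):
    """Return the formula names whose executable was not found, sorted."""
    return sorted(name for name, path in found.items() if path is None)


found = find_tools()
for formula, path in found.items():
    print(f"{formula}: {path or 'not found'}")
```

An empty list from `missing_tools(found)` suggests that the external tools are reachable. It does not verify their versions.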
Analytics:
Installs (30 days) | |
---|---|
ocrmypdf | 2,408 |
ocrmypdf --HEAD | 5 |
Installs on Request (30 days) | |
ocrmypdf | 2,408 |
ocrmypdf --HEAD | 5 |
Build Errors (30 days) | |
ocrmypdf | 0 |
Installs (90 days) | |
ocrmypdf | 8,306 |
ocrmypdf --HEAD | 18 |
Installs on Request (90 days) | |
ocrmypdf | 8,306 |
ocrmypdf --HEAD | 18 |
Installs (365 days) | |
ocrmypdf | 34,313 |
ocrmypdf --HEAD | 58 |
Installs on Request (365 days) | |
ocrmypdf | 34,310 |
ocrmypdf --HEAD | 58 |
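The install counts above cover windows of different lengths, so they are easier to compare as daily averages. The following small sketch uses the `ocrmypdf` install figures from the table. The helper name `daily_average` is hypothetical.

```python
# Install counts for the `ocrmypdf` row, copied from the analytics table.
# Keys are window lengths in days.
INSTALLS = {30: 2408, 90: 8306, 365: 34313}


def daily_average(count: int, days: int) -> float:
    """Average installs per day over a window, rounded to one decimal."""
    return round(count / days, 1)


averages = {days: daily_average(count, days) for days, count in INSTALLS.items()}
for days, avg in averages.items():
    print(f"{days:>3} days: {avg} installs/day")
```

With these figures the averages come to about 80.3, 92.3, and 94.0 installs per day for the 30, 90, and 365 day windows respectively.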