Adds an OCR text layer to scanned PDF files
https://ocrmypdf.readthedocs.io/en/latest/
License: MPL-2.0
Formula JSON API: /api/formula/ocrmypdf.json
Formula code: ocrmypdf.rb on GitHub
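The JSON API path listed above can be read programmatically. The following is a minimal sketch, not an official client. It assumes the path is served from the `formulae.brew.sh` host and that the decoded document exposes `name`, `versions.stable`, and `dependencies` keys; none of this is stated on this page. The helper names `fetch_formula` and `summarize` are illustrative only.

```python
import json
from urllib.request import urlopen

# Assumption: the API path is served from this host; this page only gives the path.
BASE_URL = "https://formulae.brew.sh"


def fetch_formula(name: str) -> dict:
    """Download and decode the formula JSON document (needs network access)."""
    with urlopen(f"{BASE_URL}/api/formula/{name}.json") as response:
        return json.load(response)


def summarize(formula: dict) -> str:
    """Build a one-line summary from an already-decoded formula document."""
    # Assumed keys: "name", "versions" -> "stable", and "dependencies".
    stable = formula.get("versions", {}).get("stable", "unknown")
    deps = formula.get("dependencies", [])
    return f"{formula.get('name', '?')} {stable} ({len(deps)} dependencies)"
```

Under those assumptions, `summarize(fetch_formula("ocrmypdf"))` would return a short string with the formula name, its stable version, and the number of declared dependencies. Keeping the download separate from the summary lets the parsing be checked against a saved copy of the JSON without network access.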
Bottle (binary package) installation support provided for:
| Platform | OS | Bottle |
|---|---|---|
| Apple Silicon | sonoma | ✅ |
| Apple Silicon | ventura | ✅ |
| Apple Silicon | monterey | ✅ |
| Intel | sonoma | ✅ |
| Intel | ventura | ✅ |
| Intel | monterey | ✅ |
| 64-bit linux | | ✅ |
Current versions:
stable | ✅ | 16.1.2 |
Depends on:
| Formula | Version | Description |
|---|---|---|
| cryptography | 42.0.5 | Cryptographic recipes and primitives for Python |
| freetype | 2.13.2 | Software library to render fonts |
| ghostscript | 10.03.0 | Interpreter for PostScript and PDF |
| img2pdf | 0.5.1 | Convert images to PDF via direct JPEG inclusion |
| jbig2enc | 0.29 | JBIG2 encoder (for monochrome documents) |
| libpng | 1.6.43 | Library for manipulating PNG images |
| pillow | 10.2.0 | Friendly PIL fork (Python Imaging Library) |
| pngquant | 3.0.3 | PNG image optimizing utility |
| pybind11 | 2.12.0 | Seamless operability between C++11 and Python |
| python@3.12 | 3.12.2 | Interpreted, interactive, object-oriented programming language |
| qpdf | 11.9.0 | Tools for transforming and inspecting PDF files |
| tesseract | 5.3.4 | OCR (Optical Character Recognition) engine |
| unpaper | 7.0.0 | Post-processing for scanned/photocopied books |
Analytics:
| Metric | Formula | Count |
|---|---|---|
| Installs (30 days) | ocrmypdf | 2,229 |
| Installs (30 days) | ocrmypdf --HEAD | 4 |
| Installs on Request (30 days) | ocrmypdf | 2,229 |
| Installs on Request (30 days) | ocrmypdf --HEAD | 4 |
| Build Errors (30 days) | ocrmypdf | 0 |
| Installs (90 days) | ocrmypdf | 7,760 |
| Installs (90 days) | ocrmypdf --HEAD | 10 |
| Installs on Request (90 days) | ocrmypdf | 7,760 |
| Installs on Request (90 days) | ocrmypdf --HEAD | 10 |
| Installs (365 days) | ocrmypdf | 30,964 |
| Installs (365 days) | ocrmypdf --HEAD | 40 |
| Installs on Request (365 days) | ocrmypdf | 30,961 |
| Installs on Request (365 days) | ocrmypdf --HEAD | 40 |