ocrmypdf/OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Repository profile
A web interface to extract tabular data from PDFs
Repository updates
Get generated camelot-dev/excalibur development summaries by email, or follow the weekly and monthly RSS feeds.
Sign in to subscribe by email. RSS feeds are public.
Sign in to subscribeTracked growth, recent movement, and commit velocity from stored repository snapshots.
Latest capture 2026-06-24 13:05
1 capture since 2026-06-24
Stars from baseline 0
All tracked data
Frameworks, package managers, ecosystems, and dependency manifests found during catalog scans.
Scanned 2026-06-24 13:05
pyproject.toml
python ecosystem,
14 dependencies
setup.cfg
python ecosystem,
0 dependencies
uv.lock
python ecosystem,
71 dependencies
Searchable topics, generated tags, and stack labels that explain where this repository fits.
Agent instructions and tool configuration paths found in the repository tree.
Nearest indexed repositories by embedding similarity.
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
:books: Web app for browsing, reading and downloading eBooks stored in a Calibre database
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
Virtual whiteboard for sketching hand-drawn like diagrams
A tool to detect whether a PDF has a bad redaction
Read and analyze SEC EDGAR filings in Python. 10-K, 8-K, XBRL financials, Form 3/4/5, 13F, ADV — clean API, well-typed, MIT-licensed.