← Back to search

github Active

Repository profile

ocrmypdf/OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Python MPL-2.0 main Stack scanned README.md

Open website Open GitHub

Stars: 33,959
Forks: 2,343
Watchers: 189
Issues: 101
Commits: 4,377
Awesome lists: 1

Repository updates

Get generated ocrmypdf/OCRmyPDF development summaries by email, or follow the weekly and monthly RSS feeds.

Weekly RSS Monthly RSS

Activity and growth

Tracked growth, recent movement, and commit velocity from stored repository snapshots.

Latest capture 2026-06-24 13:17

Star growth, last 7 days: 0 0.0%
Commit velocity, last 7 days: 0 0.0%
Stars since baseline: 0
Snapshot coverage: 1

Tracked growth

1 capture since 2026-06-24

Stars from baseline 0

Time horizon

All tracked data

Custom start Custom end

Stars history

Total stars

Commits history

Default branch commits

Detected stack

Frameworks, package managers, ecosystems, and dependency manifests found during catalog scans.

Scanned 2026-06-24 13:17

Stack signals: 2
Package managers: 1
Manifest files: 2
Dependencies: 38

Frameworks and tools

pytest test framework · high confidence
Streamlit app framework · high confidence

uv python

Dependency files

2 manifests

pyproject.toml python ecosystem, 38 dependencies
uv.lock python ecosystem, 0 dependencies

Classification

Searchable topics, generated tags, and stack labels that explain where this repository fits.

Topics: 5
Tags: 0
Stacks: 2

Topics

#image-processing #ocr #pdf #python #tesseract

Generated tags

No generated tags yet.

Stack labels

pytest Streamlit

AI development signals

Agent instructions and tool configuration paths found in the repository tree.

0 paths

No AI development config files detected.

Similar repositories

Nearest indexed repositories by embedding similarity.

raphael-seo/Versatile-OCR-Program

Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)

682 stars

Python 0 awesome lists

JaidedAI/EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

29,647 stars

Python 1 awesome list

The fastest PDF library for Python and Rust. Text extraction, image extraction, markdown conversion, PDF creation & editing. 0.8ms mean, 5× faster than industry leaders, 100% pass rate on 3,830 PDFs. MIT/Apache-2.0.

846 stars

Rust 2 awesome lists

pikepdf/pikepdf

A Python library for reading and writing PDF, powered by QPDF

2,746 stars

Python 1 awesome list

AryanBV/pdf-toolkit-mcp

Write-capable PDF toolkit for any MCP client: 22 tools to read, create, render, encrypt, and transform PDFs. Vision rendering for scans, form-preserving merge and split, AES-256, zero native dependencies.

7 stars

TypeScript 1 awesome list

icereed/paperless-gpt

Use LLMs and LLM Vision (OCR) to handle paperless-ngx - Document Digitalization powered by AI

2,442 stars

Go 2 awesome lists

Metadata

Language: Python
License: MPL-2.0
Default branch: main
Created: 2013-12-20
First commit: 2013-04-09
Last pushed: 2026-06-22
GitHub updated: 2026-06-24
Last synced: 2026-06-24 13:17
Stack detected: 2026-06-24 13:17
Archived: no

Links and files

GitHub Website

http://ocrmypdf.readthedocs.io/

README

Appears in

Awesome Python Applications

ocrmypdf/OCRmyPDF

Activity and growth

Tracked growth

Time horizon

Stars history

Commits history

Detected stack

Frameworks and tools

Dependency files

Classification

Topics

Generated tags

Stack labels

AI development signals

Similar repositories

raphael-seo/Versatile-OCR-Program

JaidedAI/EasyOCR

yfedoseev/pdf_oxide

pikepdf/pikepdf

AryanBV/pdf-toolkit-mcp

icereed/paperless-gpt

Metadata

Links and files

Appears in

How it works

Pricing

Follow repository updates

Activity and growth

Tracked growth

Time horizon

Stars history

Commits history

Detected stack

Frameworks and tools

Dependency files

Classification

Topics

Generated tags

Stack labels

AI development signals

Similar repositories

raphael-seo/Versatile-OCR-Program

JaidedAI/EasyOCR

yfedoseev/pdf_oxide

pikepdf/pikepdf

AryanBV/pdf-toolkit-mcp

icereed/paperless-gpt

Metadata

Links and files

Appears in