Sign in
← Back to search

raphael-seo/Versatile-OCR-Program

Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)

Stars
683
Forks
49
Commits
65
Language
Python
Awesome lists
0

Similar repositories

junhoyeo/BetterOCR

🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠 LLM.

636 stars
Python 1 awesome list

allenai/olmocr

Toolkit for linearizing PDFs for LLM datasets/training

17353 stars
Python 1 awesome list

JaidedAI/EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

29529 stars
Python 1 awesome list

PDFMathTranslate/PDFMathTranslate

[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero

34114 stars
Python 2 awesome lists

Tracked growth

1 capture since 2026-06-02

Latest capture 2026-06-02 07:15

Stars history

Total stars

Commits history

Default branch commits

Metadata

  • Created: 2025-04-01
  • First commit: 2025-04-01
  • Last pushed: 2026-05-13
  • Archived: no
  • Stack detected: 2026-06-02 07:15
  • License: NOASSERTION

AI development signals

No AI development config files detected.

Appears in

  • No awesome list links recorded.