Open highlighted repo slot
Put your repository first
Promote a GitHub repo at the top of Awesome repository list views for 7 days.
Awesome-list intelligence for GitHub
Discover projects curated by awesome-list maintainers, then narrow them by stars, age, freshness, archive status, language, topics, generated tags, detected stacks, package managers, and source list.
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
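Free, open-source, offline OCR software. Supports screenshot capture and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and QR code scanning and generation. Includes built-in libraries for multiple languages.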
A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.
A community-supported supercharged document management system: scan, index and archive all your documents
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic.
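A concise and elegant dictionary and translator macOS app for looking up words and translating text. Works out of the box, with offline OCR and support for Youdao Dictionary, 🍎 Apple System Dictionary, 🍎 Apple System Translation, OpenAI, Gemini, DeepL, Google, Bing, Tencent, Baidu, Alibaba, NiuTrans, Caiyun, and Volcano translation.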
OCR & Document Extraction using vision models
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
Open Source Document Management System for Digital Archives (Scanned Documents)
Lightweight document management system packed with all the features you can expect from big expensive solutions
Use LLMs and LLM Vision (OCR) to handle paperless-ngx - Document Digitalization powered by AI
Helps organize your piles of documents from scanners, e-mails, and other sources with minimal effort.
A tool for converting a wallpaper's color scheme or palette, with OCR using VLMs (traditional and hybrid), image compression, color palette extraction, image upscaling with adversarial networks, and more image processing features.
Copy any text on your screen, stop retyping.
ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.
A self-hosted file conversion server & share tool that supports 445 file formats in 13 languages.
Open-source screenshot and screen recording for macOS. The free, native alternative to CleanShot X. Built with Swift 6.0 and SwiftUI.
Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)
A lightweight macOS selection translator built with pure Swift 6, featuring on-device Apple Translate for privacy, only 5MB install size, and stable ~50MB memory usage when running in the background.
🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠 LLM.
Papermerge DMS core backend, REST API server, and frontend UI
I, Librarian - open-source version of a PDF managing SaaS.
OpenRewind is a fully open-source, privacy-first alternative to rewind.ai. With OpenRewind, you can easily access your digital history, enhancing your memory and productivity without compromising your privacy.
Minimalist macOS OCR tool. Open-source, privacy-first, and built with SwiftUI.
Clipboard Manager for macOS
Screen translation app for macOS — select any area, get instant translation. On-device by default with Apple Vision OCR + Apple Translation.
MCP server for video analysis — extracts transcripts, key frames, OCR text, and metadata from video URLs. Supports Loom and direct video files.
The world's largest library of translated ancient texts. AI-powered OCR and translation of Renaissance and early modern manuscripts.
Always-on context for AI agents: screen capture, OCR, voice, MCP server. Local-first, AGPL-3.0.