Open highlighted repo slot
Put your repository first
Promote a GitHub repo at the top of Awesome repository list views for 7 days.
Awesome List
Curated list of the best truly open-source AI projects, models, tools, and infrastructure.
GitHub stars and default-branch commits for alvinreal/awesome-opensource-ai.
767 repos currently saved from this list.
Open highlighted repo slot
Promote a GitHub repo at the top of Awesome repository list views for 7 days.
Official inference framework for 1-bit LLMs
Extremely fast Query Engine for DataFrames, written in Rust
DuckDB is an analytical in-process SQL database management system
ToolJet is the open-source foundation of ToolJet AI - the enterprise app generation platform for building internal tools, dashboard, business applications, workflows and AI agents 🚀
Enhanced ChatGPT Clone: Features Agents, MCP, DeepSeek, Anthropic, AWS, OpenAI, Responses API, Azure, Groq, o1, GPT-5, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active.
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Convert PDF to markdown + JSON quickly with high accuracy
10 Weeks, 20 Lessons, Data Science for All!
Cross-platform, customizable ML solutions for live and streaming media.
In-depth tutorials on LLMs, RAGs and real-world AI agent applications.
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
DSPy: The framework for programming—not prompting—language models
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
LLM API 管理 & 分发系统,支持 OpenAI、Azure、Anthropic Claude、Google Gemini、DeepSeek、字节豆包、ChatGLM、文心一言、讯飞星火、通义千问、360 智脑、腾讯混元等主流模型,统一 API 适配,可用于 key 管理与二次分发。单可执行文件,提供 Docker 镜像,一键部署,开箱即用。LLM API management & key redistribution system, unifying multiple providers under a single API. Single binary, Docker-ready, with an English UI.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
💫 Industrial-strength Natural Language Processing (NLP) in Python
Self-hosted AI coding assistant
⏩ Source-controlled AI checks, enforceable in CI. Powered by the open-source Continue CLI
A modular graph-based Retrieval-Augmented Generation (RAG) system
ChatDev 2.0: Dev All through LLM-powered Multi-Agent Collaboration
Build resilient agents.
You like pytorch? You like micrograd? You love tinygrad! ❤️
📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG
The fundamental package for scientific computing with Python.
Conductor is an event driven agentic workflow engine providing durable and highly resilient execution engine for applications and AI Agents
The Frontend Stack for Agents & Generative UI. React + Angular. Makers of the AG-UI Protocol
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
Lightpanda: the headless browser designed for AI and automation
SOTA Open Source TTS
Open Source AI Platform - AI Chat with advanced features that works with every LLM
An AI prompt optimizer for writing better prompts and getting better AI results.
Open-Sora: Democratizing Efficient Video Production for All
No description.
Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.
Composio powers 1000+ toolkits, tool search, context management, authentication, and a sandboxed workbench to help you build AI agents that turn intent into action.
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
LLM Frontend for Power Users.
SGLang is a high-performance serving framework for large language models and multimodal models.
Search infrastructure for AI
FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without the need for extensive setup or configuration.
The fastai deep learning library
Integrate cutting-edge LLM technology quickly and easily into your apps
🤗 smolagents: a barebones library for agents that think in code.
Label Studio is a multi-type data labeling and annotation tool with standardized output format
An autonomous agent that conducts deep research on any data using any LLM providers
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.