Awesome List
Curated list of the best truly open-source AI projects, models, tools, and infrastructure.
GitHub stars and default-branch commits for alvinreal/awesome-opensource-ai.
767 repos currently saved from this list.
MaxKB is a powerful, easy-to-use open-source platform for building enterprise-grade agents.
Open standard for machine learning interoperability
RAG-Anything: All-in-One RAG Framework
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Temporal service
Minimalist ML framework for Rust
FinGPT: Open-Source Financial Large Language Models! We release the trained model on HuggingFace.
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.
OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
Kilo is the all-in-one agentic engineering platform. Build, ship, and iterate faster with the most popular open source coding agent.
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
A TTS model capable of generating ultra-realistic dialogue in one pass.
Development repository for the Triton language and compiler
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Open-source agentic AI data assistant for the next generation of AI + Data products.
State-of-the-Art Embeddings, Retrieval, and Reranking
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Train transformer language models with reinforcement learning.
Open source agentic operating system
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
Build reliable customer-facing AI agents with Parlant: an interaction control harness optimized for controlled, consistent, and predictable LLM interactions.
High-performance In-browser LLM Inference Engine
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.
Datasets, Transforms and Models specific to Computer Vision
AI Agent Framework, the Pydantic way
Leon is your open-source personal assistant.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations
CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
Network Analysis in Python
Your Personal AI Assistant; easy to install, deploy on your own machine or on the cloud; supports multiple chat apps with easily extensible capabilities.
Workflow Engine for Kubernetes
Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team.
Ongoing research training transformer models at scale
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
Concurrently chat with ChatGPT, Bing Chat, Bard, Alpaca, Vicuna, Claude, ChatGLM, MOSS, iFlytek Spark, ERNIE Bot and more, and discover the best answers
Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!
Computer Vision Annotation Tool (CVAT) is a leading platform for building high-quality visual datasets for vision AI. It offers open-source, cloud, and enterprise products, as well as labeling services, for image, video, and 3D annotation with AI-assisted labeling, quality assurance, team collaboration, analytics, and developer APIs.
The LLM Evaluation Framework
DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)
Machine Learning Toolkit for Kubernetes