Awesome List
Curated list of the best truly open-source AI projects, models, tools, and infrastructure.
GitHub stars and default-branch commits for alvinreal/awesome-opensource-ai.
767 repos currently saved from this list.
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ clouds, on-prem).
An MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
AI Observability & Evaluation
An Open-Source Asynchronous Coding Agent
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
A unified framework for machine learning with time series
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
A lightweight, lightning-fast, in-process vector database
cuDF - GPU DataFrame Library
A collection of pre-trained, state-of-the-art models in the ONNX format
Bayesian Modeling and Probabilistic Programming in Python
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)
Flexible and powerful tensor operations for readable and reliable code (for PyTorch, JAX, TF and others)
High-speed Large Language Model Serving for Local Deployment
Containers for machine learning
A Python library for user-friendly forecasting and anomaly detection on time series.
Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting a wide range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image Matting, 3D Segmentation, etc.
AutoML library for deep learning
Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.
AI Agent Engineering Platform built on an Open Source TypeScript AI Agent Framework
Easily fine-tune, evaluate and deploy Gemma 4, Qwen3.5, Qwen3.6, gpt-oss, DeepSeek-R1, or any open source LLM / VLM!
Autonomous AI development loop for Claude Code with intelligent exit detection
🚀 Next Gen Multi-tenant AI One-Stop Solution. Built-in Admin & Billing System. Enterprise-Grade Unified LLM Gateway Support for 200+ Models And 35+ Providers, Load Balancing w/ Priority-based Routing, Cost Management, Chat Share, Cloud Sync, Credit/Subscription Billing, All File Parsing, Web Search, Built-in Model Cache.
Deeplake is an AI Data Runtime for Agents. It provides serverless Postgres with a multimodal datalake, enabling scalable retrieval and training.
A CLI that writes your git commit messages for you with AI
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
ModelScope: bring the notion of Model-as-a-Service to life.
Apache Iceberg
An AI-powered search engine with a generative UI
Simple, Elastic-quality search for Postgres
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
An Application Framework for AI Engineering
KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge bases. It can effectively overcome the shortcomings of the traditional RAG vector similarity calculation model.
🧙 Build, run, and manage data pipelines for integrating and transforming data.
Kimi Code CLI is your next CLI agent.
Cybersecurity AI (CAI), the framework for AI Security
Multi-platform SDK for integrating GitHub Copilot Agent into apps and services
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
Apache Beam is a unified programming model for Batch and Streaming data processing.
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
🤖 AI Gateway | AI Native API Gateway
Build effective agents using Model Context Protocol and simple workflow patterns
Supercharge Your LLM with the Fastest KV Cache Layer
Accessible large language models via k-bit quantization for PyTorch.
AI Toolkit for Healthcare Imaging