Awesome List
Curated list of the best truly open-source AI projects, models, tools, and infrastructure.
GitHub stars and default-branch commits for alvinreal/awesome-opensource-ai.
767 repos currently saved from this list.
A distributed approximate nearest neighbor (ANN) search library that provides high-quality vector index build, search, and distributed online serving toolkits for large-scale vector search scenarios.
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
Time series forecasting with PyTorch
This repo contains the Hugging Face Deep Reinforcement Learning Course.
A blazing fast inference solution for text embeddings models
An Engine-Agnostic Deep Learning Framework in Java
Lightning ⚡️ fast forecasting with statistical and econometric models.
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
Amundsen is a metadata-driven application for improving the productivity of data analysts, data scientists, and engineers when interacting with data.
High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.
An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models
Relax! Flux is the ML library that doesn't make you tensor
Optimize prompts, code, and more with AI-powered Reflective Text Evolution
A Rust machine learning framework.
On-device AI across mobile, embedded and edge for PyTorch
Align Anything: Training All-modality Model with Feedback
Prompt Engineering | Prompt Versioning | Use GPT or other prompt-based models to get structured output. Join our Discord for prompt engineering, LLMs, and other recent research.
Peekaboo is a macOS CLI & optional MCP server that enables AI agents to capture screenshots of applications, or the entire system, with optional visual question answering through local or remote AI models.
HelixDB is an open-source graph-vector database built from scratch in Rust.
Notebooks using the Hugging Face libraries 🤗
A fast inference library for running LLMs locally on modern consumer-class GPUs
The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.
The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!
Fast inference engine for Transformer models
A fast multimodal LLM for real-time voice
Reliable Multi-Agent Orchestration Framework
LLM training code for Databricks foundation models
Pre-trained Deep Learning models and demos (high quality and extremely fast)
Webots Robot Simulator
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
A light-weight, flexible, and expressive statistical data testing library
Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make your own persistent autonomous agent on top!
Minimal CLI coding agent by Mistral
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
Reliable model swapping for any local OpenAI/Anthropic compatible server - llama.cpp, vllm, etc
An open source extension that connects AI agents to computational notebooks in JupyterLab.
[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
A repository for storing models that have been inter-converted between various frameworks. Supported frameworks are TensorFlow, PyTorch, ONNX, OpenVINO, TFJS, TFTRT, TensorFlowLite (Float32/16/INT8), EdgeTPU, CoreML.
Set of tools to assess and improve LLM security.
The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
AdalFlow: The library to build & auto-optimize LLM applications.
Machine Learning Pipelines for Kubeflow
🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust RAG pipelines
Optimizing inference proxy for LLMs
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
A nearly-live implementation of OpenAI's Whisper.