Open highlighted repo slot
Put your repository first
Promote a GitHub repo at the top of Awesome repository list views for 7 days.
Awesome List
Curated list of the best truly open-source AI projects, models, tools, and infrastructure.
GitHub stars and default-branch commits for alvinreal/awesome-opensource-ai.
767 repos currently saved from this list.
Open highlighted repo slot
Promote a GitHub repo at the top of Awesome repository list views for 7 days.
A natural language interface for computers
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Autonomous coding agent as an SDK, IDE extension, or CLI assistant.
Scrapy, a fast high-level web crawling & scraping framework for Python.
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.
The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.
Get your documents ready for gen AI
Clone a voice in 5 seconds to generate arbitrary speech in real-time
The simplest, fastest repository for training/finetuning medium-sized GPTs.
AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods
A programming framework for agentic AI
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.
Ultralytics YOLO 🚀
Universal memory layer for AI Agents
Interact with your documents using the power of GPT, 100% privately, no data leaks
No fortress, purely open ground. OpenManus is Coming.
Build AI Agents, Visually
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
🧠「大模型」2小时完全从0训练64M的小参数LLM!Train a 64M-parameter LLM from scratch in just 2h!
LlamaIndex is the leading document agent and OCR platform
🔥🔥🔥 Open-source Jira, Linear, Monday, and ClickUp alternative. Plane is a modern project management platform to manage tasks, sprints, docs, and triage.
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, VLLM, NVIDIA NIM]
Focus on prompting and generating
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
12 Weeks, 24 Lessons, AI for All!
Learn how to develop, deploy and iterate on production-grade ML applications.
Open-Source Frontier Voice AI
Open-source desktop app for local LLMs. Text, vision, tool-calling, OpenAI/Anthropic-compatible API. 100% private.
LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.
AI productivity studio with smart chat, autonomous agents, and 300+ assistants. Unified access to frontier LLMs
an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM
aider is AI pair programming in your terminal
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
🆙 Upscayl - #1 Free and Open Source AI Image Upscaler for Linux, MacOS and Windows.
Run frontier AI locally.
Streamlit — A faster way to build and share data apps.
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, including supervised learning, market dynamics modeling, and RL, and is now equipped with https://github.com/microsoft/RD-Agent to automate R&D process.
Apache Spark - A unified analytics engine for large-scale data processing
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Making large AI models cheaper, faster and more accessible
Build, run, and manage agent platforms.
Powerful AI Client
A library for efficient similarity search and clustering of dense vectors.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
A generative speech model for daily dialogue.