Open highlighted repo slot
Put your repository first
Promote a GitHub repo at the top of Awesome repository list views for 7 days.
Awesome List
Curated list of the best truly open-source AI projects, models, tools, and infrastructure.
GitHub stars and default-branch commits for alvinreal/awesome-opensource-ai.
767 repos currently saved from this list.
Open highlighted repo slot
Promote a GitHub repo at the top of Awesome repository list views for 7 days.
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
Harness LLMs with Multi-Agent Programming
Latitude is the open-source agent engineering platform
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.
Official repository for the Boltz biomolecular interaction models
One beautiful Ruby API for OpenAI, Anthropic, Gemini, Bedrock, Azure, OpenRouter, DeepSeek, Ollama, VertexAI, Perplexity, Mistral, xAI, GPUStack & OpenAI compatible APIs. Agents, Chat, Vision, Audio, PDF, Images, Embeddings, Tools, Streaming & Rails integration.
A minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling.
The Python Risk Identification Tool for generative AI (PyRIT) is an open source framework built to empower security professionals and engineers to proactively identify risks in generative AI systems.
Simple RL training for reasoning
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
RTAB-Map library and standalone application
Maestro: Netflix’s Workflow Orchestrator
A full-stack AI Red Teaming platform securing AI ecosystems via OpenClaw Security Scan, Agent Scan, Skills Scan, MCP scan, AI Infra scan and LLM jailbreak evaluation.
A retargetable MLIR-based machine learning compiler and runtime toolkit.
Simple, safe way to store and distribute tensors
A system for agentic LLM-powered data processing and ETL
This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks
Generative models for conditional audio generation
Open Source AI Infra & Engineering Control Plane
A flexible, high-performance 3D simulator for Embodied AI research.
The best OSS video generation models, created by Genmo
Input OpenAPI. Output SDKs and Docs.
The official Python client for the Hugging Face Hub.
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Fully customizable AI chatbot component for your website
Open-source security automation platform for teams and AI agents
The most accurate document search and store for building AI apps
Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-metric spaces.
OpenAgents - AI Agent Networks for Open Collaboration
AI Agent that handles engineering tasks end-to-end: integrates with developers’ tools, plans, executes, and iterates until it achieves a successful result.
Quickly and accurately render even the largest data.
Superfast AI decision making and intelligent processing of multi-modal data.
⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)
Synthetic data generation for tabular data
Heterogeneous GPU Sharing on Kubernetes
A library for mechanistic interpretability of GPT-style language models
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Personal AI Notebooks. Organize files & webpages and generate notes from them. Open source, local & open data, open model choice (incl. local).
Qwen3.6 is the large language model series developed by Qwen team, Alibaba Group.
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/
DeepResearchAgent is a hierarchical multi-agent system designed not only for deep research tasks but also for general-purpose task solving. The framework leverages a top-level planning agent to coordinate multiple specialized lower-level agents, enabling automated task decomposition and efficient execution across diverse and complex domains.
🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction of AlphaFold 2
II-Agent: a new open-source framework to build and deploy intelligent agents
Evaluation and Tracking for LLM Experiments and AI Agents
MTEB: Massive Text Embedding Benchmark
Build production-ready AI agents in both Python and Typescript.
Achieve state of the art inference performance with modern accelerators on Kubernetes
🕹️ Open-source, developer-first LLMOps platform designed to streamline prompt design, version management, instant delivery, collaboration, troubleshooting, observability and more.
JAX-based neural network library