Open highlighted repo slot
Put your repository first
Promote a GitHub repo at the top of Awesome repository list views for 7 days.
GitHub projects from awesome lists
Search names, descriptions, topics, tags, and stacks, then tune results by ecosystem, freshness, health, and cross-list signal.
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparent evaluation of foundation models, including large language models (LLMs) and multimodal models.
pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tidb.ai
A library for debugging/inspecting machine learning classifiers and explaining their predictions
A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.
Ruler — apply the same rules to all coding agents
Minimalistic large language model 3D-parallelism training
Probabilistic programming with NumPy powered by JAX for autograd and JIT compilation to GPU/TPU/CPU.
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
nanoflann: a C++11 header-only library for Nearest Neighbor (NN) search with KD-trees
llama.cpp fork with additional SOTA quants and improved performance
The memory-first coding agent
Modular Python framework for AI agents and workflows with chain-of-thought reasoning, tools, and memory.
Kubernetes-native Job Queueing
Maid is a free and open source application for interfacing with llama.cpp models locally, and with Anthropic, DeepSeek, Ollama, Mistral and OpenAI models remotely.
Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows that encode lineage/tracing and metadata. Runs and scales everywhere Python does.
[NeurIPS 2025] OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs), capable of generating complex and detailed SVGs, from simple icons to intricate anime characters.
Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. 🚀💻 Integrates with 50+ LLM Providers, VectorDBs, Agent Frameworks and GPUs.
An Open Standard for lineage metadata collection
🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
Machine learning metrics for distributed, scalable PyTorch applications.
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
PentestAgent is an AI agent framework for black-box security testing, supporting bug bounty, red-team, and penetration testing workflows.
SDG is a specialized framework designed to generate high-quality structured tabular data.
Data Contracts engine for the modern data stack. https://www.soda.io
Bionic is an on-premise replacement for ChatGPT, offering the advantages of Generative AI while maintaining strict data confidentiality
Python & JS/TS SDK for running AI-generated code and code interpreting in your AI app
Fully open data curation for reasoning models
Feature engineering and selection open-source Python library compatible with sklearn.