Open highlighted repo slot
Put your repository first
Promote a GitHub repo at the top of Awesome repository list views for 7 days.
Awesome-list intelligence for GitHub
Discover projects curated by awesome-list maintainers, then narrow them by stars, age, freshness, archive status, language, topics, generated tags, detected stacks, package managers, and source list.
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Fast, flexible LLM inference
Efficient Triton Kernels for LLM Training
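NoneLinear (非线智能) - ReLE evaluation: a continuously updated capability benchmark for Chinese AI large models. It currently covers 374 models, including commercial models such as chatgpt, gpt-5.4, Google gemini-3.1-pro, Claude-4.6, Wenxin ERNIE-X1.1, ERNIE-5.0, qwen3.6-max, qwen3.6-plus, Baichuan, iFlytek Spark, and SenseTime senseChat, as well as open-source models such as step3.5-flash, kimi-k2.6, ernie4.5, MiniMax-M2.7, deepseek-v4, Qwen3.6, llama4, Zhipu GLM-5.1, MiMo-V2, LongCat, gemma4, and mistral. Besides a leaderboard, it provides a large-model defect library with more than 2 million entries, so the community can study, analyze, and improve large models.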
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress of the agents and the final answer. No OpenAI or Google API keys are needed.
A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy-to-use API.
A PyTorch native platform for training generative AI models
A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.
A blazing fast inference solution for text embeddings models
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
Prompt Engineering | Prompt Versioning | Use GPT or other prompt-based models to get structured output. Join our Discord for Prompt-Engineering, LLMs and other latest research
No description.
AdalFlow: The library to build & auto-optimize LLM applications.
Harness LLMs with Multi-Agent Programming
The platform for LLM evaluations and AI agent testing
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).
The hub for EleutherAI's work on interpretability and learning dynamics
Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali
Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparent evaluation of foundation models, including large language models (LLMs) and multimodal models.
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
Minimalistic large language model 3D-parallelism training
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
Seamlessly integrate LLMs as Python functions
OpenAGI: When LLM Meets Domain Experts
Mesh TensorFlow: Model Parallelism Made Easier
MemFree - Hybrid AI Search Engine & AI Page Generator