vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Universal LLM Deployment Engine with ML Compilation
OpenMMLab Foundational Library for Training Deep Learning Models
A high-performance inference engine for LLM, VLM, DiT and REC models, optimized for diverse AI accelerators.
Achieve state of the art inference performance with modern accelerators on Kubernetes
Run any open-source LLM, such as DeepSeek or Llama, as an OpenAI-compatible API endpoint in the cloud.
1 capture since 2026-05-25
AI agent config detected
Key config paths
.claude
.github/copilot-instructions.md
.claude
.claude/skills
.claude/skills/impl-jit-kernel
.claude/skills/impl-jit-kernel/SKILL.md
.claude/skills/install-pymllm
.claude/skills/install-pymllm/SKILL.md
.claude/skills/link-pymllm-lib
.claude/skills/link-pymllm-lib/SKILL.md
.claude/skills/update-codeowners
.claude/skills/update-codeowners/SKILL.md
.github/copilot-instructions.md