  • Stars: 81,308
  • Forks: 17,368
  • Commits: 17,066
  • Language: Python
  • Awesome lists: 3

Similar repositories

LMCache/LMCache

Supercharge Your LLM with the Fastest KV Cache Layer

8344 stars
Python 1 awesome list

llm-d/llm-d

Achieve state of the art inference performance with modern accelerators on Kubernetes

3250 stars
Shell 1 awesome list

jd-opensource/xllm

A high-performance inference engine for LLM, VLM, DiT and REC models, optimized for diverse AI accelerators.

1300 stars
C++ 1 awesome list

InternLM/lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

7873 stars
Python 2 awesome lists

alibaba/rtp-llm

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

1130 stars
Cuda 1 awesome list

Tracked growth

3 captures since 2026-05-22

Latest capture 2026-05-29 03:05

Stars history (chart: total stars)

Commits history (chart: default branch commits)

Metadata

  • Created: 2023-02-09
  • First commit: 2023-02-09
  • Last pushed: 2026-05-29
  • Website: https://vllm.ai
  • Archived: no
  • Stack detected: —
  • License: Apache-2.0

AI development signals

AI agent config detected

7 config paths: 6 files, 1 directory
Agent instructions: 2 · Claude Code: 2 · Codex: 1 · Gemini CLI: 2

Key config paths

  • dir .gemini
  • file AGENTS.md
  • file CLAUDE.md
  • file docs/serving/integrations/codex.md
  • file rust/AGENTS.md
  • file rust/CLAUDE.md

Review config paths

  • Gemini CLI: .gemini
  • Gemini CLI: .gemini/config.yaml
  • Agent instructions: AGENTS.md
  • Claude Code: CLAUDE.md
  • Codex: docs/serving/integrations/codex.md
  • Agent instructions: rust/AGENTS.md
  • Claude Code: rust/CLAUDE.md