← Back to search

github Active AI dev

Repository profile

vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python Apache-2.0 main Stack scanned README.md

Open website Open GitHub

Stars: 85,648
Forks: 19,091
Watchers: 577
Issues: 5,586
Commits: 18,535
Awesome lists: 3

Repository updates

Get generated vllm-project/vllm development summaries by email, or follow the weekly and monthly RSS feeds.

Weekly RSS Monthly RSS

Activity and growth

Tracked growth, recent movement, and commit velocity from stored repository snapshots.

Latest capture 2026-07-08 03:06

Star growth, last 7 days: No 7-day history
Commit velocity, last 7 days: No 7-day history
Stars since baseline: +4902
Snapshot coverage: 6

Tracked growth

6 captures since 2026-05-22

Stars from baseline +4902

Time horizon

All tracked data

Custom start Custom end

Stars history

Total stars

Commits history

Default branch commits

Detected stack

Frameworks, package managers, ecosystems, and dependency manifests found during catalog scans.

Scanned 2026-07-08 03:06

Stack signals: 4
Package managers: 5
Manifest files: 44
Dependencies: 2,476

Frameworks and tools

Axum web framework · high confidence
FastAPI web framework · high confidence
pytest test framework · high confidence
Starlette web framework · high confidence

Cargo CMake PEP 517 pip uv c-cpp python rust

Dependency files

44 manifests

CMakeLists.txt c-cpp ecosystem, 1 dependency
pyproject.toml python ecosystem, 9 dependencies
setup.py python ecosystem, 22 dependencies
requirements/common.txt python ecosystem, 58 dependencies
requirements/cpu.txt python ecosystem, 8 dependencies
requirements/cuda.txt python ecosystem, 17 dependencies
requirements/dev.txt python ecosystem, 0 dependencies
requirements/docs.txt python ecosystem, 59 dependencies
36 more files

Classification

Searchable topics, generated tags, and stack labels that explain where this repository fits.

Topics: 20
Tags: 0
Stacks: 4

Topics

#amd #blackwell #cuda #deepseek #deepseek-v3 #gpt #gpt-oss #inference #kimi #llama #llm #llm-serving #model-serving #moe #openai #pytorch #qwen #qwen3 #tpu #transformer

Generated tags

No generated tags yet.

Stack labels

Axum FastAPI pytest Starlette

AI development signals

Agent instructions and tool configuration paths found in the repository tree.

11 paths

AI agent config detected

11 config paths 7 files 4 directories

Agent instructions 2 Claude Code 6 Codex Gemini CLI 2

Key config paths

1 more config path detected.

Review config paths

Claude Code .claude
Claude Code .claude/skills
Claude Code .claude/skills/ci-fails-buildkite
Claude Code .claude/skills/ci-fails-buildkite/SKILL.md
Gemini CLI .gemini
Gemini CLI .gemini/config.yaml
Agent instructions AGENTS.md
Claude Code CLAUDE.md
Codex docs/serving/integrations/codex.md
Agent instructions rust/AGENTS.md
Claude Code rust/CLAUDE.md

Similar repositories

Nearest indexed repositories by embedding similarity.

llm-d/llm-d

Achieve state of the art inference performance with modern accelerators on Kubernetes

3,800 stars

Shell 1 awesome list

jd-opensource/xllm

A high-performance inference engine for LLM, VLM, DiT and REC models, optimized for diverse AI accelerators.

1,370 stars

C++ 1 awesome list

xLLM-AI/xllm

A high-performance inference engine for LLM, VLM, DiT and REC models, optimized for diverse AI accelerators. It is hosted in OpenAtom Foundation.

1,474 stars

C++ 0 awesome lists

InternLM/lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

7,954 stars

Python 2 awesome lists

alibaba/rtp-llm

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

1,269 stars

Cuda 1 awesome list

sgl-project/sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

30,349 stars

Python 4 awesome lists

Metadata

Language: Python
License: Apache-2.0
Default branch: main
Created: 2023-02-09
First commit: 2023-02-09
Last pushed: 2026-07-08
GitHub updated: 2026-07-08
Last synced: 2026-07-08 03:06
Stack detected: 2026-07-08 03:06
Archived: no

Links and files

GitHub Website

https://vllm.ai

README

vllm-project/vllm

Activity and growth

Tracked growth

Time horizon

Stars history

Commits history

Detected stack

Frameworks and tools

Dependency files

Classification

Topics

Generated tags

Stack labels

AI development signals

Similar repositories

llm-d/llm-d

jd-opensource/xllm

xLLM-AI/xllm

InternLM/lmdeploy

alibaba/rtp-llm

sgl-project/sglang

Metadata

Links and files

Appears in

How it works

Pricing

Follow repository updates

Activity and growth

Tracked growth

Time horizon

Stars history

Commits history

Detected stack

Frameworks and tools

Dependency files

Classification

Topics

Generated tags

Stack labels

AI development signals

Similar repositories

llm-d/llm-d

jd-opensource/xllm

xLLM-AI/xllm

InternLM/lmdeploy

alibaba/rtp-llm

sgl-project/sglang

Metadata

Links and files

Appears in