Sign in
← Back to search
Stars
7,873
Forks
699
Commits
1884
Language
Python
Awesome lists
2

Similar repositories

vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

81308 stars
Python 3 awesome lists

llm-d/llm-d

Achieve state of the art inference performance with modern accelerators on Kubernetes

3250 stars
Shell 1 awesome list

mlc-ai/mlc-llm

Universal LLM Deployment Engine with ML Compilation

22709 stars
Python 1 awesome list

jd-opensource/xllm

A high-performance inference engine for LLM, VLM, DiT and REC models, optimized for diverse AI accelerators.

1300 stars
C++ 1 awesome list

NVIDIA/TensorRT-LLM

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

13725 stars
Python 2 awesome lists

Tracked growth

1 capture since 2026-05-25

Latest capture 2026-05-25 21:01

Stars history

Total stars

Commits history

Default branch commits

Metadata

AI development signals

AI agent config detected

7 config paths 3 files 4 directories
Claude Code 7

Key config paths

  • dir .claude
  • file CLAUDE.md
Review config paths
  • Claude Code .claude
  • Claude Code .claude/skills
  • Claude Code .claude/skills/docker-build
  • Claude Code .claude/skills/docker-build/SKILL.md
  • Claude Code .claude/skills/support-new-model
  • Claude Code .claude/skills/support-new-model/SKILL.md
  • Claude Code CLAUDE.md