← Back to search

github Active AI dev

Repository profile

flashinfer-ai/flashinfer

FlashInfer: Kernel Library for LLM Serving

Python Apache-2.0 main Stack scanned README.md

Open website Open GitHub

Stars: 5,959
Forks: 1,154
Watchers: 52
Issues: 793
Commits: 2,552
Awesome lists: 1

Repository updates

Get generated flashinfer-ai/flashinfer development summaries by email, or follow the weekly and monthly RSS feeds.

Weekly RSS Monthly RSS

Activity and growth

Tracked growth, recent movement, and commit velocity from stored repository snapshots.

Latest capture 2026-07-15 03:11

Star growth, last 7 days: 0 0.0%
Commit velocity, last 7 days: 0 0.0%
Stars since baseline: +285
Snapshot coverage: 5

Tracked growth

5 captures since 2026-05-25

Stars from baseline +285

Time horizon

All tracked data

Custom start Custom end

Stars history

Total stars

Commits history

Default branch commits

Detected stack

Frameworks, package managers, ecosystems, and dependency manifests found during catalog scans.

Scanned 2026-07-15 03:11

Stack signals: 0
Package managers: 2
Manifest files: 5
Dependencies: 45

Frameworks and tools

No framework dependencies detected.

PEP 517 pip python

Dependency files

5 manifests

pyproject.toml python ecosystem, 4 dependencies
requirements.txt python ecosystem, 16 dependencies
docs/requirements.txt python ecosystem, 6 dependencies
flashinfer-cubin/pyproject.toml python ecosystem, 9 dependencies
flashinfer-jit-cache/pyproject.toml python ecosystem, 10 dependencies

Classification

Searchable topics, generated tags, and stack labels that explain where this repository fits.

Topics: 10
Tags: 0
Stacks: 0

Topics

#attention #cuda #distributed-inference #gpu #jit #large-large-models #llm-inference #moe #nvidia #pytorch

Generated tags

No generated tags yet.

Stack labels

No stack labels yet.

AI development signals

Agent instructions and tool configuration paths found in the repository tree.

10 paths

AI agent config detected

10 config paths 5 files 5 directories

Agent instructions Claude Code 9

Key config paths

dir .claude
file AGENTS.md
file CLAUDE.md

Review config paths

Claude Code .claude
Claude Code .claude/skills
Claude Code .claude/skills/add-cuda-kernel
Claude Code .claude/skills/add-cuda-kernel/SKILL.md
Claude Code .claude/skills/benchmark-kernel
Claude Code .claude/skills/benchmark-kernel/SKILL.md
Claude Code .claude/skills/debug-cuda-crash
Claude Code .claude/skills/debug-cuda-crash/SKILL.md
Agent instructions AGENTS.md
Claude Code CLAUDE.md

Similar repositories

Nearest indexed repositories by embedding similarity.

Tiiny-AI/PowerInfer

High-speed Large Language Model Serving for Local Deployment

9,636 stars

C++ 1 awesome list

Dao-AILab/flash-attention

Fast and memory-efficient exact attention

24,454 stars

Python 1 awesome list

vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

85,648 stars

Python 3 awesome lists

mit-han-lab/duo-attention

[ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

539 stars

Python 0 awesome lists

OptimalScale/LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

8,484 stars

Python 2 awesome lists

linkedin/Liger-Kernel

Efficient Triton Kernels for LLM Training

6,505 stars

Python 2 awesome lists

Metadata

Language: Python
License: Apache-2.0
Default branch: main
Created: 2023-07-22
First commit: 2023-07-22
Last pushed: 2026-07-15
GitHub updated: 2026-07-15
Last synced: 2026-07-15 03:11
Stack detected: 2026-07-15 03:11
Archived: no

Links and files

GitHub Website

https://flashinfer.ai

README

Appears in

Awesome Opensource Ai

flashinfer-ai/flashinfer

Activity and growth

Tracked growth

Time horizon

Stars history

Commits history

Detected stack

Frameworks and tools

Dependency files

Classification

Topics

Generated tags

Stack labels

AI development signals

Similar repositories

Tiiny-AI/PowerInfer

Dao-AILab/flash-attention

vllm-project/vllm

mit-han-lab/duo-attention

OptimalScale/LMFlow

linkedin/Liger-Kernel

Metadata

Links and files

Appears in

How it works

Pricing

Follow repository updates

Activity and growth

Tracked growth

Time horizon

Stars history

Commits history

Detected stack

Frameworks and tools

Dependency files

Classification

Topics

Generated tags

Stack labels

AI development signals

Similar repositories

Tiiny-AI/PowerInfer

Dao-AILab/flash-attention

vllm-project/vllm

mit-han-lab/duo-attention

OptimalScale/LMFlow

linkedin/Liger-Kernel

Metadata

Links and files

Appears in