Sign in
← Back to search

Dao-AILab/flash-attention

Fast and memory-efficient exact attention

Stars
23,905
Forks
2,756
Commits
1491
Language
Python
Awesome lists
1

Similar repositories

certik/fastGPT

Fast GPT-2 inference written in Fortran

201 stars
Fortran 1 awesome list

kvcache-ai/ktransformers

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

17195 stars
Python 1 awesome list

pytorch/ao

PyTorch native quantization and sparsity for training and inference

2833 stars
Python 1 awesome list

OptimalScale/LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

8486 stars
Python 2 awesome lists

Tiiny-AI/PowerInfer

High-speed Large Language Model Serving for Local Deployment

9494 stars
C++ 1 awesome list

Tracked growth

1 capture since 2026-05-25

Latest capture 2026-05-25 20:54

Stars history

Total stars

Commits history

Default branch commits

Metadata

  • Created: 2022-05-19
  • First commit: —
  • Last pushed: 2026-05-24
  • Archived: no
  • Stack detected: —
  • License: BSD-3-Clause

AI development signals

AI agent config detected

2 config paths 2 files 0 directories
Agent instructions Claude Code

Key config paths

  • file AGENTS.md
  • file CLAUDE.md