Sign in
← Back to search
Stars
4,494
Forks
489
Commits
2257
Language
C++
Awesome lists
1

Similar repositories

kvcache-ai/ktransformers

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

17195 stars
Python 1 awesome list

NVIDIA/Model-Optimizer

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

2759 stars
Python 1 awesome list

NVIDIA/Megatron-LM

Ongoing research training transformer models at scale

16443 stars
Python 2 awesome lists

certik/fastGPT

Fast GPT-2 inference written in Fortran

201 stars
Fortran 1 awesome list

Tracked growth

1 capture since 2026-05-25

Latest capture 2026-05-25 21:13

Stars history

Total stars

Commits history

Default branch commits

Metadata

AI development signals

No AI development config files detected.