Sign in
← Back to search

kvcache-ai/ktransformers

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Stars
17,195
Forks
1,294
Commits
1275
Language
Python
Awesome lists
1

Similar repositories

OpenNMT/CTranslate2

Fast inference engine for Transformer models

4494 stars
C++ 1 awesome list

NVIDIA/Megatron-LM

Ongoing research training transformer models at scale

16443 stars
Python 2 awesome lists

NVIDIA/Model-Optimizer

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

2759 stars
Python 1 awesome list

InternLM/xtuner

A Next-Generation Training Engine Built for Ultra-Large MoE Models

5137 stars
Python 1 awesome list

huggingface/transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

160951 stars
Python 5 awesome lists

Tracked growth

1 capture since 2026-05-25

Latest capture 2026-05-25 21:02

Stars history

Total stars

Commits history

Default branch commits

Metadata

AI development signals

No AI development config files detected.