Sign in
← Back to search

Tiiny-AI/PowerInfer

High-speed Large Language Model Serving for Local Deployment

Stars
9,494
Forks
575
Commits
1594
Language
C++
Awesome lists
1

Similar repositories

OptimalScale/LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

8486 stars
Python 2 awesome lists

jianzhnie/LLamaTuner

Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.

620 stars
Python 1 awesome list

sgl-project/sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

28241 stars
Python 4 awesome lists

artidoro/qlora

QLoRA: Efficient Finetuning of Quantized LLMs

10914 stars
Jupyter Notebook 1 awesome list

hpcaitech/ColossalAI

Making large AI models cheaper, faster and more accessible

41386 stars
Python 4 awesome lists

Tracked growth

1 capture since 2026-05-25

Latest capture 2026-05-25 21:18

Stars history

Total stars

Commits history

Default branch commits

Metadata

  • Created: 2023-12-15
  • First commit: —
  • Last pushed: 2026-05-11
  • Archived: no
  • Stack detected: —
  • License: MIT

AI development signals

No AI development config files detected.