
bitsandbytes-foundation/bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Stars: 8,225
Forks: 855
Commits: 1,151
Language: Python
Awesome lists: 1

Similar repositories

pytorch/ao

PyTorch native quantization and sparsity for training and inference

2833 stars
Python, 1 awesome list

NVIDIA/Model-Optimizer

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

2759 stars
Python, 1 awesome list

artidoro/qlora

QLoRA: Efficient Finetuning of Quantized LLMs

10914 stars
Jupyter Notebook, 1 awesome list

jianzhnie/LLamaTuner

Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM, Falcon) Efficient quantized training and deployment for large models.

620 stars
Python, 1 awesome list

Tiiny-AI/PowerInfer

High-speed Large Language Model Serving for Local Deployment

9494 stars
C++, 1 awesome list

OptimalScale/LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

8486 stars
Python, 2 awesome lists

Tracked growth

1 capture since 2026-05-25

Latest capture 2026-05-25 20:52

Stars history (chart: total stars)

Commits history (chart: default branch commits)

Metadata

AI development signals

AI agent config detected

1 config path, 1 file, 0 directories
Claude Code

Key config paths

  • file CLAUDE.md