bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
QKeras: a quantization deep learning library for Tensorflow Keras
Accessible large language models via k-bit quantization for PyTorch.
PyTorch native quantization and sparsity for training and inference
Qiskit is an open-source SDK for working with quantum computers at the level of extended quantum circuits, operators, and primitives.
QLoRA: Efficient Finetuning of Quantized LLMs
A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.
A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations
3 captures since 2026-05-22