huggingface/optimum
Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy-to-use hardware optimization tools
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
Fine-tune, build, and deploy open-source LLMs easily!
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, with automatic mixed precision (including fp8) and easy-to-configure FSDP and DeepSpeed support
Efficient Triton Kernels for LLM Training
Pretrain or finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.