hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Making large AI models cheaper, faster and more accessible
Ongoing research training transformer models at scale
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Minimalistic large language model 3D-parallelism training
A Next-Generation Training Engine Built for Ultra-Large MoE Models
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
2 captures since 2026-05-25
AI agent config detected
Key config paths
AGENTS.md
CLAUDE.md