huggingface/nanotron
Minimalistic large language model 3D-parallelism training
Ongoing research training transformer models at scale
Minimalistic large language model 3D-parallelism training
A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.
A Next-Generation Training Engine Built for Ultra-Large MoE Models
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations
OpenMMLab Foundational Library for Training Deep Learning Models
1 capture since 2026-05-25
AI agent config detected
Key config paths
.agents
.claude
.coderabbit.yaml
.cursorrules
AGENTS.md
CLAUDE.md
1 more config path detected.
.agents
.agents/skills
.claude
.claude/settings.json
.claude/skills
.coderabbit.yaml
.cursorrules
AGENTS.md
CLAUDE.md
greptile.json