huggingface/nanotron
Minimalistic large language model 3D-parallelism training
A Next-Generation Training Engine Built for Ultra-Large MoE Models
Ongoing research training transformer models at scale
Go ahead and axolotl questions
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations
OpenMMLab Foundational Library for Training Deep Learning Models
1 capture since 2026-05-25
AI agent config detected
Key config paths
.agents
.claude
.codex/AGENTS.md
docs/zh_cn/legacy/chat/agent.md
.agents
.agents/skills
.claude
.claude/CLAUDE.md
.claude/rules
.claude/rules/datasets_architecture.md
.claude/skills
.claude/skills/model_normalize
.claude/skills/model_normalize/convert_interns2_to_fp8.sh
.claude/skills/model_normalize/heuristics.py
.claude/skills/model_normalize/hf_to_fp8.py
.claude/skills/model_normalize/model_normalize.py
.claude/skills/model_normalize/repack_hf.py
.claude/skills/model_normalize/SKILL.md
.claude/skills/model_patch
.claude/skills/model_patch/model_patch.py
.claude/skills/model_patch/SKILL.md
.claude/skills/sphinx-debug
.claude/skills/sphinx-debug/SKILL.md
.claude/skills/xtuner-sync-supported-models
.claude/skills/xtuner-sync-supported-models/scripts
.claude/skills/xtuner-sync-supported-models/scripts/scan_model_configs.py
.claude/skills/xtuner-sync-supported-models/SKILL.md
.codex/AGENTS.md
Showing the first 24 paths. 1 more detected.