NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Minimalistic large language model 3D-parallelism training
Ongoing research training transformer models at scale
A Next-Generation Training Engine Built for Ultra-Large MoE Models
A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.
The best ChatGPT that $100 can buy.
🦖 X—LLM: Cutting Edge & Easy LLM Finetuning
A PyTorch native platform for training generative AI models
1 capture since 2026-05-25
AI agent config detected
Key config paths
.cursor
.cursor
.cursor/rules
.cursor/rules/performance-optimization.mdc
.cursor/rules/philosophy.mdc
.cursor/rules/pipeline-parallelism.mdc
.cursor/rules/project-overview.mdc
.cursor/rules/tensor-parallelism.mdc
.cursor/rules/troubleshooting.mdc