NVIDIA-NeMo/RL
Scalable toolkit for efficient model reinforcement
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Scalable toolkit for efficient model reinforcement
Minimalistic large language model 3D-parallelism training
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Ongoing research training transformer models at scale
A framework for few-shot evaluation of language models.
The hub for EleutherAI's work on interpretability and learning dynamics
2 captures since 2026-05-25