huggingface/nanotron
Minimalistic large language model 3D-parallelism training
TensorFlow-based neural network library
Build Graph Nets in TensorFlow
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Supercharge Your Model Training
Scalable toolkit for efficient model reinforcement
A Data Streaming Library for Efficient Neural Network Training