huggingface/nanotron

Minimalistic large language model 3D-parallelism training

  • Stars: 2,699
  • Forks: 309
  • Commits: 1,269
  • Language: Python
  • Awesome lists: 2

Similar repositories

NVIDIA/Megatron-LM

Ongoing research training transformer models at scale

16,443 stars
Python, 2 awesome lists

InternLM/xtuner

A Next-Generation Training Engine Built for Ultra-Large MoE Models

5,137 stars
Python, 1 awesome list

NVIDIA/Model-Optimizer

A unified library of SOTA model optimization techniques such as quantization, pruning, distillation, and speculative decoding. It compresses deep learning models for downstream deployment frameworks such as TensorRT-LLM, TensorRT, and vLLM to optimize inference speed.

2,759 stars
Python, 1 awesome list

karpathy/nanochat

The best ChatGPT that $100 can buy.

54,267 stars
Python, 2 awesome lists

bobazooba/xllm

🦖 X—LLM: Cutting Edge & Easy LLM Finetuning

408 stars
Python, 1 awesome list

pytorch/torchtitan

A PyTorch native platform for training generative AI models

5,378 stars
Python, 2 awesome lists

Tracked growth

1 capture since 2026-05-25

Latest capture 2026-05-25 21:00

[Chart: Stars history (total stars)]

[Chart: Commits history (default branch commits)]

Metadata

  • Created: 2023-09-11
  • First commit: —
  • Last pushed: 2026-04-07
  • Archived: no
  • Stack detected: —
  • License: Apache-2.0

AI development signals

AI agent config detected

8 config paths (6 files, 2 directories), all for Cursor

Key config paths

  • dir .cursor
Review config paths
  • Cursor .cursor
  • Cursor .cursor/rules
  • Cursor .cursor/rules/performance-optimization.mdc
  • Cursor .cursor/rules/philosophy.mdc
  • Cursor .cursor/rules/pipeline-parallelism.mdc
  • Cursor .cursor/rules/project-overview.mdc
  • Cursor .cursor/rules/tensor-parallelism.mdc
  • Cursor .cursor/rules/troubleshooting.mdc