Sign in
← Back to search

NVIDIA-NeMo/RL

Scalable toolkit for efficient model reinforcement

Stars
1,654
Forks
394
Commits
943
Language
Python
Awesome lists
1

Similar repositories

EleutherAI/gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

7432 stars
Python 3 awesome lists

modelscope/ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, Phi4, ...) (AAAI 2025).

14255 stars
Python 1 awesome list

OpenRLHF/OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

9545 stars
Python 2 awesome lists

huggingface/nanotron

Minimalistic large language model 3D-parallelism training

2699 stars
Python 2 awesome lists

hiyouga/LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

71580 stars
Python 1 awesome list

Tracked growth

1 capture since 2026-05-25

Latest capture 2026-05-25 21:10

Stars history

Total stars

Commits history

Default branch commits

Metadata

AI development signals

AI agent config detected

42 config paths 28 files 14 directories
Agent instructions Agent workspace 23 Claude Code 18

Key config paths

  • dir .agents
  • dir .claude
  • file AGENTS.md
  • file CLAUDE.md
Review config paths
  • Agent workspace .agents
  • Agent workspace .agents/contributor-skills
  • Agent workspace .agents/contributor-skills/build-and-dependency
  • Agent workspace .agents/contributor-skills/build-and-dependency/SKILL.md
  • Agent workspace .agents/contributor-skills/cicd
  • Agent workspace .agents/contributor-skills/cicd/SKILL.md
  • Agent workspace .agents/contributor-skills/config-conventions
  • Agent workspace .agents/contributor-skills/config-conventions/SKILL.md
  • Agent workspace .agents/contributor-skills/contributing
  • Agent workspace .agents/contributor-skills/contributing/SKILL.md
  • Agent workspace .agents/contributor-skills/copyright
  • Agent workspace .agents/contributor-skills/copyright/SKILL.md
  • Agent workspace .agents/contributor-skills/error-handling
  • Agent workspace .agents/contributor-skills/error-handling/SKILL.md
  • Agent workspace .agents/contributor-skills/linting-and-formatting
  • Agent workspace .agents/contributor-skills/linting-and-formatting/SKILL.md
  • Agent workspace .agents/contributor-skills/review-pr
  • Agent workspace .agents/contributor-skills/review-pr/advanced.md
  • Agent workspace .agents/contributor-skills/review-pr/SKILL.md
  • Agent workspace .agents/contributor-skills/session-memory
  • Agent workspace .agents/contributor-skills/session-memory/SKILL.md
  • Agent workspace .agents/contributor-skills/testing
  • Agent workspace .agents/contributor-skills/testing/SKILL.md
  • Claude Code .claude

Showing the first 24 paths. 18 more detected.