CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Train transformer language models with reinforcement learning.
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Agentic RL Training at Scale
TensorFlow Reinforcement Learning
Reinforcement Learning in PyTorch
No description.
slime is an LLM post-training framework for RL Scaling.
3 captures since 2026-05-22
AI agent config detected
Key config paths
.ai/AGENTS.md
.cursor
AGENTS.md
CLAUDE.md