PrimeIntellect-ai/verifiers
Our library for RL environments + evals
Agentic RL Training at Scale
Our library for RL environments + evals
slime is an LLM post-training framework for RL Scaling.
OpenClaw-RL: Train any agent simply by talking
Simple RL training for reasoning
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Go ahead and axolotl questions
1 capture since 2026-05-25
AI agent config detected
Key config paths
.claude
.cursor
AGENTS.md
CLAUDE.md
.claude
.claude/skills
.cursor
AGENTS.md
CLAUDE.md