Gen-Verse/OpenClaw-RL
OpenClaw-RL: Train any agent simply by talking
slime is an LLM post-training framework for RL Scaling.
Democratizing Reinforcement Learning for LLMs
SGLang is a high-performance serving framework for large language models and multimodal models.
Agentic RL Training at Scale
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Simple RL training for reasoning
1 capture since 2026-05-25
AI agent config detected
Key config paths
.agents
.agents/skills
.claude
.claude/skills
.claude/skills/add-dynamic-filter
.claude/skills/add-dynamic-filter/SKILL.md
.claude/skills/add-eval-dataset-config
.claude/skills/add-eval-dataset-config/SKILL.md
.claude/skills/add-reward-function
.claude/skills/add-reward-function/SKILL.md
.claude/skills/add-rollout-function
.claude/skills/add-rollout-function/SKILL.md
.claude/skills/add-tests-and-ci
.claude/skills/add-tests-and-ci/SKILL.md