vwxyzjn/cleanrl
High-quality single-file implementations of deep reinforcement learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
Evaluate and improve models and agents using environments
Reinforcement Learning in PyTorch
Agentic RL Training at Scale