DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
OpenClaw-RL: Train any agent simply by talking
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
Reinforcement Learning in PyTorch
Simple RL training for reasoning
An offline deep reinforcement learning library