vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
An offline deep reinforcement learning library
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Deep Reinforcement Learning for Keras.
Reinforcement Learning in PyTorch
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
3 captures since 2026-05-22