huggingface/trl
Train transformer language models with reinforcement learning.
TensorFlow Reinforcement Learning
Train transformer language models with reinforcement learning.
Deep learning library featuring a higher-level API for TensorFlow.
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Deep Reinforcement Learning for Keras.
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
Reinforcement Learning in PyTorch
3 captures since 2026-05-22