Sign in
← Back to search

CarperAI/trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Stars
4,748
Forks
484
Commits
349
Language
Python
Awesome lists
1

Similar repositories

huggingface/trl

Train transformer language models with reinforcement learning.

18462 stars
Python 2 awesome lists

vwxyzjn/cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

9869 stars
Python 1 awesome list

astooke/rlpyt

Reinforcement Learning in PyTorch

2275 stars
Python 1 awesome list

rllm-org/rllm

Democratizing Reinforcement Learning for LLMs

5563 stars
Python 1 awesome list

lucidrains/PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

7865 stars
Python 1 awesome list

Tracked growth

1 capture since 2026-05-27

Latest capture 2026-05-27 12:30

Stars history

Total stars

Commits history

Default branch commits

Metadata

  • Created: 2022-10-03
  • First commit: 2020-03-27
  • Last pushed: 2024-01-08
  • Archived: no
  • Stack detected: —
  • License: MIT

AI development signals

No AI development config files detected.