Sign in
← Back to search

lucidrains/PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Stars
7,865
Forks
676
Commits
Language
Python
Awesome lists
1

Similar repositories

rllm-org/rllm

Democratizing Reinforcement Learning for LLMs

5563 stars
Python 1 awesome list

conceptofmind/LaMDA-rlhf-pytorch

Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.

468 stars
Python 2 awesome lists

EgoAlpha/prompt-in-context-learning

Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates.

2236 stars
Jupyter Notebook 2 awesome lists

OptimalScale/LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

8486 stars
Python 2 awesome lists

FranxYao/chain-of-thought-hub

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

2774 stars
Jupyter Notebook 2 awesome lists

CarperAI/trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

4748 stars
Python 1 awesome list

Tracked growth

1 capture since 2026-05-27

Latest capture 2026-05-27 12:48

Stars history

Total stars

Commits history

Default branch commits

Metadata

  • Created: 2022-12-09
  • First commit: —
  • Last pushed: 2025-10-11
  • Archived: no
  • Stack detected: —
  • License: MIT

AI development signals

No AI development config files detected.

Appears in