rllm-org/rllm
Democratizing Reinforcement Learning for LLMs
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Democratizing Reinforcement Learning for LLMs
Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.
Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates.
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
1 capture since 2026-05-27