← Back to search

github Active

Repository profile

PKU-Alignment/safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python Apache-2.0 main Stack scanned README.md

Open website Open GitHub

Stars: 1,611
Forks: 133
Watchers: 15
Issues: 18
Commits: 111
Awesome lists: 1

Repository updates

Get generated PKU-Alignment/safe-rlhf development summaries by email, or follow the weekly and monthly RSS feeds.

Weekly RSS Monthly RSS

Activity and growth

Tracked growth, recent movement, and commit velocity from stored repository snapshots.

Latest capture 2026-07-15 03:16

Star growth, last 7 days: 0 0.0%
Commit velocity, last 7 days: 0 0.0%
Stars since baseline: +7
Snapshot coverage: 5

Tracked growth

5 captures since 2026-05-25

Stars from baseline +7

Time horizon

All tracked data

Custom start Custom end

Stars history

Total stars

Commits history

Default branch commits

Detected stack

Frameworks, package managers, ecosystems, and dependency manifests found during catalog scans.

Scanned 2026-07-15 03:16

Stack signals: 0
Package managers: 2
Manifest files: 3
Dependencies: 0

Frameworks and tools

No framework dependencies detected.

PEP 517 pip python

Dependency files

3 manifests

pyproject.toml python ecosystem, 0 dependencies
requirements.txt python ecosystem, 0 dependencies
setup.py python ecosystem, 0 dependencies

Classification

Searchable topics, generated tags, and stack labels that explain where this repository fits.

Topics: 20
Tags: 0
Stacks: 0

Topics

#ai-safety #alpaca #beaver #datasets #deepspeed #gpt #large-language-models #llama #llm #llms #reinforcement-learning #reinforcement-learning-from-human-feedback #rlhf #safe-reinforcement-learning #safe-reinforcement-learning-from-human-feedback #safe-rlhf #safety #transformer #transformers #vicuna

Generated tags

No generated tags yet.

Stack labels

No stack labels yet.

AI development signals

Agent instructions and tool configuration paths found in the repository tree.

0 paths

No AI development config files detected.

Similar repositories

Nearest indexed repositories by embedding similarity.

vwxyzjn/cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

10,087 stars

Python 1 awesome list

jianzhnie/LLamaTuner

Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.

620 stars

Python 1 awesome list

hkust-nlp/simpleRL-reason

Simple RL training for reasoning

3,868 stars

Python 1 awesome list

RUCAIBox/R1-Searcher

R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

720 stars

Python 1 awesome list

DLR-RM/stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

13,568 stars

Python 5 awesome lists

OptimalScale/LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

8,484 stars

Python 2 awesome lists

Metadata

Language: Python
License: Apache-2.0
Default branch: main
Created: 2023-05-15
First commit: 2023-05-15
Last pushed: 2025-11-24
GitHub updated: 2026-07-14
Last synced: 2026-07-15 03:16
Stack detected: 2026-07-15 03:16
Archived: no

Links and files

GitHub Website

https://pku-beaver.github.io

README

403 Forbidden | https://api.github.com/repos/PKU-Alignment/safe-rlhf/readme | message=API rate limit exceeded for user ID 260990068. If you reach out to GitHub Support for help, please include the request ID E734:29F22F:DEE3997:D40C72A:6A56FBA3 and timestamp 2026-07-15 03:16:51 UTC. For more on scraping GitHub and how it may affect your rights, please review our Terms of Service (htt | rate_limit_remaining=0 | rate_limit_reset=1784088007

Appears in

Awesome Opensource Ai

PKU-Alignment/safe-rlhf

Activity and growth

Tracked growth

Time horizon

Stars history

Commits history

Detected stack

Frameworks and tools

Dependency files

Classification

Topics

Generated tags

Stack labels

AI development signals

Similar repositories

vwxyzjn/cleanrl

jianzhnie/LLamaTuner

hkust-nlp/simpleRL-reason

RUCAIBox/R1-Searcher

DLR-RM/stable-baselines3

OptimalScale/LMFlow

Metadata

Links and files

Appears in

How it works

Pricing

Follow repository updates

Activity and growth

Tracked growth

Time horizon

Stars history

Commits history

Detected stack

Frameworks and tools

Dependency files

Classification

Topics

Generated tags

Stack labels

AI development signals

Similar repositories

vwxyzjn/cleanrl

jianzhnie/LLamaTuner

hkust-nlp/simpleRL-reason

RUCAIBox/R1-Searcher

DLR-RM/stable-baselines3

OptimalScale/LMFlow

Metadata

Links and files

Appears in