Sign in
← Back to search
Stars
1,604
Forks
132
Commits
111
Language
Python
Awesome lists
1

Similar repositories

vwxyzjn/cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

9869 stars
Python 1 awesome list

jianzhnie/LLamaTuner

Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.

620 stars
Python 1 awesome list

RUCAIBox/R1-Searcher

R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

713 stars
Python 1 awesome list

DLR-RM/stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

13335 stars
Python 5 awesome lists

OptimalScale/LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

8486 stars
Python 2 awesome lists

Tracked growth

1 capture since 2026-05-25

Latest capture 2026-05-25 21:14

Stars history

Total stars

Commits history

Default branch commits

Metadata

  • Created: 2023-05-15
  • First commit: —
  • Last pushed: 2025-11-24
  • Website: https://pku-beaver.github.io
  • Archived: no
  • Stack detected: —
  • License: Apache-2.0

AI development signals

No AI development config files detected.