hkust-nlp/simpleRL-reason

Simple RL training for reasoning

  • Stars: 3,856
  • Forks: 289
  • Commits: 60
  • Language: Python
  • Awesome lists: 1

Similar repositories

RUCAIBox/R1-Searcher

R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

713 stars
Python, 1 awesome list

vwxyzjn/cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

9,869 stars
Python, 1 awesome list

Alibaba-NLP/ZeroSearch

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

1,274 stars
Python, 1 awesome list

FreedomIntelligence/LLMZoo

⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡

2,947 stars
Python, 1 awesome list

Jiayi-Pan/TinyZero

Minimal reproduction of DeepSeek R1-Zero

13,112 stars
Python, 2 awesome lists

Tracked growth

1 capture since 2026-05-25

Latest capture 2026-05-25 21:00

Stars history (chart: total stars)

Commits history (chart: default branch commits)

Metadata

  • Created: 2025-01-25
  • First commit: —
  • Last pushed: 2025-12-23
  • Archived: no
  • Stack detected: —
  • License: MIT

AI development signals

No AI development config files detected.