Search awesome repositories

unslothai/unsloth

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, and gpt-oss locally.

Python #agent #deepseek #fine-tuning #gemma #gemma3 AI dev signals 3 awesome lists 5435 commits first commit 2023-11-29 3 history points updated 2026-05-29

★ 65,276

Website ↗ GitHub ↗

ray-project/ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python #data-science #deep-learning #deployment #distributed #hyperparameter-optimization AI dev signals 4 awesome lists 30467 commits 4 history points updated 2026-05-25

★ 42,668

Website ↗ GitHub ↗

sgl-project/sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python #attention #blackwell #cuda #deepseek #diffusion AI dev signals 4 awesome lists 13116 commits 3 history points updated 2026-05-25

★ 28,241

Website ↗ GitHub ↗

AI4Finance-Foundation/FinGPT

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

Jupyter Notebook FastAPI Gradio Jupyter LangChain npm pip #chatgpt #finance #fingpt #fintech #large-language-models 3 awesome lists 687 commits first commit 2023-02-11 4 history points updated 2026-06-01

★ 20,386

Website ↗ GitHub ↗

Unity-Technologies/ml-agents

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.

C# #deep-learning #deep-reinforcement-learning #machine-learning #neural-networks #reinforcement-learning 1 awesome list 3561 commits 1 history point updated 2026-05-20

★ 19,434

Website ↗ GitHub ↗

datawhalechina/leedl-tutorial

Hung-yi Lee Deep Learning Tutorial (recommended by Professor Hung-yi Lee 👍, known as the "Apple Book" 🍎). PDF download: https://github.com/datawhalechina/leedl-tutorial/releases

Jupyter Notebook #bert #chatgpt #cnn #deep-learning #diffusion 2 awesome lists 608 commits first commit 2019-04-04 2 history points updated 2025-11-23

★ 16,584

GitHub ↗

DLR-RM/stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python pytest PEP 517 pip #baselines #gsde #gym #machine-learning #openai 5 awesome lists 930 commits first commit 2019-09-05 8 history points updated 2026-05-11

★ 13,367

Website ↗ GitHub ↗

Farama-Foundation/Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python #api #gym #reinforcement-learning 3 awesome lists 2584 commits 3 history points updated 2026-05-14

★ 11,943

Website ↗ GitHub ↗

vwxyzjn/cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python #a2c #actor-critic #advantage-actor-critic #ale #atari 1 awesome list 843 commits first commit 2019-06-07 3 history points updated 2026-04-20

★ 9,869

Website ↗ GitHub ↗

OpenRLHF/OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python #large-language-models #proximal-policy-optimization #raylib #reinforcement-learning #reinforcement-learning-from-human-feedback 2 awesome lists 1538 commits 1 history point updated 2026-05-15

★ 9,545

Website ↗ GitHub ↗

VowpalWabbit/vowpal_wabbit

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.

C++ CMake .NET SDK npm #active-learning #c-plus-plus #contextual-bandits #cpp #learning-to-search 2 awesome lists 10552 commits first commit 2009-04-29 12 history points updated 2026-05-08

★ 8,681

Website ↗ GitHub ↗

lucidrains/PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python #artificial-intelligence #attention-mechanisms #deep-learning #human-feedback #reinforcement-learning 1 awesome list 1 history point updated 2025-10-11

★ 7,865

GitHub ↗

tensorlayer/TensorLayer

Deep Learning and Reinforcement Learning Library for Scientists and Engineers

Python #a3c #artificial-intelligence #chatbot #deep-learning #dqn 1 awesome list 3353 commits first commit 2016-06-07 3 history points updated 2023-02-18

★ 7,389

Website ↗ GitHub ↗

tensorpack/tensorpack

A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility

Python #deep-learning #machine-learning #neural-networks #reinforcement-learning #tensorflow 1 awesome list 2944 commits first commit 2015-12-25 3 history points updated 2023-08-06

★ 6,291

GitHub ↗

rllm-org/rllm

Democratizing Reinforcement Learning for LLMs

Python #agent-framework #agentic-workflow #coding-agent #distributed-training #llm-reasoning AI dev signals 1 awesome list 1841 commits 1 history point updated 2026-05-22

★ 5,563

Website ↗ GitHub ↗

keras-rl/keras-rl

Deep Reinforcement Learning for Keras.

Python #keras #machine-learning #neural-networks #reinforcement-learning #tensorflow 1 awesome list 308 commits first commit 2016-07-02 3 history points updated 2023-09-17

★ 5,556

Website ↗ GitHub ↗

google-deepmind/open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

C++ #cpp #games #multiagent #python #reinforcement-learning 1 awesome list 5412 commits 1 history point updated 2026-05-23

★ 5,237

GitHub ↗

InternLM/xtuner

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python #agent #deepseek-v3 #gpt-oss #intern-s1 #internvl AI dev signals 1 awesome list 1184 commits 1 history point updated 2026-05-25

★ 5,137

Website ↗ GitHub ↗

huggingface/deep-rl-class

This repo contains the Hugging Face Deep Reinforcement Learning Course.

MDX #deep-learning #deep-reinforcement-learning #reinforcement-learning #reinforcement-learning-excercises 2 awesome lists 1150 commits 1 history point updated 2026-04-17

★ 4,886

GitHub ↗

CarperAI/trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python #machine-learning #pytorch #reinforcement-learning 1 awesome list 349 commits first commit 2020-03-27 1 history point updated 2024-01-08

★ 4,748

GitHub ↗

google-deepmind/acme

A library of reinforcement learning components and agents

Python #agents #reinforcement-learning #research AI dev signals 1 awesome list 1224 commits first commit 2020-05-15 3 history points updated 2026-04-08

★ 3,989

GitHub ↗

polyaxon/polyaxon

Open Source AI Infra & Engineering Control Plane

#agents #artificial-intelligence #data-science #deep-learning #harness AI dev signals 3 awesome lists 10366 commits 2 history points updated 2026-04-26

★ 3,706

Website ↗ GitHub ↗

opendilab/DI-engine

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Python #atari #distributed-reinforcement-learning #distributed-system #drl #exploration-exploitation 2 awesome lists 858 commits first commit 2021-07-08 3 history points updated 2025-12-07

★ 3,618

Website ↗ GitHub ↗

Farama-Foundation/PettingZoo

An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities

Python #api #gym #gymnasium #multi-agent-reinforcement-learning #multiagent-reinforcement-learning 1 awesome list 4631 commits first commit 2020-01-21 3 history points updated 2026-05-27

★ 3,425

Website ↗ GitHub ↗

aiming-lab/MetaClaw

🦞 Just talk to your agent — it learns and EVOLVES 🧬.

Python Celery FastAPI Flask pytest npm PEP 517 pip #agent #ai-agent #continual-learning #fine-tuning #llm AI dev signals 1 awesome list 188 commits first commit 2026-03-09 2 history points updated 2026-05-23

★ 3,423

Website ↗ GitHub ↗

catalyst-team/catalyst

Accelerated deep learning R&D

Python #computer-vision #deep-learning #distributed-computing #image-classification #image-processing 2 awesome lists 1698 commits first commit 2018-08-20 3 history points updated 2025-06-27

★ 3,376

Website ↗ GitHub ↗

tensorforce/tensorforce

Tensorforce: a TensorFlow library for applied reinforcement learning

Python #control #deep-reinforcement-learning #reinforcement-learning #system-control #tensorflow 1 awesome list 2118 commits first commit 2016-09-29 3 history points updated 2026-05-09

★ 3,307

GitHub ↗

SkyworkAI/Skywork-R1V

Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning.

Python #deepseek-r1 #grpo #llm #multimodal-r1 #multimodal-understanding 1 awesome list 280 commits 1 history point updated 2025-12-15

★ 3,161

Website ↗ GitHub ↗

tensorflow/agents

TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

Python #bandits #contextual-bandits #dqn #multi-armed-bandits #reinforcement-learning 1 awesome list 2322 commits first commit 2018-11-06 3 history points updated 2026-01-16

★ 3,011

GitHub ↗

DLR-RM/rl-baselines3-zoo

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Python #deep-reinforcement-learning #gym #hyperparameter-optimization #hyperparameter-search #hyperparameter-tuning 1 awesome list 401 commits 1 history point updated 2026-04-23

★ 2,807

Website ↗ GitHub ↗
