GitHub projects from awesome lists
Search names, descriptions, topics, tags, and stacks, then tune results by ecosystem, freshness, health, and cross-list signal.
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
SGLang is a high-performance serving framework for large language models and multimodal models.
FinGPT: Open-Source Financial Large Language Models! The trained model is released on Hugging Face.
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
Hung-yi Lee's Deep Learning Tutorial (recommended by Prof. Hung-yi Lee 👍, also known as the "Apple Book" 🍎). PDF download: https://github.com/datawhalechina/leedl-tutorial/releases
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)
Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.
Implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM.
Deep Learning and Reinforcement Learning Library for Scientists and Engineers
A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility
Democratizing Reinforcement Learning for LLMs
Deep Reinforcement Learning for Keras.
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
A Next-Generation Training Engine Built for Ultra-Large MoE Models
This repo contains the Hugging Face Deep Reinforcement Learning Course.
A repo for distributed training of language models with Reinforcement Learning from Human Feedback (RLHF)
A library of reinforcement learning components and agents
Open Source AI Infra & Engineering Control Plane
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
🦞 Just talk to your agent — it learns and EVOLVES 🧬.
Accelerated deep learning R&D
Tensorforce: a TensorFlow library for applied reinforcement learning
Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning.
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.