RUCAIBox/R1-Searcher
R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
No description.
Simple RL training for reasoning
SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis
Dr. Zero Self-Evolving Search Agents without Training Data
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning
2 captures since 2026-05-23