Alibaba-NLP/ZeroSearch
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
Dr. Zero Self-Evolving Search Agents without Training Data
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
Simple RL training for reasoning
Minimal reproduction of DeepSeek R1-Zero
No description.
R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis
2 captures since 2026-05-23