pat-jj/s3
[EMNLP'25] s3 - ⚡ Efficient & Effective Search Agent Training via RL for RAG (RLVR for Search with Minimal Data)
No description.
[EMNLP'25] s3 - ⚡ Efficient & Effective Search Agent Training via RL for RAG (RLVR for Search with Minimal Data)
A school for camelids
No description.
AI agents running research on single-GPU nanochat training automatically
A collection of one-click self-hosted AI
Scaling Deep Research via Reinforcement Learning in Real-world Environments.
2 captures since 2026-05-23