ServiceNow/WorkArena
WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?
Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"
WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?
🌎💪 BrowserGym, a Gym environment for web task automation
A verified version of the WebArena Benchmark
[EMNLP 2025] WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning
[NeurIPS'25 D&B] Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
1 capture since 2026-05-30