
ServiceNow/webarena-verified

A verified version of the WebArena Benchmark

Stars: 38
Forks: 4
Commits: 22
Language: Python
Awesome lists: 1

Similar repositories

ServiceNow/WorkArena

WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?

251 stars
Python 1 awesome list

web-arena-x/webarena

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

1489 stars
Python 2 awesome lists

ServiceNow/BrowserGym

🌎💪 BrowserGym, a Gym environment for web task automation

1228 stars
Python 1 awesome list

claw-eval/claw-eval

Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.

609 stars
Python 2 awesome lists

Tracked growth

1 capture since 2026-05-25

Latest capture 2026-05-25 20:57

Stars history (chart: total stars)

Commits history (chart: default branch commits)

Metadata

AI development signals

No AI development config files detected.