← Back to search

github Active AI dev

Repository profile

TIGER-AI-Lab/ClawBench

Open-source benchmark for browser AI agents on daily tasks.

Python Apache-2.0 main Stack scanned README.md

Open website Open GitHub

Stars: 471
Forks: 27
Watchers: 9
Issues: 41
Commits: 358
Awesome lists: 1

Repository updates

Get generated TIGER-AI-Lab/ClawBench development summaries by email, or follow the weekly and monthly RSS feeds.

Weekly RSS Monthly RSS

Activity and growth

Tracked growth, recent movement, and commit velocity from stored repository snapshots.

Latest capture 2026-07-12 03:10

Star growth, last 7 days: 0 0.0%
Commit velocity, last 7 days: 0 0.0%
Stars since baseline: +124
Snapshot coverage: 4

Tracked growth

4 captures since 2026-05-30

Stars from baseline +124

Time horizon

All tracked data

Custom start Custom end

Stars history

Total stars

Commits history

Default branch commits

Detected stack

Frameworks, package managers, ecosystems, and dependency manifests found during catalog scans.

Scanned 2026-07-12 03:10

Stack signals: 3
Package managers: 1
Manifest files: 4
Dependencies: 79

Frameworks and tools

FastAPI web framework · high confidence
pytest test framework · high confidence
Starlette web framework · medium confidence

uv python

Dependency files

4 manifests

pyproject.toml python ecosystem, 11 dependencies
uv.lock python ecosystem, 49 dependencies
src/clawbench/runtime/runtime-server/pyproject.toml python ecosystem, 3 dependencies
src/clawbench/runtime/runtime-server/uv.lock python ecosystem, 16 dependencies

Classification

Searchable topics, generated tags, and stack labels that explain where this repository fits.

Topics: 20
Tags: 0
Stacks: 3

Topics

#agent-evaluation #agentic-ai #ai-agent-benchmark #ai-agents #benchmark #browser-agent #browser-automation #browser-use #chrome-agent #chrome-extension #computer-use #dataset #evaluation #everyday-tasks #llm #llm-evaluation #online-tasks #real-world-benchmark #web-agent #web-agents

Generated tags

No generated tags yet.

Stack labels

FastAPI pytest Starlette

AI development signals

Agent instructions and tool configuration paths found in the repository tree.

1 path

AI agent config detected

1 config path 1 file 0 directories

Agent instructions

Key config paths

file AGENTS.md

Similar repositories

Nearest indexed repositories by embedding similarity.

InternLM/WildClawBench

An in-the-wild benchmark for AI agents in the OpenClaw Environment.

475 stars

Python 1 awesome list

reacher-z/HarnessBench

No description.

30 stars

Python 1 awesome list

pinchbench/skill

PinchBench is a benchmarking system for evaluating LLM models as OpenClaw coding agents. Made with 🦀 by the humans at https://kilo.ai

1,279 stars

Python 1 awesome list

claw-eval/claw-eval

Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.

716 stars

Python 2 awesome lists

harbor-framework/terminal-bench

A benchmark for LLMs on complicated tasks in the terminal

2,450 stars

Python 2 awesome lists

vivekchand/clawmetry

See your agent think. Real-time observability for 14 AI agent runtimes - OpenClaw, NVIDIA NemoClaw, Claude Code, Codex & 8 more.

387 stars

Python 1 awesome list

Metadata

Language: Python
License: Apache-2.0
Default branch: main
Created: 2026-04-10
First commit: 2026-04-10
Last pushed: 2026-07-11
GitHub updated: 2026-07-11
Last synced: 2026-07-12 03:10
Stack detected: 2026-07-12 03:10
Archived: no

Links and files

GitHub Website

https://claw-bench.com

README

Appears in

Awesome Agent Harness

TIGER-AI-Lab/ClawBench

Activity and growth

Tracked growth

Time horizon

Stars history

Commits history

Detected stack

Frameworks and tools

Dependency files

Classification

Topics

Generated tags

Stack labels

AI development signals

Similar repositories

InternLM/WildClawBench

reacher-z/HarnessBench

pinchbench/skill

claw-eval/claw-eval

harbor-framework/terminal-bench

vivekchand/clawmetry

Metadata

Links and files

Appears in

How it works

Pricing

Follow repository updates

Activity and growth

Tracked growth

Time horizon

Stars history

Commits history

Detected stack

Frameworks and tools

Dependency files

Classification

Topics

Generated tags

Stack labels

AI development signals

Similar repositories

InternLM/WildClawBench

reacher-z/HarnessBench

pinchbench/skill

claw-eval/claw-eval

harbor-framework/terminal-bench

vivekchand/clawmetry

Metadata

Links and files

Appears in