github Active AI dev

Repository profile

sierra-research/tau2-bench

τ-Bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains

Python · MIT · main · Stack scanned · README.md

Stars: 1,578
Forks: 402
Watchers: 12
Issues: 149
Commits: 165
Awesome lists: 1

Activity and growth

Tracked growth, recent movement, and commit velocity from stored repository snapshots.

Latest capture 2026-07-15 03:11

Star growth, last 7 days: 0 (0.0%)
Commit velocity, last 7 days: 0 (0.0%)
Stars since baseline: +348
Snapshot coverage: 5

Tracked growth

5 captures since 2026-05-25

[Charts: stars history (total stars) and commits history (default branch commits), shown over all tracked data]
Detected stack

Frameworks, package managers, ecosystems, and dependency manifests found during catalog scans.

Scanned 2026-07-15 03:11

Stack signals: 4
Package managers: 3
Manifest files: 6
Dependencies: 317

Frameworks and tools

FastAPI web framework · high confidence
pytest test framework · high confidence
React frontend framework · high confidence
Vite build tool · high confidence

Package managers: npm, PEP 517, uv
Ecosystems: javascript, python

Dependency files

6 manifests

pyproject.toml python ecosystem, 49 dependencies
uv.lock python ecosystem, 0 dependencies
web/leaderboard/package.json javascript ecosystem, 12 dependencies
web/leaderboard/package-lock.json javascript ecosystem, 246 dependencies
src/experiments/agentify_tau_bench/pyproject.toml python ecosystem, 10 dependencies
src/experiments/agentify_tau_bench/uv.lock python ecosystem, 0 dependencies

Classification

Searchable topics, generated tags, and stack labels that explain where this repository fits.

Topics: 5
Tags: 0
Stacks: 4

Topics

#ai #benchmark #conversational-agents #language-model-agent #llm

Generated tags

No generated tags yet.

Stack labels

FastAPI, pytest, React, Vite

AI development signals

Agent instructions and tool configuration paths found in the repository tree.

12 paths

AI agent config detected

12 config paths: 10 files, 2 directories

Agent instructions: 7 · Cursor: 5

Key config paths

Cursor: .cursor
Cursor: .cursor/rules
Cursor: .cursor/rules/audio-native-provider.md
Cursor: .cursor/rules/background-audio-files.md
Cursor: .cursor/rules/nova-sonic.md
Agent instructions: AGENTS.md
Agent instructions: src/tau2/agent/AGENTS.md
Agent instructions: src/tau2/domains/AGENTS.md
Agent instructions: src/tau2/evaluator/AGENTS.md
Agent instructions: src/tau2/voice/AGENTS.md
Agent instructions: src/tau2/voice/audio_native/AGENTS.md
Agent instructions: tests/AGENTS.md

Similar repositories

Nearest indexed repositories by embedding similarity.

sierra-research/tau-bench

Code and Data for Tau-Bench

1,322 stars

Python · 1 awesome list

THUDM/AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

3,579 stars

Python · 4 awesome lists

harbor-framework/terminal-bench

A benchmark for LLMs on complicated tasks in the terminal

2,450 stars

Python · 2 awesome lists

Ayanami0730/deep_research_bench

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

784 stars

Python · 1 awesome list

UKGovernmentBEIS/inspect_evals

Collection of evals for Inspect AI

583 stars

Python · 1 awesome list

TheAgentCompany/TheAgentCompany

An agent benchmark with tasks in a simulated software company.

740 stars

Python · 1 awesome list

Metadata

Language: Python
License: MIT
Default branch: main
Created: 2025-06-09
First commit: 2025-06-10
Last pushed: 2026-07-15
GitHub updated: 2026-07-15
Last synced: 2026-07-15 03:11
Stack detected: 2026-07-15 03:11
Archived: no

Links and files

GitHub Website

https://www.taubench.com

README

Appears in

Awesome Agent Harness
