Open highlighted repo slot
Put your repository first
Promote a GitHub repo at the top of Awesome repository list views for 7 days.
GitHub projects from awesome lists
Search names, descriptions, topics, tags, and stacks, then tune results by ecosystem, freshness, health, and cross-list signal.
Open highlighted repo slot
Promote a GitHub repo at the top of Awesome repository list views for 7 days.
Open-source benchmark for browser AI agents on 153 everyday online tasks across 144 live websites. 5-layer recording + DOM-match + LLM judge. Top score 33.3%.
Secure runtime to sandbox AI agent tasks. Run untrusted code in isolated WebAssembly environments.
The production-ready agent harness framework for Python
🛡️The governance runtime for AI agents. Intercept actions, enforce guard policies, require approvals, and produce audit-ready decision trails.
WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?
Open Python agent harness for production AI apps: tools, MCP, memory, workspace, telemetry, subagents, background tasks, and OmniServe APIs.
Secure local dev environment for collaboration with AI coding agents
The harness layer for Claude Code — a reference implementation of harness engineering with hook-enforced dual review, state-machine gates that survive context compaction, and fail-closed safety where it counts. Quality gates that AI can't skip.
Runtime for long-horizon agents
HexAgent – An Agent harness that gives any LLM a computer to complete tasks the way humans do
Universally Triggered Agent Harness - An OpenClaw-like Inngest-powered personal agent
Tandem is the authority layer for AI-first work: runtime authority for agents, tools, memory, approvals, and audit trails.
Evaluation harness for OpenHands V1.
No description.
This repository defines AGENT.md, a standardized format that lets your codebase speak directly to any agentic coding tool.
A verified version of the WebArena Benchmark
No description.