Awesome List

Awesome Agent Harness

An awesome list of Agent Harness engineering resources, including GitHub projects, tools, benchmarks, and practical guides.

Picrew/awesome-agent-harness #agent-harness#context-engineering#harness-engineering

Open GitHub

List stars: 1,018
README repos: 231
Indexed repos: 227
List commits: 34
Forks: 82
Open issues: 10

Tracked list growth

GitHub stars and default-branch commits for Picrew/awesome-agent-harness.

Latest scan 2026-06-03 10:49

Likes history

GitHub stars

Commits history

Default branch commits

Indexed repositories

227 repos currently saved from this list.

No filters applied

Latest repo push 2026-06-03

Filter this list

Search within Awesome Agent Harness or narrow by ecosystem and project health.

Search repositories

Search mode

Keyword Semantic

Tune results

The controls most people need first.

Language

Freshness

Sort

Direction

More filters Topics, generated tags, stack, age, archive status, and growth.

Ecosystem

GitHub topic

Generated tag

Framework or stack

Package manager

Health

Minimum stars

Repository age

Uses known first-commit dates.

Archive status

AI development signals

Momentum

Unmaintained for

Commit velocity

Star growth

Reset filters

Highlighted

Open highlighted repo slot

Put your repository first

Promote a GitHub repo at the top of Awesome repository list views for 7 days.

SWE-agent/SWE-ReX

Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.

Python #agent#agents#ai#aws pushed 2026-05-18 348 commits 1 list mention

★ 508

Website ↗ GitHub ↗

pydantic/pydantic-ai-harness

Batteries for your Pydantic AI agent.

Python pytest uv pushed 2026-06-03 48 commits first commit 2026-03-20 1 list mention AI dev signals

★ 494

GitHub ↗

SponsioLabs/Sponsio

Deterministic safety solutions for probabilistic AI agents

Python #agent-guardrails#agent-harness#agent-runtime#agent-safety pushed 2026-05-26 117 commits 1 list mention AI dev signals

★ 445

Website ↗ GitHub ↗

AVIDS2/memorix

Open-source cross-agent memory layer for coding agents via MCP. Compatible with Cursor, Claude Code, Codex, Windsurf, Gemini CLI, GitHub Copilot, Kiro, OpenCode, Antigravity, and Trae.

TypeScript #agent-memory#ai-coding#claude-code#codex pushed 2026-05-22 420 commits first commit 2026-02-14 1 list mention AI dev signals

★ 439

GitHub ↗

Th0rgal/sandboxed.sh

Safe runtime for autonomous on-chain AI agents: isolated sandboxes, Library skills, encrypted secrets, and OKX read-only security checks.

Rust #ai-agents#autonomous-agents#claude#claude-code pushed 2026-05-25 1,329 commits 1 list mention AI dev signals

★ 438

Website ↗ GitHub ↗

InternLM/WildClawBench

An in-the-wild benchmark for AI agents in the OpenClaw Environment.

Python #agentic-ai#agentic-evaluation#agents#benchmarks pushed 2026-05-19 10 commits 1 list mention

★ 407

Website ↗ GitHub ↗

scaleapi/SWE-bench_Pro-os

SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?

Python pushed 2026-05-18 75 commits 1 list mention

★ 398

GitHub ↗

IBM/mcp

A collection of Model Context Protocol (MCP) servers, clients and developer tools by IBM.

#agents#llm#mcp#modelcontextprotocol pushed 2026-05-04 59 commits first commit 2025-04-02 1 list mention

★ 380

Website ↗ GitHub ↗

awslabs/agent-evaluation

A generative AI-powered framework for testing virtual agents.

Python pytestStreamlitTornado PEP 517pip pushed 2025-12-15 276 commits first commit 2024-03-19 1 list mention

★ 364

Website ↗ GitHub ↗

clawdotnet/openclaw.net

Self-hosted OpenClaw gateway + agent runtime in .NET (NativeAOT-friendly)

C# #agent-harness#agent-runtime#agentqi#ai-agent pushed 2026-05-25 387 commits 1 list mention AI dev signals

★ 347

GitHub ↗

TIGER-AI-Lab/ClawBench

Open-source benchmark for browser AI agents on 153 everyday online tasks across 144 live websites. 5-layer recording + DOM-match + LLM judge. Top score 33.3%.

Python #agent-evaluation#agentic-ai#ai-agent-benchmark#ai-agents pushed 2026-05-25 310 commits first commit 2026-04-10 1 list mention AI dev signals

★ 347

Website ↗ GitHub ↗

capsulerun/capsule

Secure runtime to sandbox AI agent tasks. Run untrusted code in isolated WebAssembly environments.

Rust Expresspytest Cargonpm #agentic-workflow#ai-agents#code-execution#code-interpreter pushed 2026-05-26 689 commits first commit 2025-12-01 2 list mentions

★ 288

GitHub ↗

manthanguptaa/water

The production-ready agent harness framework for Python

Python #agent-harness#agents#framework#harness pushed 2026-03-24 174 commits 1 list mention

★ 288

Website ↗ GitHub ↗

ucsandman/DashClaw

🛡️Decision infrastructure for AI agents. Intercept actions, enforce guard policies, require approvals, and produce audit-ready decision trails.

JavaScript #agent-framework#agent-governance#agent-runtime#ai-agents pushed 2026-05-25 1,750 commits 1 list mention AI dev signals

★ 268

Website ↗ GitHub ↗

ServiceNow/WorkArena

WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?

Python pushed 2026-04-25 213 commits 1 list mention

★ 251

Website ↗ GitHub ↗

omnirexflora-labs/omnicoreagent

Open Python agent harness for production AI apps: tools, MCP, memory, workspace, telemetry, subagents, background tasks, and OmniServe APIs.

Python #agent#agent-harness#ai-agents#background-tasks pushed 2026-05-25 650 commits 1 list mention

★ 241

Website ↗ GitHub ↗

mattolson/agent-sandbox

Secure local dev environment for collaboration with AI coding agents

Python #agent-harness#agent-sandbox#agents#coding-agents pushed 2026-05-26 560 commits 1 list mention AI dev signals

★ 174

GitHub ↗

sd0xdev/sd0x-dev-flow

The harness layer for Claude Code — a reference implementation of harness engineering with hook-enforced dual review, state-machine gates that survive context compaction, and fail-closed safety where it counts. Quality gates that AI can't skip.

JavaScript #agent-harness#claude-code#claude-code-plugin#codex pushed 2026-05-14 456 commits 1 list mention AI dev signals

★ 157

GitHub ↗

SouthBridgeAI/hankweave-runtime

Runtime for long-horizon agents

JavaScript pushed 2026-03-20 722 commits 1 list mention

★ 123

GitHub ↗

UnicomAI/hexagent

HexAgent – An Agent harness that gives any LLM a computer to complete tasks the way humans do

Python #agent-harness#agents#cowork pushed 2026-05-19 105 commits 1 list mention AI dev signals

★ 122

GitHub ↗

inngest/utah

Universally Triggered Agent Harness - An OpenClaw-like Inngest-powered personal agent

TypeScript #agent-harness#ai-agent#durable-execution#event-driven pushed 2026-05-18 41 commits 1 list mention AI dev signals

★ 116

GitHub ↗

frumu-ai/tandem

Tandem is the authority layer for AI-first work: runtime authority for agents, tools, memory, approvals, and audit trails.

Rust AstroAxumpytest Cargonpm #agentic-workflow#anthropic#governed-execution#help-wanted pushed 2026-06-02 2,146 commits first commit 2026-01-17 1 list mention AI dev signals

★ 106

Website ↗ GitHub ↗

OpenHands/benchmarks

Evaluation harness for OpenHands V1.

Python pushed 2026-05-24 414 commits 1 list mention AI dev signals

★ 85

GitHub ↗

agentmd/agent.md

This repository defines AGENT.md, a standardized format that lets your codebase speak directly to any agentic coding tool.

#agentmd#rfc pushed 2025-07-10 3 commits first commit 2025-07-09 1 list mention

★ 82

Website ↗ GitHub ↗

ucsb-mlsec/terminal-bench-env

No description.

Python pushed 2026-03-24 31 commits 1 list mention

★ 82

GitHub ↗

ServiceNow/webarena-verified

A verified version of the WebArena Benchmark

Python pushed 2026-03-08 22 commits 1 list mention

★ 38

Website ↗ GitHub ↗

reacher-z/HarnessBench

No description.

Python pushed 2026-05-12 7 commits first commit 2026-04-16 1 list mention

★ 11

GitHub ↗

Activity

Default branch: main
Last pushed: 2026-06-02
GitHub updated: 2026-06-03
Created: 2026-03-30
First commit: -
Last scanned: 2026-06-03 10:49
Watchers: 1

Indexed repo mix

Repo stars: 4,941,544
Repo forks: 761,784
Active: 227
Archived: 0

Languages

Python (104) TypeScript (68) Rust (18) Go (9) JavaScript (6) Shell (4) C# (3) HTML (3) Java (3) Swift (2) Elixir (1) Jupyter Notebook (1)

Awesome Agent Harness

Tracked list growth

Likes history

Commits history

Indexed repositories

Filter this list

Put your repository first

How it works

Pricing