Awesome List

Awesome Agent Harness

An awesome list of Agent Harness engineering resources, including GitHub projects, tools, benchmarks, and practical guides.

Picrew/awesome-agent-harness #agent-harness#context-engineering#harness-engineering

Open GitHub

List stars: 1,018
README repos: 231
Indexed repos: 227
List commits: 34
Forks: 82
Open issues: 10

Tracked list growth

GitHub stars and default-branch commits for Picrew/awesome-agent-harness.

Latest scan 2026-06-03 10:49

Likes history

GitHub stars

Commits history

Default branch commits

Indexed repositories

227 repos currently saved from this list.

No filters applied

Latest repo push 2026-06-03

Filter this list

Search within Awesome Agent Harness or narrow by ecosystem and project health.

Search repositories

Search mode

Keyword Semantic

Tune results

The controls most people need first.

Language

Freshness

Sort

Direction

More filters Topics, generated tags, stack, age, archive status, and growth.

Ecosystem

GitHub topic

Generated tag

Framework or stack

Package manager

Health

Minimum stars

Repository age

Uses known first-commit dates.

Archive status

AI development signals

Momentum

Unmaintained for

Commit velocity

Star growth

Reset filters

Highlighted

Open highlighted repo slot

Put your repository first

Promote a GitHub repo at the top of Awesome repository list views for 7 days.

docker/docker-agent

AI Agent Builder and Runtime by Docker Engineering

Go CobragRPC GoStarlette BundlerGo modules #agents#ai pushed 2026-06-02 6,824 commits first commit 2025-05-06 1 list mention AI dev signals

★ 2,979

Website ↗ GitHub ↗

RunMaestro/Maestro

Agent Orchestration Command Center

TypeScript #ai-agents#claude-code#codex#generative-ai pushed 2026-05-26 3,530 commits first commit 2025-11-24 4 list mentions AI dev signals

★ 2,954

Website ↗ GitHub ↗

lmnr-ai/lmnr

Laminar - open-source observability platform purpose-built for AI agents. YC S24.

TypeScript #agent-observability#agents#ai#ai-observability pushed 2026-05-25 1,536 commits 1 list mention AI dev signals

★ 2,949

Website ↗ GitHub ↗

awslabs/agentcore-samples

Amazon Bedrock Agentcore accelerates AI agents into production with the scale, reliability, and security, critical to real-world deployment.

Jupyter Notebook FastAPIFlaskJupyter npmPEP 517 #agent#agentic-ai#agents#authentication pushed 2026-06-02 573 commits first commit 2025-07-16 2 list mentions AI dev signals

★ 2,929

Website ↗ GitHub ↗

modelscope/evalscope

A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.

Python #evaluation#llm#performance#rag pushed 2026-05-25 777 commits 2 list mentions AI dev signals

★ 2,844

Website ↗ GitHub ↗

open-gitagent/opengap

A framework-agnostic, git-native standard for defining AI agents

TypeScript npm #agent#agent-framework#agent-skills#agents pushed 2026-05-28 93 commits first commit 2026-02-24 1 list mention AI dev signals

★ 2,796

Website ↗ GitHub ↗

openclaw/acpx

Headless CLI client for stateful Agent Client Protocol (ACP) sessions

TypeScript #agentclientprotocol pushed 2026-05-25 402 commits 1 list mention AI dev signals

★ 2,745

Website ↗ GitHub ↗

google/agents-cli

The CLI and skills that turn any coding assistant into an expert at creating, evaluating, and deploying AI agents on Google Cloud.

PEP 517 #adk#agent-development-kit#agents#coding-agent pushed 2026-06-01 22 commits first commit 2026-04-14 1 list mention

★ 2,693

Website ↗ GitHub ↗

awslabs/aidlc-workflows

AI-Driven Life Cycle (AI-DLC) adaptive workflow steering rules for AI coding agents

Python pytestStarlette PEP 517uv pushed 2026-06-03 186 commits first commit 2025-11-13 1 list mention AI dev signals

★ 2,673

GitHub ↗

Yuyz0112/claude-code-reverse

A Tool to Visualize Claude Code's LLM Interactions

JavaScript pushed 2025-08-26 16 commits 1 list mention

★ 2,370

Website ↗ GitHub ↗

kubernetes-sigs/agent-sandbox

agent-sandbox enables easy management of isolated, stateful, singleton workloads, ideal for use cases like AI agent runtimes.

Go pushed 2026-05-22 507 commits 2 list mentions AI dev signals

★ 2,360

Website ↗ GitHub ↗

NVIDIA/NeMo-Agent-Toolkit

The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.

Python pushed 2026-05-21 1,336 commits 1 list mention AI dev signals

★ 2,329

Website ↗ GitHub ↗

harbor-framework/terminal-bench

A benchmark for LLMs on complicated tasks in the terminal

Python FastAPIpytestReact CargoPEP 517 pushed 2026-01-22 903 commits first commit 2025-01-17 2 list mentions AI dev signals

★ 2,305

Website ↗ GitHub ↗

microsoft/agent-governance-toolkit

AI Agent Governance Toolkit — Policy enforcement, zero-trust identity, execution sandboxing, and reliability engineering for autonomous AI agents. Covers 10/10 OWASP Agentic Top 10.

Python #agent-framework#ai-agents#ai-safety#compliance pushed 2026-05-25 1,726 commits 3 list mentions AI dev signals

★ 2,227

GitHub ↗

harbor-framework/harbor

Harbor is a framework for running agent evaluations and creating and using RL environments.

Python #evals#rl-environments#terminal-bench pushed 2026-05-25 971 commits 2 list mentions AI dev signals

★ 2,113

Website ↗ GitHub ↗

codejunkie99/agentic-stack

One brain, many harnesses. Portable .agent/ folder (memory + skills + protocols) that plugs into Claude Code, Cursor, Windsurf, OpenCode, OpenClaw, Hermes, or DIY Python — and keeps its knowledge when you switch.

Python React npmpip pushed 2026-05-25 162 commits first commit 2026-04-15 2 list mentions AI dev signals

★ 2,065

GitHub ↗

apache/burr

Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastructure.

Python ExpressFastAPILangChain npmPEP 517 #ai#burr#chatbot-framework#dags pushed 2026-06-02 925 commits first commit 2024-01-29 2 list mentions

★ 2,017

Website ↗ GitHub ↗

MicrosoftDocs/mcp

Official Microsoft Learn MCP Server and CLI tool – powering LLMs and AI agents with real-time, trusted Microsoft docs & code samples.

TypeScript #ai#ai-agents#cli#copilot pushed 2026-05-23 68 commits 1 list mention AI dev signals

★ 1,659

Website ↗ GitHub ↗

GoogleCloudPlatform/scion

No description.

Go AstroCobragRPC Go Go modulesnpm pushed 2026-06-02 2,788 commits first commit 2025-12-20 1 list mention AI dev signals

★ 1,575

GitHub ↗

stakpak/agent

Ship your code, on autopilot. An open source agent that lives on your machines 24/7 and keeps your apps running. 🦀

Rust #agent#ai-agent#autonomous-agent#devops pushed 2026-05-20 2,999 commits 2 list mentions AI dev signals

★ 1,560

Website ↗ GitHub ↗

web-arena-x/webarena

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

Python #agent#nlp pushed 2025-11-26 203 commits first commit 2023-07-24 2 list mentions

★ 1,489

Website ↗ GitHub ↗

Infisical/agent-vault

A HTTP credential proxy and vault for AI agents like Claude Code, OpenClaw, Hermes, custom agents + harnesses, and more.

Go #agents#ai-agents#secrets-management pushed 2026-05-28 215 commits first commit 2026-03-27 1 list mention AI dev signals

★ 1,484

Website ↗ GitHub ↗

OpenCoworkAI/open-cowork

Open-source AI agent desktop app for Windows & macOS. One-click install Claude Code, MCP tools, and Skills — with sandbox isolation, multi-model support, and Feishu/Slack integration.

TypeScript #ai-agent#ai-coding#ai-tools#anthropic pushed 2026-05-23 689 commits 1 list mention AI dev signals

★ 1,412

GitHub ↗

rivet-dev/sandbox-agent

Run Coding Agents in Sandboxes. Control Them Over HTTP. Supports Claude Code, Codex, OpenCode, and Amp.

TypeScript #agent#ai#amp#claude pushed 2026-03-30 419 commits first commit 2026-01-25 1 list mention AI dev signals

★ 1,411

Website ↗ GitHub ↗

google/oss-fuzz-gen

LLM powered fuzzing via OSS-Fuzz.

Python FastAPIStarlette PEP 517pip #ai#fuzzing#llm#security pushed 2026-03-17 752 commits first commit 2024-01-25 1 list mention

★ 1,402

GitHub ↗

e2b-dev/desktop

E2B Desktop Sandbox for LLMs. E2B Sandbox with desktop graphical environment that you can connect to any LLM for secure computer use.

Python ElectronpytestVite npmpnpm #ai#computer#desktop#e2b pushed 2026-06-02 336 commits first commit 2024-05-26 1 list mention

★ 1,396

Website ↗ GitHub ↗

m0n0x41d/haft

Engineering decisions engine that know when they're stale. Frame, compare, decide — with evidence decay and parity enforcement. For Claude Code, Cursor, Gemini CLI, Codex and more.

Go #ai-agents#ai-coding#ai-skills#air pushed 2026-05-25 1,001 commits 1 list mention AI dev signals

★ 1,333

Website ↗ GitHub ↗

Yuan-lab-LLM/ClawManager

A Kubernetes-native control plane for AI agent instance management, with governed AI access, runtime orchestration, and reusable resources across multiple agent runtimes.

TypeScript #hermes#kubernetes#openclaw#webtop pushed 2026-05-23 154 commits 1 list mention

★ 1,296

Website ↗ GitHub ↗

langchain-ai/deepagentsjs

The batteries included agent harness.

TypeScript #ai#deepagents#langchain#langgraph pushed 2026-05-22 490 commits 1 list mention AI dev signals

★ 1,259

Website ↗ GitHub ↗

sierra-research/tau2-bench

τ-Bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains

Python #ai#benchmark#conversational-agents#language-model-agent pushed 2026-05-21 147 commits 1 list mention AI dev signals

★ 1,230

Website ↗ GitHub ↗

agentbay-ai/wuying-agentbay-sdk

The Cloud Sandbox Built for AI Agents

Python FastAPIgRPC GoLangChain Go modulesMaven #agent#agentbay#ai#sandbox pushed 2026-05-19 2,020 commits first commit 2025-05-26 1 list mention AI dev signals

★ 1,129

Website ↗ GitHub ↗

future-agi/future-agi

Open-source, end-to-end platform for evaluating, observing, and improving LLM and AI agent applications. Tracing · Evals · Simulations · Datasets · Gateway · Guardrails. Self-hostable. Apache 2.0.

Python CeleryDjangoDjango REST Framework Go modulesnpm #ai#ai-gateway#evals#llm pushed 2026-06-02 1,069 commits first commit 2026-04-23 2 list mentions

★ 1,075

Website ↗ GitHub ↗

thClaws/thClaws

Open-source AI agent harness in native Rust — GUI, CLI, headless, and webapp from one binary. Multi-provider, MCP, skills, plugins, agent teams.

Rust #agent-harness#agent-teams#ai-agent#anthropic pushed 2026-05-30 382 commits first commit 2026-04-20 3 list mentions AI dev signals

★ 1,058

Website ↗ GitHub ↗

first-fluke/oh-my-agent

Portable, vendor-agnostic agent harness for project-specific skills, workflows, and agent teams aligned with your codebase, conventions, and engineering standards.

TypeScript #agent-harness#agent-skills#agentic-coding#ai-agents pushed 2026-05-26 1,963 commits 1 list mention AI dev signals

★ 1,020

Website ↗ GitHub ↗

Arize-ai/openinference

OpenTelemetry Instrumentation for AI Observability

Python #aiops#gemini#hacktoberfest#haystack pushed 2026-05-26 1,882 commits first commit 2023-12-26 2 list mentions AI dev signals

★ 991

Website ↗ GitHub ↗

stanford-iris-lab/meta-harness

Reference code for the Meta-Harness paper.

Python #harness-engineering#llm-agents pushed 2026-04-29 11 commits 2 list mentions

★ 953

Website ↗ GitHub ↗

tensorlakeai/tensorlake

Tensorlake is a serverless runtime for sandboxes and deploying background agentic applications

Python pushed 2026-05-25 688 commits 1 list mention AI dev signals

★ 925

Website ↗ GitHub ↗

NVIDIA-NeMo/Gym

Evaluate and improve models and agents using environments

Python #agents#benchmarks#environments#evaluation pushed 2026-05-25 569 commits 1 list mention AI dev signals

★ 916

Website ↗ GitHub ↗

Chorus-AIDLC/Chorus

The Agent Harness for AI-Human Collaboration, inspired by the AI-DLC (AI-Driven Development Lifecycle)

TypeScript #agent-harness#ai-agents#ai-dlc#claude-code pushed 2026-05-25 480 commits 1 list mention AI dev signals

★ 915

Website ↗ GitHub ↗

rasbt/mini-coding-agent

Minimal and readable coding agent harness implementation in Python to explain the core components of coding agents.

Python #agents#ai#large-language-models#llms pushed 2026-04-07 15 commits 2 list mentions

★ 881

Website ↗ GitHub ↗

abshkbh/arrakis

A fully customizable and self-hosted sandboxing solution for AI agent code execution and computer use. It features out-of-the-box support for backtracking, a simple REST API and Python SDK, automatic port forwarding, and secure MicroVM isolation. Perfect for safely running, testing, and backtracking multi-step agent workflows.

Go gRPC Go Go modules pushed 2025-06-02 240 commits first commit 2024-08-04 1 list mention

★ 815

GitHub ↗

context-space/context-space

Ultimate Context Engineering Infrastructure, starting from MCPs and Integrations

Go GingRPC GoNext.js Go modulespnpm #agent#agents#ai#ai-agent pushed 2025-10-22 104 commits first commit 2025-07-08 1 list mention AI dev signals

★ 810

Website ↗ GitHub ↗

Git-on-my-level/codex-autorunner

No description.

Python FastAPIpytestSvelte npmPEP 517 pushed 2026-06-02 2,008 commits first commit 2025-12-09 1 list mention AI dev signals

★ 808

Website ↗ GitHub ↗

agentscope-ai/agentscope-runtime

A production-ready runtime framework for agent apps with secure tool sandboxing, Agent-as-a-Service APIs, scalable deployment, full-stack observability, and broad framework compatibility.

Python CeleryExpressFastAPI npmPEP 517 #a2a#agent#agentscope#agno pushed 2026-05-21 287 commits first commit 2025-08-14 2 list mentions AI dev signals

★ 805

Website ↗ GitHub ↗

SafeRL-Lab/cheetahclaws

CheetahClaws: A Fast and Easy-to-Use Agent Harness Infrastructure for Long-Horizon, Multi-Model, and Tool-Using AI Systems

Python PEP 517pip #agentic-ai#claude#claude-code#memory pushed 2026-06-02 666 commits first commit 2026-04-01 1 list mention AI dev signals

★ 710

Website ↗ GitHub ↗

TheAgentCompany/TheAgentCompany

An agent benchmark with tasks in a simulated software company.

Python #agent#ai#ai-benchmark#ai-research pushed 2025-11-17 729 commits 1 list mention

★ 710

Website ↗ GitHub ↗

claw-eval/claw-eval

Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.

Python FastAPIpytest PEP 517pip #agent#harness#llm#openclaw pushed 2026-05-17 43 commits first commit 2026-03-17 2 list mentions

★ 632

Website ↗ GitHub ↗

matevip/mateclaw

🤖 MateClaw — Your second brain with Multi-Agent Orchestration, MCP Protocol, Skills & Memory, Dream, and Multi-Channel Support. Built on Spring AI Alibaba.

Java #agent#agent-harness#ai-agent#dingtalk-robot pushed 2026-05-31 1,020 commits first commit 2026-04-04 2 list mentions AI dev signals

★ 533

Website ↗ GitHub ↗

UKGovernmentBEIS/inspect_evals

Collection of evals for Inspect AI

Python pushed 2026-05-25 2,431 commits 1 list mention AI dev signals

★ 512

Website ↗ GitHub ↗

neosigmaai/auto-harness

Bring your own agent and build a self-improving agentic system. Automatically mine failures, optimize the agent harness, and gate against regressions.

Python pushed 2026-04-29 11 commits 2 list mentions

★ 509

Website ↗ GitHub ↗

Activity

Default branch: main
Last pushed: 2026-06-02
GitHub updated: 2026-06-03
Created: 2026-03-30
First commit: -
Last scanned: 2026-06-03 10:49
Watchers: 1

Indexed repo mix

Repo stars: 4,941,544
Repo forks: 761,784
Active: 227
Archived: 0

Languages

Python (104) TypeScript (68) Rust (18) Go (9) JavaScript (6) Shell (4) C# (3) HTML (3) Java (3) Swift (2) Elixir (1) Jupyter Notebook (1)

Awesome Agent Harness

Tracked list growth

Likes history

Commits history

Indexed repositories

Filter this list

Put your repository first

How it works

Pricing