Awesome List
Awesome list for AI agent harness engineering: tools, patterns, evals, memory, MCP, permissions, observability, and orchestration.
GitHub stars and default-branch commits for ai-boost/awesome-harness-engineering.
125 repos currently saved from this list.
Minimal and readable coding agent harness implementation in Python to explain the core components of coding agents.
A production-ready runtime framework for agent apps with secure tool sandboxing, Agent-as-a-Service APIs, scalable deployment, full-stack observability, and broad framework compatibility.
Official, AWS-supported MCP servers, skills, and plugins to help AI agents build on AWS
Stash — persistent memory layer for AI agents. Episodes, facts, and working context stored in Postgres. MCP server included. Self-hosted, single binary, no cloud required.
Every practical and proposed defense against prompt injection.
Claw-Eval is an evaluation harness for LLMs acting as agents. All tasks are verified by humans.
Bring your own agent and build a self-improving agentic system. Automatically mine failures, optimize the agent harness, and gate against regressions.
Audit-grade multi-agent orchestration for CLI coding agents (Claude Code, Codex, Gemini CLI, +40 more). HMAC-chained audit log, signed agent cards, per-artefact lineage, air-gap deploy. The orchestrator your compliance team will sign off on. https://bernstein.run
Action-aware permissions for coding agents. A deterministic safety guard that keeps you in the flow.
State machine guardrails for AI agents
Browser Use Box: a 24/7 Claude Code agent for Playwright-style browser automation with Browser Use Cloud, Telegram, and a real browser on any box you own.
React components for visualizing traces from AI agents
MCP Fusion - The TypeScript framework for secure MCP servers.
tui-use lets agents interact with programs that expect a human at the keyboard — REPLs, debuggers, TUI apps, and anything else bash can't reach.
Demonstration of an agent harness with access to tools like Slack, GitHub, and Linear so it can act as your own complete software engineer.
Meta Harness Implementation
Continual harness optimization
SWE-bench reproduction script for running confucius-code-agent (CCA)
Automated harness evolution for AI agents. A Claude Code plugin that iteratively optimizes system prompts, routing, retrieval, and orchestration code using full-trace counterfactual diagnosis. Based on Meta-Harness (Lee et al., 2026).
Agent debugging skill. Stop AI debugging guesswork with runtime evidence.
Build your own coding agent workshop - Feb 19th 2026
Human-in-the-Loop Protocol for Autonomous Agent Services — Open Standard (v0.8)
Point-in-time snapshot of projects describing themselves as AI agent harnesses (April 2026)
ML6 x AISO Agent Workshop (March 2026)