Awesome List
An awesome list of Agent Harness engineering resources, including GitHub projects, tools, benchmarks, and practical guides.
GitHub stars and default-branch commits for Picrew/awesome-agent-harness.
227 repos currently saved from this list.
Flexible and powerful framework for managing multiple AI agents and handling complex conversations
✨ Build AI agents and web apps — with a single binary.
Agentic orchestrator for parallel coding agents — plans tasks, spawns agents, and autonomously handles CI fixes, merge conflicts, and code reviews.
Open-source observability for your GenAI or LLM application, based on OpenTelemetry
This is a simple demonstration of more advanced, agentic patterns built on top of the Realtime API.
Coding agents from your phone, desktop and CLI
Coding Agent Harness
Plano is an AI-native proxy and data plane for agentic apps, with built-in orchestration, safety, observability, and smart LLM routing so you stay focused on your agents' core logic.
Demo of a customer service use case implemented with the OpenAI Agents SDK
From a goal to a task DAG, automatically. TypeScript-native multi-agent orchestration.
🧱 secure, local and programmable sandboxes for AI agents
OpenShell is the safe, private runtime for autonomous AI agents.
A model-driven approach to building AI agents in just a few lines of code.
Instant, Concurrent, Secure & Lightweight Sandbox for AI Agents.
🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓
Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including CrewAI, Agno, OpenAI Agents SDK, Langchain, Autogen, AG2, and CamelAI
Orchestration layer for coding agents (Claude Code, Codex)
Your super agent for work: local-first, learns your working context in minutes and never forgets it.
Own your AI. The native macOS harness for AI agents -- any model, persistent memory, autonomous execution, cryptographic identity. Built in Swift. Fully offline. Open source.
Orchestrate sandboxed coding agents in TypeScript with sandcastle.run()
Build and deploy AI Agents on Cloudflare
SWE-bench: Can Language Models Resolve Real-world GitHub Issues?
All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.
An open-source Collaborative Multi-Agent OS for transparent, human-in-the-loop task coordination via Matrix rooms.
The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!
TencentDB Agent Memory delivers fully local long-term memory for AI Agents via a 4-tier progressive pipeline, with zero external API dependencies.
Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make your own persistent autonomous agent on top!
A simple SWE-style browser agent framework that achieves SOTA results on long-horizon web tasks.
Robust, fast, scalable, and sandboxed open-source online code execution system for humans and AI.
Our library for RL environments + evals
Latitude is the open-source agent engineering platform
Plugins from the Cursor community
An AI Gateway, registry, and proxy that sits in front of any MCP, A2A, or REST/gRPC APIs, exposing a unified endpoint with centralized discovery, guardrails and management. Optimizes Agent & Tool calling, and supports plugins.
Real-time transport layer for Java AI agents. Build once with @Agent — deliver over WebSocket, SSE, gRPC, and WebTransport/HTTP3. Talk MCP, A2A and AG-UI.
Enterprise AI Platform with guardrails, MCP registry, gateway & orchestrator
The sandbox agent framework.
Open-source security automation platform for teams and AI agents
TinyAGI is the agent teams orchestrator for One Person Company. (fka TinyClaw)
LangChain 🔌 MCP
A meta-skill that designs domain-specific agent teams, defines specialized agents, and generates the skills they use.
Agent Skills as a Memory Layer
Agent framework for the JVM. Pronounced Em-BAY-bel /ɛmˈbeɪbəl/
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Devon: An open-source pair programmer
Open-source Claude Cowork. A desktop AI assistant that helps you with programming, file management, and any task you can describe.
Self-hosted, open-source agent skill registry for enterprises. Publish & version skill packages, govern with RBAC and audit logs, deploy on-premise with Docker or Kubernetes.
The platform for LLM evaluations and AI agent testing
Catalog of official Microsoft MCP (Model Context Protocol) server implementations for AI-powered data access and tool integration
A lightweight, powerful framework for multi-agent workflows and voice agents
Next Generation Agentic Proxy for AI Agents and MCP servers