Awesome List
Awesome list for AI agent harness engineering: tools, patterns, evals, memory, MCP, permissions, observability, and orchestration.
GitHub stars and default-branch commits for ai-boost/awesome-harness-engineering.
125 repos currently saved from this list.
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
The open source coding agent.
Production-ready platform for agentic workflow development.
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Model Context Protocol Servers
🙌 OpenHands: AI-Driven Development
Daytona is a Secure and Elastic Infrastructure for Running AI-Generated Code
Bash is all you need: a nano Claude Code-like "agent harness", built from 0 to 1
A programming framework for agentic AI
Universal memory layer for AI Agents
The best-benchmarked open-source AI memory system. And it's free.
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, load balancing, and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, VLLM, NVIDIA NIM]
aider is AI pair programming in your terminal
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
"CLI-Anything: Making ALL Software Agent-Native" -- CLI-Hub: https://clianything.cc/
Installable GitHub library of 1,400+ agentic skills for Claude Code, Cursor, Codex CLI, Gemini CLI, Antigravity, and more. Includes installer CLI, bundles, workflows, and official/community skill collections.
Playwright MCP server
Build resilient agents.
Composio powers 1000+ toolkits, tool search, context management, authentication, and a sandboxed workbench to help you build AI agents that turn intent into action.
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
🤗 smolagents: a barebones library for agents that think in code.
A lightweight, powerful framework for multi-agent workflows
OpenViking is an open-source context database designed specifically for AI agents (such as openclaw). OpenViking unifies the management of the context that agents need (memory, resources, and skills) through a file system paradigm, enabling hierarchical context delivery and self-evolution.
From the team behind Gatsby, Mastra is a framework for building AI-powered applications and agents with a modern TypeScript stack.
The AI Toolkit for TypeScript. From the creators of Next.js, the AI SDK is a free open-source library for building AI-powered applications and agents
Symphony turns project work into isolated, autonomous implementation runs, allowing teams to manage work instead of supervising coding agents.
Agent2Agent (A2A) is an open protocol enabling communication and interoperability between opaque agentic applications.
The batteries-included agent harness.
Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.
Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, DeepSeek, and more. Simple declarative configs with command line and CI/CD integration. Used by OpenAI and Anthropic.
An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
#1 Persistent memory for AI coding agents based on real-world benchmarks
AI Agent Framework, the Pydantic way
Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows).
A collection of projects designed to help developers quickly get started with building deployable applications using the Claude API
The LLM Evaluation Framework
Context window optimization for AI coding agents. Sandboxes tool output, 98% reduction. 15 platforms
A format specification for describing a visual identity to coding agents. DESIGN.md gives agents a persistent, structured understanding of a design system.
DeepSeek-native AI coding agent for your terminal. Engineered around prefix-cache stability — leave it running.
AG-UI: the Agent-User Interaction Protocol. Bring Agents into Frontend Applications.
Structured Outputs
Browser Harness | Self-healing harness that enables LLMs to complete any task.
"OpenHarness: Open Agent Harness with a Built-in Personal Agent--Ohmo!"
Fully autonomous & self-evolving research from idea to paper. Chat an Idea. Get a Paper. 🦞
Open Source framework for voice and multimodal conversational AI
Open-source, secure environment with real-world tools for enterprise-grade agents.
Secure, Fast, and Extensible Sandbox runtime for AI agents.
Multi-Agent Harness for Production AI