GitHub projects from awesome lists
Search names, descriptions, topics, tags, and stacks, then tune results by ecosystem, freshness, health, and cross-list signal.
Official Compound Engineering plugin for Claude Code, Codex, Cursor, and more
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
Open source agentic operating system
Build reliable customer-facing AI agents with Parlant: an interaction control harness optimized for controlled, consistent, and predictable LLM interactions.
#1 Persistent memory for AI coding agents based on real-world benchmarks
Open-source Agent Operating System
AI Agent Framework, the Pydantic way
Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows).
Python SDK for an agent AI observability, monitoring, and evaluation framework. Includes agent, LLM, and tool tracing, multi-agent system debugging, a self-hosted dashboard, and advanced analytics with timeline and execution-graph views.
A comprehensive collection of Agent Skills for context engineering, multi-agent architectures, and production agent systems. Use when building, optimizing, or debugging agent systems that require effective context management.
The LLM Evaluation Framework
Context window optimization for AI coding agents. Sandboxes tool output, 98% reduction. 15 platforms
Autonomous multi-session AI coding
Browser Harness | Self-healing harness that enables LLMs to complete any task.
Eigent: The Open Source Cowork Desktop to Unlock Your Exceptional Productivity. Local and Free Alternative to Claude Cowork.
Supercharge Your LLM Application Evaluations 🚀
OpenHarness: Open Agent Harness with a Built-in Personal Agent, Ohmo!
A framework for few-shot evaluation of language models.
Open-source, secure environment with real-world tools for enterprise-grade agents.
IronClaw is an Agent OS focused on privacy, security and extensibility
A blazing fast AI Gateway with integrated guardrails. Route to 1,600+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.
Agent S: an open agentic framework that uses computers like a human
TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.
Code Editor for the AI Agents Era - Run an army of Claude Code, Codex, etc. on your machine
Secure, Fast, and Extensible Sandbox runtime for AI agents.
A framework for building, orchestrating and deploying AI agents and multi-agent workflows with support for Python and .NET.
GitHub Copilot CLI brings the power of Copilot coding agent directly to your terminal.
Multi-Agent Harness for Production AI
AI Observability & Evaluation