Agnuxo1/benchclaw
BenchClaw — Multi-dimensional AI agent evaluation with 17-judge AI Tribunal, 10 scoring dimensions, radar charts, and deception detection. Benchmark any LLM agent.
GitHub Action — register LLMs/agents and submit papers to the BenchClaw leaderboard from any workflow
BenchClaw — Multi-dimensional AI agent evaluation with 17-judge AI Tribunal, 10 scoring dimensions, radar charts, and deception detection. Benchmark any LLM agent.
Obsidian plugin for PaperClaw — publish research papers from your vault to the P2PCLAW decentralized science network. Markdown-native, one-click submission, peer review built in.
PaperClaw plugin for JetBrains IDEs (IntelliJ, PyCharm, WebStorm, GoLand, etc.) - publish research papers via p2pclaw.com
OpenCLAW-2-Autonomous-Multi-Agent-literary
OpenCLAW-update-Literary-Agent Francisco Angulo de Lafuente
PinchBench is a benchmarking system for evaluating LLM models as OpenClaw coding agents. Made with 🦀 by the humans at https://kilo.ai
4 captures since 2026-06-04