pinchbench/skill

PinchBench is a benchmarking system for evaluating LLMs as OpenClaw coding agents. Made with 🦀 by the humans at https://kilo.ai

  • Stars: 1,196
  • Forks: 131
  • Commits: 383
  • Language: Python
  • Awesome lists: 1

Similar repositories

InternLM/WildClawBench

An in-the-wild benchmark for AI agents in the OpenClaw Environment.

407 stars · Python · 1 awesome list

claw-eval/claw-eval

Claw-Eval is an evaluation harness for evaluating LLMs as agents. All tasks are verified by humans.

609 stars · Python · 2 awesome lists

TIGER-AI-Lab/ClawBench

Open-source benchmark for browser AI agents on 153 everyday online tasks across 144 live websites. 5-layer recording + DOM-match + LLM judge. Top score 33.3%.

347 stars · Python · 1 awesome list

ClawBio/ClawBio

🦖 ClawBio - The first bioinformatics-native AI agent skill library. Local-first. Reproducible. Built on OpenClaw.

870 stars · Python · 1 awesome list

Tracked growth

1 capture since 2026-05-25

Latest capture 2026-05-25 21:14

[Chart: Stars history, total stars]

[Chart: Commits history, default branch commits]

Metadata

  • Created: 2026-02-11
  • First commit: —
  • Last pushed: 2026-05-22
  • Website: https://pinchbench.com
  • Archived: no
  • Stack detected: —
  • License: MIT

AI development signals

AI agent config detected

60 config paths: 49 files, 11 directories
Agent workspace: 60

Key config paths

  • dir .agents

Review config paths

  • Agent workspace .agents
  • Agent workspace .agents/skills
  • Agent workspace .agents/skills/building-dashboards
  • Agent workspace .agents/skills/building-dashboards/.meta
  • Agent workspace .agents/skills/building-dashboards/.meta/.gitkeep
  • Agent workspace .agents/skills/building-dashboards/README.md
  • Agent workspace .agents/skills/building-dashboards/reference
  • Agent workspace .agents/skills/building-dashboards/reference/chart-config.md
  • Agent workspace .agents/skills/building-dashboards/reference/chart-cookbook.md
  • Agent workspace .agents/skills/building-dashboards/reference/design-playbook.md
  • Agent workspace .agents/skills/building-dashboards/reference/layout-recipes.md
  • Agent workspace .agents/skills/building-dashboards/reference/metrics-mpl.md
  • Agent workspace .agents/skills/building-dashboards/reference/smartfilter.md
  • Agent workspace .agents/skills/building-dashboards/reference/splunk-migration.md
  • Agent workspace .agents/skills/building-dashboards/reference/templates
  • Agent workspace .agents/skills/building-dashboards/reference/templates/api-health.json
  • Agent workspace .agents/skills/building-dashboards/reference/templates/blank.json
  • Agent workspace .agents/skills/building-dashboards/reference/templates/org-usage-cost-control.json
  • Agent workspace .agents/skills/building-dashboards/reference/templates/service-overview-with-filters.json
  • Agent workspace .agents/skills/building-dashboards/reference/templates/service-overview.json
  • Agent workspace .agents/skills/building-dashboards/scripts
  • Agent workspace .agents/skills/building-dashboards/scripts/axiom-api
  • Agent workspace .agents/skills/building-dashboards/scripts/dashboard-chart-patch
  • Agent workspace .agents/skills/building-dashboards/scripts/dashboard-copy

Showing the first 24 paths. 36 more detected.