github Active AI dev

Repository profile

pinchbench/skill

PinchBench is a benchmarking system for evaluating LLM models as OpenClaw coding agents. Made with 🦀 by the humans at https://kilo.ai

Python MIT main Stack scanned README.md

Open website Open GitHub

Stars: 1,279
Forks: 147
Watchers: 12
Issues: 21
Commits: 383
Awesome lists: 1

Repository updates

Get generated pinchbench/skill development summaries by email, or follow the weekly and monthly RSS feeds.

Weekly RSS Monthly RSS

Activity and growth

Tracked growth, recent movement, and commit velocity from stored repository snapshots.

Latest capture 2026-07-16 03:02

Star growth, last 7 days: 0 0.0%
Commit velocity, last 7 days: 0 0.0%
Stars since baseline: +83
Snapshot coverage: 5

Tracked growth

5 captures since 2026-05-25

Stars from baseline +83

Time horizon

All tracked data

Custom start Custom end

Stars history

Total stars

Commits history

Default branch commits

Detected stack

Frameworks, package managers, ecosystems, and dependency manifests found during catalog scans.

Scanned 2026-07-16 03:02

Stack signals: 1
Package managers: 1
Manifest files: 1
Dependencies: 9

Frameworks and tools

pytest test framework · high confidence

PEP 517 python

Dependency files

1 manifest

pyproject.toml python ecosystem, 9 dependencies

Classification

Searchable topics, generated tags, and stack labels that explain where this repository fits.

Topics: 0
Tags: 0
Stacks: 1

Topics

No topics indexed.

Generated tags

No generated tags yet.

Stack labels

pytest

AI development signals

Agent instructions and tool configuration paths found in the repository tree.

60 paths

AI agent config detected

60 config paths 49 files 11 directories

Agent workspace 60

Key config paths

dir .agents

Review config paths

Agent workspace .agents
Agent workspace .agents/skills
Agent workspace .agents/skills/building-dashboards
Agent workspace .agents/skills/building-dashboards/.meta
Agent workspace .agents/skills/building-dashboards/.meta/.gitkeep
Agent workspace .agents/skills/building-dashboards/README.md
Agent workspace .agents/skills/building-dashboards/reference
Agent workspace .agents/skills/building-dashboards/reference/chart-config.md
Agent workspace .agents/skills/building-dashboards/reference/chart-cookbook.md
Agent workspace .agents/skills/building-dashboards/reference/design-playbook.md
Agent workspace .agents/skills/building-dashboards/reference/layout-recipes.md
Agent workspace .agents/skills/building-dashboards/reference/metrics-mpl.md
Agent workspace .agents/skills/building-dashboards/reference/smartfilter.md
Agent workspace .agents/skills/building-dashboards/reference/splunk-migration.md
Agent workspace .agents/skills/building-dashboards/reference/templates
Agent workspace .agents/skills/building-dashboards/reference/templates/api-health.json
Agent workspace .agents/skills/building-dashboards/reference/templates/blank.json
Agent workspace .agents/skills/building-dashboards/reference/templates/org-usage-cost-control.json
Agent workspace .agents/skills/building-dashboards/reference/templates/service-overview-with-filters.json
Agent workspace .agents/skills/building-dashboards/reference/templates/service-overview.json
Agent workspace .agents/skills/building-dashboards/scripts
Agent workspace .agents/skills/building-dashboards/scripts/axiom-api
Agent workspace .agents/skills/building-dashboards/scripts/dashboard-chart-patch
Agent workspace .agents/skills/building-dashboards/scripts/dashboard-copy

Showing the first 24 paths. 36 more detected.

Similar repositories

Nearest indexed repositories by embedding similarity.

InternLM/WildClawBench

An in-the-wild benchmark for AI agents in the OpenClaw Environment.

475 stars

Python 1 awesome list

Agnuxo1/benchclaw

BenchClaw — Multi-dimensional AI agent evaluation with 17-judge AI Tribunal, 10 scoring dimensions, radar charts, and deception detection. Benchmark any LLM agent.

6 stars

HTML 0 awesome lists

claw-eval/claw-eval

Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.

716 stars

Python 2 awesome lists

TIGER-AI-Lab/ClawBench

Open-source benchmark for browser AI agents on daily tasks.

471 stars

Python 1 awesome list

TheAgentCompany/TheAgentCompany

An agent benchmark with tasks in a simulated software company.

740 stars

Python 1 awesome list

harbor-framework/terminal-bench

A benchmark for LLMs on complicated tasks in the terminal

2,450 stars

Python 2 awesome lists

Metadata

Language: Python
License: MIT
Default branch: main
Created: 2026-02-11
First commit: 2026-02-11
Last pushed: 2026-07-02
GitHub updated: 2026-07-15
Last synced: 2026-07-16 03:02
Stack detected: 2026-07-16 03:02
Archived: no

Links and files

GitHub Website

https://pinchbench.com

README

Appears in

Awesome Opensource Ai

pinchbench/skill

Activity and growth

Tracked growth

Time horizon

Stars history

Commits history

Detected stack

Frameworks and tools

Dependency files

Classification

Topics

Generated tags

Stack labels

AI development signals

Similar repositories

InternLM/WildClawBench

Agnuxo1/benchclaw

claw-eval/claw-eval

TIGER-AI-Lab/ClawBench

TheAgentCompany/TheAgentCompany

harbor-framework/terminal-bench

Metadata

Links and files

Appears in

How it works

Pricing

Follow repository updates

Activity and growth

Tracked growth

Time horizon

Stars history

Commits history

Detected stack

Frameworks and tools

Dependency files

Classification

Topics

Generated tags

Stack labels

AI development signals

Similar repositories

InternLM/WildClawBench

Agnuxo1/benchclaw

claw-eval/claw-eval

TIGER-AI-Lab/ClawBench

TheAgentCompany/TheAgentCompany

harbor-framework/terminal-bench

Metadata

Links and files

Appears in