google/BIG-bench

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

  • Stars: 3,242
  • Forks: 617
  • Commits: (not shown)
  • Language: Python
  • Awesome lists: 2

Similar repositories

LiveBench/LiveBench

LiveBench: A Challenging, Contamination-Free LLM Benchmark

1,179 stars · Python · 1 awesome list

pinchbench/skill

PinchBench is a benchmarking system for evaluating LLM models as OpenClaw coding agents. Made with 🦀 by the humans at https://kilo.ai

1,196 stars · Python · 1 awesome list

InternLM/WildClawBench

An in-the-wild benchmark for AI agents in the OpenClaw Environment.

407 stars · Python · 1 awesome list

Tracked growth

1 capture since 2026-05-27

Latest capture 2026-05-27 12:47

Stars history (chart: total stars)

Commits history (chart: default branch commits)

Metadata

  • Created: 2021-01-15
  • First commit: —
  • Last pushed: 2024-07-19
  • Archived: yes
  • Stack detected: —
  • License: Apache-2.0

AI development signals

No AI development config files detected.