Sign in
← Back to search

LiveBench/LiveBench

LiveBench: A Challenging, Contamination-Free LLM Benchmark

Stars
1,179
Forks
108
Commits
364
Language
Python
Awesome lists
1

Similar repositories

openai/mle-bench

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering

1540 stars
Python 1 awesome list

EvolvingLMMs-Lab/lmms-eval

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

4157 stars
Python 1 awesome list

THUDM/AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

3451 stars
Python 4 awesome lists

Tracked growth

1 capture since 2026-05-25

Latest capture 2026-05-25 21:04

Stars history

Total stars

Commits history

Default branch commits

Metadata

  • Created: 2024-06-12
  • First commit: —
  • Last pushed: 2026-05-22
  • Archived: no
  • Stack detected: —
  • License: NOASSERTION

AI development signals

No AI development config files detected.