Sign in
← Back to search
Stars
3,273
Forks
616
Commits
4183
Language
Python
Awesome lists
1

Similar repositories

openai/mle-bench

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering

1540 stars
Python 1 awesome list

google/BIG-bench

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

3242 stars
Python 2 awesome lists

EvolvingLMMs-Lab/lmms-eval

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

4157 stars
Python 1 awesome list

Tracked growth

1 capture since 2026-05-25

Latest capture 2026-05-25 20:56

Stars history

Total stars

Commits history

Default branch commits

Metadata

AI development signals

No AI development config files detected.