← Back to search

github Active

Repository profile

openai/evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python NOASSERTION main Stack scanned README.md

Open GitHub

Stars: 18,916
Forks: 3,024
Watchers: 281
Issues: 218
Commits: 691
Awesome lists: 4

Repository updates

Get generated openai/evals development summaries by email, or follow the weekly and monthly RSS feeds.

Weekly RSS Monthly RSS

Activity and growth

Tracked growth, recent movement, and commit velocity from stored repository snapshots.

Latest capture 2026-07-15 03:16

Star growth, last 7 days: 0 0.0%
Commit velocity, last 7 days: 0 0.0%
Stars since baseline: +385
Snapshot coverage: 6

Tracked growth

6 captures since 2026-05-25

Stars from baseline +385

Time horizon

All tracked data

Custom start Custom end

Stars history

Total stars

Commits history

Default branch commits

Detected stack

Frameworks, package managers, ecosystems, and dependency manifests found during catalog scans.

Scanned 2026-07-15 03:16

Stack signals: 0
Package managers: 2
Manifest files: 13
Dependencies: 0

Frameworks and tools

No framework dependencies detected.

PEP 517 pip python

Dependency files

13 manifests

pyproject.toml python ecosystem, 0 dependencies
evals/elsuite/hr_ml_agent_bench/requirements.txt python ecosystem, 0 dependencies
evals/solvers/providers/google/requirements.txt python ecosystem, 0 dependencies
evals/elsuite/multistep_web_tasks/docker/homepage/requirements.txt python ecosystem, 0 dependencies
evals/elsuite/steganography/scripts/dataset/requirements.txt python ecosystem, 0 dependencies
evals/elsuite/text_compression/scripts/dataset/requirements.txt python ecosystem, 0 dependencies
evals/elsuite/hr_ml_agent_bench/benchmarks/bipedal_walker/scripts/requirements.txt python ecosystem, 0 dependencies
evals/elsuite/hr_ml_agent_bench/benchmarks/cartpole/scripts/requirements.txt python ecosystem, 0 dependencies
5 more files

Classification

Searchable topics, generated tags, and stack labels that explain where this repository fits.

Topics: 0
Tags: 0
Stacks: 0

Topics

No topics indexed.

Generated tags

No generated tags yet.

Stack labels

No stack labels yet.

AI development signals

Agent instructions and tool configuration paths found in the repository tree.

0 paths

No AI development config files detected.

Similar repositories

Nearest indexed repositories by embedding similarity.

openai/simple-evals

No description.

4,570 stars

Python 3 awesome lists

evidentlyai/evidently

Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.

7,693 stars

Jupyter Notebook 4 awesome lists

huggingface/lighteval

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

2,476 stars

Python 2 awesome lists

hidai25/eval-view

Regression testing for AI agents. Snapshot behavior,diff tool calls,catch regressions in CI. Works with LangGraph, CrewAI, OpenAI, Anthropic.

121 stars

Python 1 awesome list

modelscope/evalscope

A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.

3,078 stars

Python 2 awesome lists

evalplus/evalplus

Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

1,770 stars

Python 2 awesome lists

Metadata

Language: Python
License: NOASSERTION
Default branch: main
Created: 2023-01-23
First commit: 2023-03-14
Last pushed: 2026-04-14
GitHub updated: 2026-07-14
Last synced: 2026-07-15 03:16
Stack detected: 2026-07-15 03:16
Archived: no

Links and files

GitHub README

403 Forbidden | https://api.github.com/repos/openai/evals/readme | message=API rate limit exceeded for user ID 260990068. If you reach out to GitHub Support for help, please include the request ID AD42:219112:DA8BF2E:CFAEE82:6A56FB8E and timestamp 2026-07-15 03:16:30 UTC. For more on scraping GitHub and how it may affect your rights, please review our Terms of Service (htt | rate_limit_remaining=0 | rate_limit_reset=1784088007

openai/evals

Activity and growth

Tracked growth

Time horizon

Stars history

Commits history

Detected stack

Frameworks and tools

Dependency files

Classification

Topics

Generated tags

Stack labels

AI development signals

Similar repositories

openai/simple-evals

evidentlyai/evidently

huggingface/lighteval

hidai25/eval-view

modelscope/evalscope

evalplus/evalplus

Metadata

Links and files

Appears in

How it works

Pricing

Follow repository updates

Activity and growth

Tracked growth

Time horizon

Stars history

Commits history

Detected stack

Frameworks and tools

Dependency files

Classification

Topics

Generated tags

Stack labels

AI development signals

Similar repositories

openai/simple-evals

evidentlyai/evidently

huggingface/lighteval

hidai25/eval-view

modelscope/evalscope

evalplus/evalplus

Metadata

Links and files

Appears in