Sign in
← Back to search

safety-research/bloom

bloom - evaluate any behavior immediately  🌸🌱

Stars
1,332
Forks
169
Commits
234
Language
Python
Awesome lists
1

Similar repositories

LiveBench/LiveBench

LiveBench: A Challenging, Contamination-Free LLM Benchmark

1179 stars
Python 1 awesome list

claw-eval/claw-eval

Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.

632 stars
Python 2 awesome lists

allenai/olmocr

Toolkit for linearizing PDFs for LLM datasets/training

17353 stars
Python 1 awesome list

huggingface/lighteval

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

2428 stars
Python 2 awesome lists

Tracked growth

1 capture since 2026-05-25

Latest capture 2026-05-25 21:17

Stars history

Total stars

Commits history

Default branch commits

Metadata

  • Created: 2025-06-24
  • First commit: —
  • Last pushed: 2026-05-07
  • Archived: no
  • Stack detected: —
  • License: MIT

AI development signals

No AI development config files detected.