Sign in
← Back to search

EleutherAI/lm-evaluation-harness

A framework for few-shot evaluation of language models.

Stars
12,684
Forks
3,289
Commits
4023
Language
Python
Awesome lists
4

Similar repositories

EvolvingLMMs-Lab/lmms-eval

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

4157 stars
Python 1 awesome list

sgl-project/sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

28241 stars
Python 4 awesome lists

huggingface/lighteval

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

2428 stars
Python 2 awesome lists

modelscope/ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, Phi4, ...) (AAAI 2025).

14255 stars
Python 1 awesome list

eth-sri/lmql

A language for constraint-guided and efficient LLM programming.

4184 stars
Python 2 awesome lists

Tracked growth

3 captures since 2026-05-25

Latest capture 2026-05-25 20:56

Stars history

Total stars

Commits history

Default branch commits

Metadata

  • Created: 2020-08-28
  • First commit: —
  • Last pushed: 2026-05-11
  • Website: https://www.eleuther.ai
  • Archived: no
  • Stack detected: —
  • License: MIT

AI development signals

No AI development config files detected.