Sign in
← Back to search
Stars
15,854
Forks
1,487
Commits
9548
Language
Python
Awesome lists
5

Similar repositories

huggingface/lighteval

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

2428 stars
Python 2 awesome lists

evalplus/evalplus

Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

1749 stars
Python 2 awesome lists

hidai25/eval-view

Regression testing for AI agents. Snapshot behavior,diff tool calls,catch regressions in CI. Works with LangGraph, CrewAI, OpenAI, Anthropic.

112 stars
Python 1 awesome list

confident-ai/deepteam

DeepTeam is a framework to red team LLMs and LLM systems.

1817 stars
Python 1 awesome list

EvolvingLMMs-Lab/lmms-eval

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

4157 stars
Python 1 awesome list

MigoXLab/dingo

Dingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool

702 stars
Python 2 awesome lists

Tracked growth

6 captures since 2026-05-23

Latest capture 2026-06-02 06:56

Stars history

Total stars

Commits history

Default branch commits

Detected stack

Frameworks and tools

  • LangChain · ai framework · high confidence
  • LlamaIndex · ai framework · high confidence
  • Next.js · web framework · high confidence
  • pytest · test framework · high confidence
  • React · frontend framework · high confidence
  • Tailwind CSS · css framework · high confidence
npm Poetry Yarn

Dependency files

  • pyproject.toml · python · 57 dependencies
  • poetry.lock · python · 0 dependencies
  • docs/package.json · javascript · 26 dependencies
  • docs/yarn.lock · javascript · 512 dependencies

Metadata

  • Created: 2023-08-10
  • First commit: 2023-08-10
  • Last pushed: 2026-06-01
  • Website: https://deepeval.com
  • Archived: no
  • Stack detected: 2026-06-02 06:56
  • License: Apache-2.0

AI development signals

No AI development config files detected.