Sign in
← Back to search
github Active

Repository profile

JudgmentLabs/judgeval

The Continuous-Improvement Stack for Agents. Our environment data and evals power agent improvement and monitoring.

Python Apache-2.0 main Stack scanned README.md
Stars
1,037
Forks
93
Watchers
7
Issues
18
Commits
1,757
Awesome lists
1

Activity and growth

Tracked growth, recent movement, and commit velocity from stored repository snapshots.

Latest capture 2026-06-12 10:50

Star growth, last 7 days
0 0.0%
Commit velocity, last 7 days
0 0.0%
Stars since baseline
0
Snapshot coverage
1

Tracked growth

1 capture since 2026-06-12

Stars from baseline 0

Stars history

Total stars

Commits history

Default branch commits

Detected stack

Frameworks, package managers, ecosystems, and dependency manifests found during catalog scans.

Scanned 2026-06-12 10:50

Stack signals
4
Package managers
1
Manifest files
11
Dependencies
144

Frameworks and tools

  • FastAPI web framework · high confidence
  • pytest test framework · high confidence
  • Starlette web framework · medium confidence
  • Streamlit app framework · high confidence
uv python

Dependency files

11 manifests
  • pyproject.toml python ecosystem, 37 dependencies
  • uv.lock python ecosystem, 0 dependencies
  • examples/basic-distributed-tracing/pyproject.toml python ecosystem, 4 dependencies
  • examples/basic-evaluation/pyproject.toml python ecosystem, 1 dependency
  • examples/basic-linked-trace/pyproject.toml python ecosystem, 1 dependency
  • examples/basic-tracing/pyproject.toml python ecosystem, 2 dependencies
  • examples/claude-agent-sdk/pyproject.toml python ecosystem, 2 dependencies
  • examples/google-adk/pyproject.toml python ecosystem, 2 dependencies
  • 3 more files

Classification

Searchable topics, generated tags, and stack labels that explain where this repository fits.

Topics
15
Tags
0
Stacks
4

AI development signals

Agent instructions and tool configuration paths found in the repository tree.

0 paths
No AI development config files detected.

Similar repositories

Nearest indexed repositories by embedding similarity.

awslabs/agent-evaluation

A generative AI-powered framework for testing virtual agents.

366 stars
Python 1 awesome list

vostride/agent-qa

The self-improving Agentic QA harness with Memory. Write tests in natural language.
 Catch regressions before releases ship.

90 stars
TypeScript 1 awesome list

truera/trulens

Evaluation and Tracking for LLM Experiments and AI Agents

3,381 stars
Python 2 awesome lists

Agnuxo1/benchclaw

BenchClaw — Multi-dimensional AI agent evaluation with 17-judge AI Tribunal, 10 scoring dimensions, radar charts, and deception detection. Benchmark any LLM agent.

5 stars
HTML 0 awesome lists

kayba-ai/agentic-context-engine

🧠 Make your agents learn from experience. Now available as a hosted solution at kayba.ai

2,344 stars
Python 2 awesome lists