Sign in
← Back to search

UKGovernmentBEIS/inspect_evals

Collection of evals for Inspect AI

Stars
512
Forks
334
Commits
2431
Language
Python
Awesome lists
1

Similar repositories

aliasrobotics/cai

Cybersecurity AI (CAI), the framework for AI Security

8809 stars
Python 2 awesome lists

evalplus/evalplus

Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

1749 stars
Python 2 awesome lists

SWE-bench/SWE-bench

SWE-bench: Can Language Models Resolve Real-world Github Issues?

5010 stars
Python 2 awesome lists

EgoAlpha/prompt-in-context-learning

Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates.

2236 stars
Jupyter Notebook 2 awesome lists

Tracked growth

1 capture since 2026-05-25

Latest capture 2026-05-25 20:58

Stars history

Total stars

Commits history

Default branch commits

Metadata

AI development signals

AI agent config detected

50 config paths 25 files 25 directories
Agent instructions Claude Code 48 Windsurf

Key config paths

  • dir .claude
  • dir .windsurf
  • file AGENTS.md
  • file CLAUDE.md
  • file src/inspect_evals/moru/CLAUDE.md
Review config paths
  • Claude Code .claude
  • Claude Code .claude/skills
  • Claude Code .claude/skills/build-repo-context
  • Claude Code .claude/skills/build-repo-context/SKILL.md
  • Claude Code .claude/skills/check-trajectories-workflow
  • Claude Code .claude/skills/check-trajectories-workflow/SKILL.md
  • Claude Code .claude/skills/ci-maintenance-workflow
  • Claude Code .claude/skills/ci-maintenance-workflow/SKILL.md
  • Claude Code .claude/skills/code-quality-fix-all
  • Claude Code .claude/skills/code-quality-fix-all/SKILL.md
  • Claude Code .claude/skills/code-quality-review-all
  • Claude Code .claude/skills/code-quality-review-all/assets
  • Claude Code .claude/skills/code-quality-review-all/assets/results-template.json
  • Claude Code .claude/skills/code-quality-review-all/SKILL.md
  • Claude Code .claude/skills/create-eval
  • Claude Code .claude/skills/create-eval/SKILL.md
  • Claude Code .claude/skills/ensure-test-coverage
  • Claude Code .claude/skills/ensure-test-coverage/references
  • Claude Code .claude/skills/ensure-test-coverage/references/test-patterns.md
  • Claude Code .claude/skills/ensure-test-coverage/SKILL.md
  • Claude Code .claude/skills/eval-quality-workflow
  • Claude Code .claude/skills/eval-quality-workflow/SKILL.md
  • Claude Code .claude/skills/eval-report-workflow
  • Claude Code .claude/skills/eval-report-workflow/references

Showing the first 24 paths. 26 more detected.