aliasrobotics/cai
Cybersecurity AI (CAI), the framework for AI Security
Collection of evals for Inspect AI
Cybersecurity AI (CAI), the framework for AI Security
Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024
SWE-bench: Can Language Models Resolve Real-world Github Issues?
[ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI
No description.
Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates.
1 capture since 2026-05-25
AI agent config detected
Key config paths
.claude
.windsurf
AGENTS.md
CLAUDE.md
src/inspect_evals/moru/CLAUDE.md
.claude
.claude/skills
.claude/skills/build-repo-context
.claude/skills/build-repo-context/SKILL.md
.claude/skills/check-trajectories-workflow
.claude/skills/check-trajectories-workflow/SKILL.md
.claude/skills/ci-maintenance-workflow
.claude/skills/ci-maintenance-workflow/SKILL.md
.claude/skills/code-quality-fix-all
.claude/skills/code-quality-fix-all/SKILL.md
.claude/skills/code-quality-review-all
.claude/skills/code-quality-review-all/assets
.claude/skills/code-quality-review-all/assets/results-template.json
.claude/skills/code-quality-review-all/SKILL.md
.claude/skills/create-eval
.claude/skills/create-eval/SKILL.md
.claude/skills/ensure-test-coverage
.claude/skills/ensure-test-coverage/references
.claude/skills/ensure-test-coverage/references/test-patterns.md
.claude/skills/ensure-test-coverage/SKILL.md
.claude/skills/eval-quality-workflow
.claude/skills/eval-quality-workflow/SKILL.md
.claude/skills/eval-report-workflow
.claude/skills/eval-report-workflow/references
Showing the first 24 paths. 26 more detected.