vibrantlabsai/ragas
Supercharge Your LLM Application Evaluations 🚀
RAG evaluation without the need for "golden answers"
Supercharge Your LLM Application Evaluations 🚀
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, llm and tools tracing, debugging multi-agentic system, self-hosted dashboard and advanced analytics with timeline and execution graph view
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
The LLM Evaluation Framework
Dingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool
2 captures since 2026-05-23
AI agent config detected
Key config paths
CLAUDE.md