vibrantlabsai/ragas
Supercharge Your LLM Application Evaluations ๐
Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, llm and tools tracing, debugging multi-agentic system, self-hosted dashboard and advanced analytics with timeline and execution graph view
Supercharge Your LLM Application Evaluations ๐
๐ข Open-Source Evaluation & Testing library for LLM Agents
Adding guardrails to large language models.
DeepTeam is a framework to red team LLMs and LLM systems.
RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vector stores, and now includes seamless MCP integration to connect external tools and data sources.
RAG evaluation without the need for "golden answers"
1 capture since 2026-05-25