Generate QA datasets & evaluate RAG systems with failure diagnosis. Any LLM.
{ "mcpServers": { "ragscore": { "command": "uvx", "args": [ "ragscore" ] } } }