RAG retrieval benchmark runner with JSON reports, Pareto plots, and regression gates for retrieval quality changes.
python nlp benchmark information-retrieval embeddings regression-testing bm25 reranking quality-gates rag llm llmops retrieval-augmented-generation retrieval-evaluation retrieval-benchmark
Updated May 20, 2026 - Python