 [![PyPI - Version](https://img.shields.io/pypi/v/eval-protocol)](https://pypi.org/project/eval-protocol/)

-EP is an open protocol that standardizes how developers author evals for large
-language model (LLM) applications.
+EP is an open specification, Python SDK, and pytest wrapper that provides a
+standardized way to write evaluations for large language model (LLM)
+applications. Start with simple single-turn evals for model selection and prompt
+engineering, then scale up to complex multi-turn reinforcement learning (RL) for
+agents using Model Context Protocol (MCP). EP ensures consistent patterns for
+writing evals, storing traces, and saving results, enabling you to build
+sophisticated agent evaluations that work across real-world scenarios, from
+markdown generation tasks to customer service agents with tool calling
+capabilities.

 ## Quick Example