We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent 6d27f8b commit 0b19932Copy full SHA for 0b19932
1 file changed
eval_protocol/benchmarks/ifeval/test_ifeval.py
@@ -77,7 +77,7 @@ def preprocess_row(self, row: EvaluationRow) -> EvaluationRow:
77
def test_ifeval_benchmark(row: EvaluationRow) -> EvaluationRow:
78
"""
79
Evaluate instruction-following constraints.
80
-
+
81
Returns partial credit score (0.0 to 1.0) representing the fraction
82
of constraints satisfied in the response.
83
0 commit comments