You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: eval_protocol/types/types.py
+8-6Lines changed: 8 additions & 6 deletions
Original file line number
Diff line number
Diff line change
@@ -12,8 +12,8 @@ class TerminationReason(str, Enum):
12
12
MAX_STEPS: Trajectory ends because we hit the step limit
13
13
CONTROL_PLANE_SIGNAL: Trajectory ends because the control plane signals termination (e.g. env goal reached or failure condition)
14
14
USER_STOP: Trajectory ends because the simulated user signals to stop
15
-
ERROR: Trajectory ends because of an error
16
-
INTERRUPTED: Trajectory is interrupted by some non-system related reason (e.g. policy returns unexpected response and we need to terminate the rollout).
15
+
SKIPPABLE_ERROR: Trajectory ends because of an error, this trajectory can be discarded/skipped during postprocessing/evaluation.
16
+
NON_SKIPPABLE_ERROR: Trajectory is interrupted due to some non-skippable error (e.g. policy returns unexpected response and we need to terminate the rollout).
17
17
STOP: Trajectory ends by the policy (mapped to llm response stop reason "stop")
18
18
LENGTH: Trajectory ends by the policy (mapped to llm response stop reason "length")
19
19
TOOL_CALLS: Trajectory ends by the policy with a hanging tool call response (mapped to llm response stop reason "tool_calls")
@@ -22,8 +22,8 @@ class TerminationReason(str, Enum):
0 commit comments