Commit fb1792e

Logging UI Changes (#94)
1 parent d9a5516 commit fb1792e

8 files changed

+138
-137
lines changed

vite-app/dist/assets/index-BXWyy1QT.js

Lines changed: 0 additions & 131 deletions
This file was deleted.

vite-app/dist/assets/index-BXWyy1QT.js.map

Lines changed: 0 additions & 1 deletion
This file was deleted.

vite-app/dist/assets/index-CHd5YP8V.css

Lines changed: 1 addition & 0 deletions
Generated file; diff not rendered by default.

vite-app/dist/assets/index-S2Cps9N5.js

Lines changed: 131 additions & 0 deletions
Generated file; diff not rendered by default.

vite-app/dist/assets/index-S2Cps9N5.js.map

Lines changed: 1 addition & 0 deletions
Generated file; diff not rendered by default.

vite-app/dist/assets/index-SA8VJz3D.css

Lines changed: 0 additions & 1 deletion
This file was deleted.

vite-app/dist/index.html

Lines changed: 2 additions & 2 deletions
@@ -5,8 +5,8 @@
 <meta name="viewport" content="width=device-width, initial-scale=1.0" />
 <title>EP | Log Viewer</title>
 <link rel="icon" href="/assets/favicon-BkAAWQga.png" />
-<script type="module" crossorigin src="/assets/index-BXWyy1QT.js"></script>
-<link rel="stylesheet" crossorigin href="/assets/index-SA8VJz3D.css">
+<script type="module" crossorigin src="/assets/index-S2Cps9N5.js"></script>
+<link rel="stylesheet" crossorigin href="/assets/index-CHd5YP8V.css">
 </head>
 <body>
 <div id="root"></div>

vite-app/src/types/eval-protocol.ts

Lines changed: 3 additions & 2 deletions
@@ -51,8 +51,9 @@ export const EvaluateResultSchema = z.object({
 step_outputs: z.array(StepOutputSchema).optional().describe('For RL, a list of outputs for each conceptual step, providing base rewards.'),
 error: z.string().optional().describe('Optional error message if the evaluation itself encountered an issue.'),
 trajectory_info: z.record(z.string(), z.any()).optional().describe('Additional trajectory-level information (duration, steps, termination_reason, etc.).'),
-final_control_plane_info: z.record(z.string(), z.any()).optional().describe('The final control plane state that led to termination.')
-});
+final_control_plane_info: z.record(z.string(), z.any()).optional().describe('The final control plane state that led to termination.'),
+agg_score: z.number().optional().describe('The aggregated score of the evaluation across all runs.'),
+standard_error: z.number().optional().describe('The standard error of the evaluation across all runs.')});
 
 export const CompletionParamsSchema = z.record(z.string(), z.any());
 
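The two new optional fields, `agg_score` and `standard_error`, summarize an evaluation across all runs. The commit does not show how they are computed, so the following is only a minimal sketch under stated assumptions: `agg_score` is taken to be the mean of per-run scores, and `standard_error` the standard error of that mean (sample standard deviation divided by the square root of the run count). The helper `aggregateRuns` and the `AggregateFields` interface are hypothetical names, not part of the repository.

```typescript
// Hypothetical helper, not part of the commit.
// Assumption: agg_score is the mean of per-run scores and
// standard_error is the standard error of that mean.
interface AggregateFields {
  agg_score?: number;
  standard_error?: number;
}

function aggregateRuns(scores: number[]): AggregateFields {
  const n = scores.length;
  // Both fields are optional in the schema, so no runs yields no fields.
  if (n === 0) return {};

  const mean = scores.reduce((sum, s) => sum + s, 0) / n;
  // With a single run the spread is undefined, so only the score is set.
  if (n === 1) return { agg_score: mean };

  // Sample variance with Bessel's correction (divide by n - 1).
  const variance =
    scores.reduce((sum, s) => sum + (s - mean) * (s - mean), 0) / (n - 1);

  return { agg_score: mean, standard_error: Math.sqrt(variance / n) };
}
```

For example, `aggregateRuns([0.5, 1.0])` returns `{ agg_score: 0.75, standard_error: 0.25 }`. Leaving `standard_error` unset for a single run matches the `.optional()` declaration in the schema, though the actual backend may behave differently.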