[bot] Streaming aggregator collapses multiple choices (n>1) into a single output

## Summary

The `BraintrustStream` streaming aggregator merges all choice indices into a single `OutputChoice` at index 0. When a Chat Completions request uses `n > 1` to generate multiple parallel completions, the streamed chunks from different choice indices are concatenated into one string, producing a corrupted single output instead of preserving each choice separately.

## What is missing

OpenAI Chat Completions streaming with `n > 1` sends chunks tagged with a `choices[].index` field to distinguish parallel completions:

```json
{"choices": [{"index": 0, "delta": {"role": "assistant", "content": "Hello"}}]}
{"choices": [{"index": 1, "delta": {"role": "assistant", "content": "Hi"}}]}
{"choices": [{"index": 0, "delta": {"content": " world"}}]}
{"choices": [{"index": 1, "delta": {"content": " there"}}]}
```

The expected aggregated output should be two choices:
- Choice 0: `"Hello world"`
- Choice 1: `"Hi there"`

Currently in `src/stream.rs:727-807`, the `aggregate()` function:

1. **Ignores the choice `index`** — loops over all `chunk.choices` (line 771) but accumulates into a single `aggregated_content: String` and single `role: Option<String>` regardless of which choice index the delta belongs to
2. **Hardcodes a single output** — creates one `OutputChoice` at index 0 (line 804) and wraps it in a single-element vector (line 806)
3. **Corrupts content** — text from choice 0 and choice 1 are interleaved into one string (`"Hello Hi world there"`)
4. **Loses finish reasons** — only the last `finish_reason` seen across all choices is kept (line 773-777), discarding per-choice finish reasons

The fix would involve:
- Tracking per-index state (content, role, tool_calls, finish_reason) using a `HashMap<usize, ChoiceState>` or similar
- Building one `OutputChoice` per distinct index seen in the stream
- Returning all choices in the `FinalizedStream::output` vector

## Braintrust docs status

**supported** — Braintrust's streaming documentation states that streaming responses are fully supported and automatically aggregated. The OpenAI integration docs do not specifically mention `n > 1` handling. Status: **unclear** for multi-choice streaming specifically.

- https://www.braintrust.dev/docs/integrations/ai-providers/openai
- https://www.braintrust.dev/docs/instrument/trace-llm-calls

## Upstream sources

- OpenAI Chat Completions `n` parameter: https://platform.openai.com/docs/api-reference/chat/create — "`n`: How many chat completion choices to generate for each input message"
- OpenAI streaming format uses `choices[].index` to distinguish parallel completions: https://platform.openai.com/docs/api-reference/chat/streaming
- The `index` field is present on every streamed choice delta

## Relationship to existing issues

- **Distinct from #30** (streaming tool call aggregation): #30 is about missing `tool_calls` field in `StreamDelta`. This issue is about the aggregation logic ignoring the choice `index`, which affects all content (text, tool_calls, finish_reason) across multiple choices.
- **Distinct from #32** (streaming usage detail loss): #32 is about missing fields in `StreamUsage`. This issue is about structural aggregation logic.
- **Distinct from #41** (StreamUsage serde alias bug): #41 is about field name aliasing. This is about multi-choice index tracking.

## Local files inspected

- `src/stream.rs:727-807` — `aggregate()` uses single `aggregated_content`, `role`, `finish_reason` variables for all choices; creates one `OutputChoice` at index 0 on line 804
- `src/stream.rs:658-664` — `StreamChoice` struct does have an implicit index from the array position but no explicit `index` field parsed from the JSON
- `src/stream.rs:354-395` — `OutputChoice` struct supports an `index` field, so the output type can represent multiple choices
- `src/stream.rs:530-540` — `FinalizedStream` stores `output: Vec<OutputChoice>`, capable of holding multiple choices

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[bot] Streaming aggregator collapses multiple choices (n>1) into a single output #48

Summary

What is missing

Braintrust docs status

Upstream sources

Relationship to existing issues

Local files inspected

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[bot] Streaming aggregator collapses multiple choices (n>1) into a single output #48

Description

Summary

What is missing

Braintrust docs status

Upstream sources

Relationship to existing issues

Local files inspected

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions