bug: LLM grader reports 'no response provided' when system prompt is present in input #982

@christso

Description

Bug Description

When running evals with a system prompt in the input messages, the LLM grader frequently reports "No response content was provided to evaluate" even when the model actually generated a correct response. This was discovered while benchmarking the `with-superpowers` and `without-superpowers` experiments.

Reproduction

  1. Create an eval with a system message followed by a user message:

```yaml
tests:
  - id: test-1
    input:
      - role: system
        content: "You are a helpful assistant that thinks step by step..."
      - role: user
        content: "Solve this logic puzzle..."
    assertions:
      - type: llm-grader
        prompt: "Check if the response correctly solves the puzzle"
```
  2. Run against multiple targets:

```shell
agentv eval test.EVAL.yaml --target azure --experiment with-system-prompt
agentv eval test.EVAL.yaml --target gemini --experiment with-system-prompt
```
  3. Observe that the LLM grader returns low scores with reasoning like "No response content was provided to evaluate" even when the model's actual response is correct.

Observed behavior

| Experiment | Target | Actual Response | Grader Score | Grader Reasoning |
| --- | --- | --- | --- | --- |
| without-superpowers | gemini | Correct (A=Knave, B=Knave) | 1.0 | Correct analysis |
| with-superpowers | gemini | Correct (A=Knave, B=Knave) | 0.5 | "No response text was provided" |
| without-superpowers | azure | Correct | 0.97 | Good solution |
| with-superpowers | azure | Correct | 0.0 | "No solution steps were included" |

The `contains` evaluator confirms that the expected content IS present in the response. The issue is specific to the LLM grader's prompt construction when system messages are in the input.
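For context, a `contains` check can be run alongside the grader on the same test; a sketch extending the example config above (the exact field names for the `contains` evaluator are an assumption, not confirmed against the AgentV docs):

```yaml
assertions:
  - type: contains           # assumed evaluator name, as referenced in this report
    value: "A=Knave"         # hypothetical expected substring
  - type: llm-grader
    prompt: "Check if the response correctly solves the puzzle"
```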

Expected behavior

The LLM grader should correctly receive and evaluate the model's response regardless of whether the input includes system messages.

Environment

Likely cause

The LLM grader prompt construction in `packages/core/src/evaluation/evaluators/llm-grader.ts` may not correctly handle multi-turn inputs with system messages when building the evaluation context for the judge model.
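A minimal sketch of how such a bug could arise, assuming the grader looks up the model response by position in the message array (all names here are illustrative, not taken from the AgentV source):

```typescript
// Hypothetical reproduction of the suspected failure mode: if the grader
// prompt builder assumes a fixed message layout (user at index 0, response
// at index 1), a leading system message shifts the indices and the response
// lookup comes back empty.
type Message = { role: "system" | "user" | "assistant"; content: string };

// Fragile: breaks as soon as a system message precedes the user turn.
function getResponseNaive(messages: Message[]): string {
  return messages[1]?.role === "assistant" ? messages[1].content : "";
}

// Robust: scan for the last assistant message regardless of preceding roles.
function getResponseRobust(messages: Message[]): string {
  for (let i = messages.length - 1; i >= 0; i--) {
    if (messages[i].role === "assistant") return messages[i].content;
  }
  return "";
}

const transcript: Message[] = [
  { role: "system", content: "You are a helpful assistant..." },
  { role: "user", content: "Solve this logic puzzle..." },
  { role: "assistant", content: "A is a Knave and B is a Knave." },
];

console.log(getResponseNaive(transcript));  // "" -> grader sees no response
console.log(getResponseRobust(transcript)); // the actual answer
```

With the system message present, the naive lookup returns an empty string, which would explain the grader reasoning "No response content was provided to evaluate" despite a correct answer in the transcript.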

Metadata

Labels: bug (Something isn't working), core (Anything pertaining to core functionality of AgentV)

Status: Done
