feat: attention visualization script for player encoder transformer steps by Copilot · Pull Request #94 · spktrm/porygon2

Copilot · 2026-05-09T06:22:34Z

Summary

Adds a script (scripts/viz_attention.py) that visualises the per-head attention scores at every transformer step inside the player encoder, making attention bottlenecks immediately visible.

Changes

`rl/model/modules.py`

Added two sow calls inside MultiHeadAttention.__call__ after computing attn_probs and attn_entropy:

self.sow("intermediates", "attn_weights", attn_probs.astype(jnp.float32))
self.sow("intermediates", "attn_entropy", jnp.nan_to_num(attn_entropy, nan=0.0).astype(jnp.float32))

These are zero-overhead no-ops during normal training/inference — Flax only writes them when mutable=['intermediates'] is explicitly passed to apply.

`scripts/viz_attention.py`

New standalone visualisation script that:

Accepts --ckpt, --generation, --output, --traj-step CLI arguments
Loads a single game-state example via get_ex_player_step()
Calls Encoder._embed_local_timestep directly (bypassing vmap) to capture local_timestep_decoder attention
Calls Encoder._batched_forward directly (bypassing the outer vmap) to capture input_decoder, history_decoder, and all state_transformer attention steps
Produces per-head heatmaps (PNG) for all 9 MHA calls across 5 transformer components:
- local_timestep_decoder – field tokens → relevant entity/edge tokens
- input_decoder – latent queries → current game-state tokens
- history_decoder – latent queries → past timestep embeddings
- state_transformer_kv_enc (×2 layers) – self-attention on the latent KV sequence
- state_transformer_q_self (×2 layers) – self-attention on output-state tokens
- state_transformer_q_cross (×2 layers) – cross-attention output-state → latent
Produces an entropy summary bar chart (attn_entropy_summary.png) — bars close to 0 indicate heads with highly concentrated (bottlenecked) attention

Usage

# Random params (good for checking shapes)
python -m scripts.viz_attention

# With a checkpoint, custom output directory, specific trajectory step
python -m scripts.viz_attention \
    --ckpt ./ckpts/gen9/ckpt_0001.pkl \
    --output ./my_viz \
    --traj-step 2 \
    --generation 9

…tiHeadAttention Agent-Logs-Url: https://github.com/spktrm/porygon2/sessions/167e84cb-6d94-461d-960e-70f4dda5197f Co-authored-by: spktrm <72776130+spktrm@users.noreply.github.com>

Agent-Logs-Url: https://github.com/spktrm/porygon2/sessions/167e84cb-6d94-461d-960e-70f4dda5197f Co-authored-by: spktrm <72776130+spktrm@users.noreply.github.com>

Copilot AI and others added 2 commits May 9, 2026 06:19

feat: add attention visualization script and sow intermediates in Mul…

255fe74

…tiHeadAttention Agent-Logs-Url: https://github.com/spktrm/porygon2/sessions/167e84cb-6d94-461d-960e-70f4dda5197f Co-authored-by: spktrm <72776130+spktrm@users.noreply.github.com>

fix: move Patch import to module level, fix docstring truncation

04b60c9

Agent-Logs-Url: https://github.com/spktrm/porygon2/sessions/167e84cb-6d94-461d-960e-70f4dda5197f Co-authored-by: spktrm <72776130+spktrm@users.noreply.github.com>

Copilot AI assigned Copilot and spktrm May 9, 2026

Copilot created this pull request from a session on behalf of spktrm May 9, 2026 06:23 View session

Copilot AI requested a review from spktrm May 9, 2026 06:23

Copilot finished work on behalf of spktrm May 9, 2026 06:23

better evidence for better ally enemy targeting

7398dd5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: attention visualization script for player encoder transformer steps#94

feat: attention visualization script for player encoder transformer steps#94
Copilot wants to merge 3 commits into
better_arch_v2from
copilot/visualize-attention-scores

Copilot AI commented May 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Copilot AI commented May 9, 2026

Summary

Changes

rl/model/modules.py

scripts/viz_attention.py

Usage

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

`rl/model/modules.py`

`scripts/viz_attention.py`