Bench/context sweep snapshot #627
Open
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
What
Add env-gated matmul shape aggregation (
MLLM_MATMUL_SHAPE_LOG) to report top GEMM/SGEMM shapes by estimated FLOPs.Add
sweep_context_v2.shto run context sweep (prefill TTFT + decode-heavy).Add snapshot generator script (
make_snapshot_nopandas.py) and ship a ready-to-share snapshot bundle:bench_artifacts/data/*.csv)bench_artifacts/plots/*.png)bench_artifacts/snapshot.md)bench_artifacts/README.md)Why
How to reproduce
bench_artifacts/README.md.