Add sparsity-aware modelling pipeline by fisherxue · Pull Request #33 · Accelergy-Project/accelforge

fisherxue · 2026-02-23T16:34:49Z

Sparse tensor modelling support, including sparsity propagation through memory hierarchy, compression formats and metadata costs, latency/energy adjustments for skipping/gating
Benchmarks validated against Sparseloop artifacts

…timization

…utilization model

…lity

…roduction suite

…tion

…atch project style

…st-hoc correction

Drain reads are included in psum_spad bandwidth in AccelForge but modeled at the NoC level in Sparseloop, causing an exact 13/12 overhead for conv3-5. Widen cycle tolerance for those layers accordingly. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Unwrap Optional (Union[X, None]) before calling get_args so that EvalableList and EvalableDict correctly find their inner type argument instead of failing with "Expected exactly one type argument". Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Convenience property that returns True when the optimization is position-skipping (self-conditioned), i.e. kind == "position_skipping" with no explicit condition_on target. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Add tile_shape dict and total_parent_fill_write_actions fields for use by the sparse pipeline's latency bandwidth calculations. Remove unused stride_and_halo argument from analyze_reuse call. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Updated execution timestamps and outputs for all 6 reproduction notebooks (fig1, fig12, fig13, fig15, lab4, table7). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

The latency model previously only used compute_latency_ratio to scale MAC cycles, which reduced cycles for both gating and skipping. Gated and skipped compute counts only went to the energy path, so the arch ERT's per-action latency had no effect on cycle computation. Now LatencyInfo carries gated_compute_count and skipped_compute_count, and run_model populates them in the Compute node's action table. This lets the arch control whether gated/skipped computes contribute cycles via their per-action latency (e.g., gated_compute: latency: 1 means gating does not reduce cycles, matching the conceptual model). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Adds InvalidMappingError and a validate parameter (default True) to evaluate_mapping so hand-crafted mappings are checked for memory capacity overflow (including metadata occupancy), spatial fanout violations, and architecture tile/loop constraints. The mapper's internal evaluate_mapping call passes validate=False since it already validates during tile enumeration. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fisherxue force-pushed the sparsity-support branch from 99e6989 to 0b8550e Compare March 3, 2026 02:37

fisherxue and others added 29 commits March 2, 2026 21:38

Add density model, format occupancy, and workload density support

8840a5e

Add sparse frontend spec parsing, occupancy, and access counts

7acc86a

Add SAF pipeline, propagation, and compute classification

a1b991a

Integrate sparse adjustment pipeline with model evaluation

9114fd0

Add sparse energy model with per-rank format metadata

ef3f495

Add sparse latency model, density sweep, energy fixes, and first-k op…

dcebf91

…timization

Add EyerissV2 PE-level reproduction (Fig 12) within 0.6%

baf2b6b

Add Eyeriss v1 spatial model reproduction (Table 7)

7914299

Add Fig 13 DSTC reproduction, Fig 15 STC configs, and position-space …

b933c67

…utilization model

Rerun all 6 reproduction notebooks with fresh outputs

f140ae9

Refactor sparse_adjustment.py and _compute_sparse_latency for readabi…

0497533

…lity

Add 'plotting' to __all__ in __init__.py

6bb4c11

Update __all__ to include 'roofline' and remove 'plotting'

35dd51e

Fix "Above" resolving to empty set

a78a32e

Run Black

d8eb886

GPT3 with KV cache

1cb86b7

Clean up tests: replace fig1-only regressions with 6-architecture rep…

dbb0b00

…roduction suite

Clean up internal comments and expand user-facing parameter documenta…

3723876

…tion

Fix 12 code review issues in sparse pipeline

7dd6370

Refactor sparse.py frontend: trim docstrings, add input validation, m…

138db9a

…atch project style

Refactor: move halo/stride reuse into Reservation mechanism

fe06089

Inline sparse config into arch YAML: add framework support

8baeecc

Migrate test configs to inline sparse, remove separate sparse YAML files

b2dbbb9

Update reproduction notebooks for inline sparse config

b6fca6f

Refactor: consolidate temporal-reuse and SAF-update helpers

7ded12d

Sparsity support, temporal reuse fix, and updated notebook takeaways

2aa1a20

Validate compute-level tiling and add temporal reuse test

d074d69

Refactor apply_sparse_adjustments into 5 pipeline phases

6931b59

Use uneven mapping in temporal reuse test

9c26ea2

fisherxue added 3 commits March 2, 2026 21:46

Refactor temporal reuse: fold fill/drain into regular actions with po…

e2befa7

…st-hoc correction

Remove _apply_temporal_reuse_corrections post-processing step

c8382cf

Add temporal reuse tracking for weight fills at shared_glb

2f88a78

fisherxue force-pushed the sparsity-support branch from 0b8550e to 2f88a78 Compare March 3, 2026 03:02

fisherxue and others added 14 commits March 3, 2026 00:29

Add is_self_conditioned property to ActionOptimization

ecba8d8

Convenience property that returns True when the optimization is position-skipping (self-conditioned), i.e. kind == "position_skipping" with no explicit condition_on target. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Re-run sparseloop reproduction notebooks with latest changes

fcd7cb2

Updated execution timestamps and outputs for all 6 reproduction notebooks (fig1, fig12, fig13, fig15, lab4, table7). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

infra readme

48e6e59

building everything for all platforms script

c67663d

all-infra should work

2c8508b

fixed misdefined extra infra

173e904

pushes all infra

5d01d33

fixed latest amend

8e3ad6e

fix arch specs for sparsity

4144a56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add sparsity-aware modelling pipeline#33

Add sparsity-aware modelling pipeline#33
fisherxue wants to merge 46 commits intoAccelergy-Project:mainfrom
fisherxue:sparsity-support

fisherxue commented Feb 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

fisherxue commented Feb 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants