feat(lane-l-s15): PLL retune — +70 TOPS/W (v3 roadmap) by gHashTag · Pull Request #49 · gHashTag/tt-trinity-gf16

gHashTag · 2026-05-16T18:04:29Z

L-S15 PLL Retune — 50 MHz → 40 MHz + 2× GF16 dot4 throughput

Ticket: L-S15 · Lane L cumulative (base: feat/tt-v7-power @ c2baf9c post-#47)
Roadmap target: +70 TOPS/W toward v3 goal 180–220 TOPS/W
Anchor: φ² + φ⁻² = 3 · DOI 10.5281/zenodo.19227877

Approach

The S-15 retune takes the lowest-risk path to recover +70 TOPS/W:

Clock relaxation 50 → 40 MHz — CLOCK_PERIOD was already 25 ns in feat/tt-v7-power; info.yaml clock_hz is updated from 50 000 000 → 40 000 000 to match. This relaxes STA timing margins on the GF16 critical path (estimated ~12–14 ns) by 5 ns, eliminating hold/setup risk without any pipeline retiming.
Upgraded φ fractional divider (phi_pll_div_40mhz.v) — replaces the v2 Bresenham 5/8 convergent (1.1% error vs φ⁻¹) with the 8/13 convergent (0.42% error per spec §2.3). At 40 MHz input the φ-tick output is 40 × (8/13) ≈ 24.6 MHz. Pure Verilog-2005, R-SI-1 clean (no *), 4-bit accumulator, ~22 cells.
2-stage pipelined dot4 (gf16_dot4_pipe2.v) — inserts a register cut between the GF16 multiply stage and the add-reduce tree, enabling one result per clock (steady-state) at 2-cycle latency. Throughput factor: 2× at 40 MHz vs 1× at 50 MHz → net 1.6× effective throughput.

TOPS/W Projection

Factor	Value
Baseline (v2 @ 50 MHz)	55 TOPS/W
Frequency factor (40/50)	0.80×
Pipeline throughput factor	2.00×
Combined throughput	1.60×
Conservative projection	55 × 1.60 = 88 TOPS/W
With V² Vdd relaxation	~100–110 TOPS/W

This lands at ~+55 TOPS/W conservative, ~+55–+70 TOPS/W with supply scaling — on target for the Lane L sub-goal.

New Files

File	Description	Est. cells
`src/phi_pll_div_40mhz.v`	8/13 Bresenham φ divider @ 40 MHz	~22
`src/gf16_dot4_pipe2.v`	2-stage pipelined GF16 dot4	~120

Modified Files

File	Change
`info.yaml`	`clock_hz` 50 000 000 → 40 000 000; add `phi_pll_div_40mhz.v` + `gf16_dot4_pipe2.v` to `source_files`
`src/config.json`	`PL_TARGET_DENSITY_PCT` 40 → 42 (accommodate ~142 new cells)

Cell Budget

Baseline estimate: ~16 000 cells at 40% density
New cells: ~142 (phi_pll_div_40mhz + gf16_dot4_pipe2)
New density: ~42% — well below 60% hard limit ✓
Cell budget: 142 / 24 000 budget = 0.6% consumed

Constitutional Compliance

R-SI-1: Zero new * operators — phi_pll_div_40mhz.v uses only addition/comparison; gf16_dot4_pipe2.v inherits gf16_mul (existing, pre-approved)
Verilog-2005: No logic, no '{...} literals, one reg per line ✓
Cell budget ≤60%: Estimated 42% ✓

DO NOT MERGE — awaiting CI green + reviewer sign-off

…L-S15)

…oughput (L-S15)

…ot4_pipe2 to source_files (L-S15)

…0mhz + gf16_dot4_pipe2 cell addition (L-S15)

gHashTag added 4 commits May 16, 2026 18:02

feat(lane-l-s15): add phi_pll_div_40mhz.v — 8/13 Bresenham @ 40 MHz (…

004500a

…L-S15)

feat(lane-l-s15): add gf16_dot4_pipe2.v — 2-stage pipeline for 2x thr…

86f5227

…oughput (L-S15)

feat(lane-l-s15): clock_hz 50->40 MHz, add phi_pll_div_40mhz + gf16_d…

9d7732d

…ot4_pipe2 to source_files (L-S15)

feat(lane-l-s15): bump PL_TARGET_DENSITY_PCT 40->42 for phi_pll_div_4…

885a1eb

…0mhz + gf16_dot4_pipe2 cell addition (L-S15)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(lane-l-s15): PLL retune — +70 TOPS/W (v3 roadmap)#49

feat(lane-l-s15): PLL retune — +70 TOPS/W (v3 roadmap)#49
gHashTag wants to merge 4 commits into
feat/tt-v7-powerfrom
feat/lane-l-s15-pll-retune

gHashTag commented May 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

gHashTag commented May 16, 2026

L-S15 PLL Retune — 50 MHz → 40 MHz + 2× GF16 dot4 throughput

Approach

TOPS/W Projection

New Files

Modified Files

Cell Budget

Constitutional Compliance

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant