[herd] Global count of useful hardware update by maranget · Pull Request #1733 · herd/herdtools7

maranget · 2026-03-02T13:21:49Z

This PR is an evolution of PR #1730. The number of useful hardware updates is evaluated by a global scan of events at the very end of execution candidate construction. The scan counts the explicit write effects to locations that can be page table entries, restricted to written values whose AF flag can be zero.

The technique sounds more general and should yield a better upper bound than the local technique of PR #1730.

Additionally, we identify the simple case of a failing CAS that nevertheless writes the value read from memory back into memory. In such a case, there is no need to count another hardware update.

lib/misc.ml

herd/tests/instructions/AArch64.kvm/A032.litmus.expected

herd/tests/instructions/AArch64.kvm/A033.litmus

The function [delay_kont] can be used to extract the value (of type `'a`) returned by a monad (the second argument below).

```
val delay_kont : string -> 'a t -> ('a -> 'a t -> 'b t) -> 'b t
```

The continuation function `(fun v mv -> ... )` can then examine the returned value `v` and combine the monad `mv` independently, which proves very convenient on many occasions. The change performed by this commit permits affine usage of `mv` (_i.e._ one or zero effective occurrences), while linear usage (exactly one occurrence) was mandatory before.
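To make the difference between linear and affine usage concrete, here is a minimal sketch on a toy monad. Only the signature of `delay_kont` comes from the commit message; the monad (a value paired with a list of effect labels) and the helpers `return`, `bind`, `read`, `linear` and `affine` are assumptions made for illustration and do not reflect herd's event-structure monad.

```ocaml
(* Toy monad: a value paired with the list of effects that produce it.
   This is NOT herd's event-structure monad, only a stand-in. *)
type 'a t = 'a * string list

let return (v : 'a) : 'a t = (v, [])

(* Sequential composition: effects of [m] come before those of [f v]. *)
let bind ((v, es) : 'a t) (f : 'a -> 'b t) : 'b t =
  let (w, fs) = f v in
  (w, es @ fs)

(* Same shape as the signature quoted above. The continuation receives the
   value [v] and the monad [mv] separately and decides how to use [mv]. *)
let delay_kont (_name : string) (m : 'a t) (k : 'a -> 'a t -> 'b t) : 'b t =
  let (v, _) = m in
  k v m

(* Hypothetical read effect returning a fixed value. *)
let read (loc : string) (v : int) : int t = (v, ["R " ^ loc])

(* Linear usage: [mv] occurs exactly once in the result. *)
let linear : int t =
  delay_kont "linear" (read "x" 1)
    (fun v mv -> bind mv (fun _ -> return (v + 1)))

(* Affine usage: when the value is zero, [mv] is dropped altogether. *)
let affine (m : int t) : int t =
  delay_kont "affine" m (fun v mv -> if v = 0 then return 0 else mv)
```

In this toy setting, `affine (read "x" 0)` carries no effect at all, while `affine (read "x" 5)` keeps the single read effect.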

+ Efficient group function: sort, then group.
+ Suffix based generators:
  - generate all suffixes,
  - cross product of suffixes.
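As an illustration of the "sort, then group" idea and of suffix generation, a possible sketch follows. The names `group` and `suffixes` and their signatures are assumptions, not the actual additions to `lib/misc.ml`; the cross product of suffixes is left out because its exact shape is not described here.

```ocaml
(* Group equal elements: sort first, then collect adjacent runs.
   Cost is dominated by the sort, i.e. O(n log n) comparisons. *)
let group (cmp : 'a -> 'a -> int) (xs : 'a list) : 'a list list =
  let rec collect current acc = function
    | [] -> List.rev (List.rev current :: acc)
    | y :: ys ->
        (match current with
         | x :: _ when cmp x y = 0 -> collect (y :: current) acc ys
         | _ -> collect [y] (List.rev current :: acc) ys)
  in
  match List.sort cmp xs with
  | [] -> []
  | x :: rest -> collect [x] [] rest

(* All suffixes of a list, longest first, ending with the empty list. *)
let rec suffixes (xs : 'a list) : 'a list list =
  match xs with
  | [] -> [[]]
  | _ :: tl -> xs :: suffixes tl
```

For instance, `group compare [3; 1; 2; 1; 3; 3]` yields `[[1; 1]; [2]; [3; 3; 3]]`, and `suffixes [1; 2; 3]` yields `[[1; 2; 3]; [2; 3]; [3]; []]`.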

When TTHM=HA or TTHM=HD is active, hardware updates of the AF flag are performed. This includes the so-called "spurious" updates, which are performed independently of test code. For efficiency reasons, we limit the number of such spurious updates to what is necessary. We do so by a global scan of the execution candidates, counting the writes that may unset the AF flag in the final set of effects. Notice that we also consider the initial writes in this scan. We perform one optimisation: as an exception, when the value of a write effect is the same as the value read by the same instruction from the same location, there is no need to add a supplementary spurious update, since the (potential) update associated with the write that stored the value has already been counted and is sufficient.
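The counting rule described above can be sketched as follows. Everything in this block is hypothetical: the `write` record, its fields, the encoding of the AF flag as bit 0, and the function names are assumptions chosen for illustration. In particular, values are concrete integers here, whereas the description speaks of values whose AF flag *may* be zero.

```ocaml
(* Hypothetical, simplified view of a write effect; herd's real events differ. *)
type write = {
  loc : string;            (* location written *)
  value : int;             (* value written; bit 0 stands in for the AF flag *)
  is_pte : bool;           (* can this location be a page table entry? *)
  read_back : int option;  (* value read by the same instruction from [loc] *)
}

(* The AF flag is zero in this toy encoding when bit 0 is clear. *)
let af_is_zero (v : int) : bool = v land 1 = 0

(* A write motivates a spurious update when it targets a possible page table
   entry, stores a value whose AF flag is zero, and does not merely write
   back the value its own instruction read (e.g. the failing CAS case). *)
let motivates_update (w : write) : bool =
  w.is_pte && af_is_zero w.value && w.read_back <> Some w.value

(* Global scan over all write effects, initial writes included. *)
let count_useful_updates (ws : write list) : int =
  List.length (List.filter motivates_update ws)

let example = [
  (* initial write, AF clear: counted *)
  { loc = "PTE(x)"; value = 0b10; is_pte = true; read_back = None };
  (* writes back the value it read: not counted *)
  { loc = "PTE(x)"; value = 0b10; is_pte = true; read_back = Some 0b10 };
  (* AF already set: not counted *)
  { loc = "PTE(x)"; value = 0b11; is_pte = true; read_back = None };
  (* not a page table entry: not counted *)
  { loc = "y"; value = 0; is_pte = false; read_back = None };
]
```

With this input, `count_useful_updates example` evaluates to `1`: only the first write motivates a spurious update.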

As introduced by PR #1733, each (potential) spurious update of the AF flag is "motivated" by a write that can unset this AF flag. This PR strengthens the link by adding an equation that equates the value written with the value read by the update, resulting in better performance in some cases.

maranget · 2026-04-02T17:32:14Z

Hi @relokin and @HadrienRenaud, do I have your approval for merging?

relokin

This PR looks good to me, thanks Luc!

maranget mentioned this pull request Mar 2, 2026

[herd] More efficient computation of atomic load X stores pairs #1735

Merged

maranget force-pushed the global-hwupdates branch from c9f7855 to bbcb779 Compare March 4, 2026 17:56

HadrienRenaud reviewed Mar 5, 2026

lib/misc.ml

maranget force-pushed the global-hwupdates branch from bbcb779 to f760a93 Compare March 23, 2026 16:12

maranget marked this pull request as ready for review March 30, 2026 08:15

maranget mentioned this pull request Mar 30, 2026

[herd] Attempt to limit the number of spurious AF updates #1730

Closed

maranget force-pushed the global-hwupdates branch from f760a93 to 172203a Compare March 30, 2026 09:23

relokin reviewed Mar 31, 2026

herd/tests/instructions/AArch64.kvm/A032.litmus.expected

relokin reviewed Mar 31, 2026

herd/tests/instructions/AArch64.kvm/A033.litmus Outdated

maranget added 5 commits April 1, 2026 10:51

[herd] Small simplification/optimisation

ab19f50

[lib] Additions to the Misc module

819cf87

[herd] Add tests

5247305

maranget force-pushed the global-hwupdates branch from 172203a to 5247305 Compare April 1, 2026 11:35

maranget mentioned this pull request Apr 1, 2026

[herd] more on spurious updates #1772

Draft

relokin approved these changes Apr 2, 2026

maranget wants to merge 5 commits into master from global-hwupdates
