Update aarch64.cat to newer version & add NoRet support by ThomasHaas · Pull Request #986 · hernanponcedeleon/Dat3M

ThomasHaas · 2026-02-12T15:41:53Z

I think I got every feature we need. We should add a few more litmus tests though.

~~TODO~~:

~~Add the two new MSA litmus tests from herd7 for the updated MSA support~~
~~Add litmus tests for NoRet support~~
-- ~~The tests are added, but we are missing support for LDADD, STADD, and CAS instructions in the grammar/parser.~~
~~The grammar needs some cleanup.~~

~~There are still missing variants of STADD, but at least we have enough support for the litmus tests that show correct NoRet behavior.~~

ThomasHaas · 2026-02-12T16:14:52Z

We can add the litmus tests from herd7. I just tested one of them, and got the expected outcome with the changes in this PR.

benchmarks/mixed/store-to-load-forwarding1.c

ThomasHaas · 2026-02-12T17:51:30Z

The RelationAnalysisTest just ignores the skip list and the expected values list and so also tries to parse the unsupported litmus tests...

ThomasHaas · 2026-02-15T14:51:35Z

Herd7 has some more tests for NoRet loads.

ThomasHaas · 2026-02-15T21:29:37Z

We should have all variants of STXXX and LDXXX now, including the min/max variants.
I think the only thing that is missing are more litmus tests

dartagnan/src/main/java/com/dat3m/dartagnan/parsers/program/visitors/VisitorLitmusAArch64.java

dartagnan/src/main/java/com/dat3m/dartagnan/program/processing/compilation/VisitorArm8.java

dartagnan/src/test/resources/ARM8-expected.csv

ThomasHaas · 2026-02-16T18:58:26Z

dartagnan/src/main/java/com/dat3m/dartagnan/program/processing/compilation/VisitorArm8.java

+        Expression cmpValue = cas.getExpectedValue();
+        Local captureCmpVal = null;
+        if (cmpValue.getRegs().contains(resultRegister)) {
+            Register tmpReg = cas.getFunction().newRegister(resultRegister.getType());
+            captureCmpVal = newLocal(tmpReg, cmpValue);
+            cmpValue = tmpReg;
+        }
+
+        Expression newValue = cas.getStoreValue();
+        Local captureNewVal = null;
+        if (newValue.getRegs().contains(resultRegister)) {
+            Register tmpReg = cas.getFunction().newRegister(resultRegister.getType());
+            captureNewVal = newLocal(tmpReg, newValue);
+            newValue = tmpReg;
+        }


Btw. this case can be simplified if we perform the load always into a fresh register r_fresh and only at the end assign resultReg=r_fresh.
However, this solution has one disadvantage: SCCP will propagate the resultReg=r_fresh assignment, thereby only leaving the r_fresh register. Therefore, this transformation effectively eliminates the original register name.
If we don't care about this, then we can use the easier solution. If we want to preserve the original register name, we have to do this complex capturing process.
Some middleground would be to use a fresh register and just give it a name related to the original one, e.g., by adding a unique suffix.

EDIT: In general, I'm pretty sure that we have several compilation schemes that would fail if the result register was used as part of any operand. This is certainly something we should fix in a streamlined way.

However, this solution has one disadvantage: SCCP will propagate the resultReg=r_fresh assignment, thereby only leaving the r_fresh register. Therefore, this transformation effectively eliminates the original register name.

Can't we prevent this by using NOOPT?

EDIT: In general, I'm pretty sure that we have several compilation schemes that would fail if the result register was used as part of any operand. This is certainly something we should fix in a streamlined way.

What do you mean by "fail"? Can you give a concrete example where some compilation would fail?

However, this solution has one disadvantage: SCCP will propagate the resultReg=r_fresh assignment, thereby only leaving the r_fresh register. Therefore, this transformation effectively eliminates the original register name.

Can't we prevent this by using NOOPT?

We have NOOPT tags for litmus code (do they survive compilation though?). SCCP will still propagate the assignment, but just not delete the (dead) register assignment. Semantically, the code is correct either way. The question is if we even care about the original registers all too much. For LLVM code, they have random names anyways. For Litmus code, the register assignment will remain for the purpose of exists/forall clauses I think. The witness might look a bit different though.

EDIT: The NOOPT tag does not survive compilation. We could propagate it, or instead use Metadata to signify it (Metadata is propagated through compilation).

EDIT: In general, I'm pretty sure that we have several compilation schemes that would fail if the result register was used as part of any operand. This is certainly something we should fix in a streamlined way.

What do you mean by "fail"? Can you give a concrete example where some compilation would fail?

The compilation won't fail per say, but the semantics becomes wrong.
For example, consider this code from the ARM8 compilation visitor:

@Override public List<Event> visitXchg(Xchg xchg) { Register resultRegister = xchg.getResultRegister(); Expression address = xchg.getAddress(); String loadMo = xchg.hasTag(ARMv8.MO_ACQ) ? ARMv8.MO_ACQ : ""; String storeMo = xchg.hasTag(ARMv8.MO_REL) ? ARMv8.MO_REL : ""; return eventSequence( newRMWLoadExclusiveWithMo(resultRegister, address, loadMo), newRMWStoreExclusiveWithMo(address, xchg.getValue(), true, storeMo) ); }

Now suppose the Xchg was of shape r = Xchg(&x, r), i.e. the exchange value was the same as the result register.
Then the generated load will overwrite r and the generated store will write the just-observed value.

// Wrong: r = load(x); store(x, r); // Correct (A) r_old = r; r = load(x); store(x, r_old); // Correct (B) r_fresh = load(x); store(x, r); r = r_fresh;

am I right that we were never hit by this because of LLVM's SSA form?

Is the only disadvantage of the "perform the load always into a fresh register r_fresh" option that we lose register name? That is a no issue a think: witnesses do not show register names anyway, so this is only visible by the printer (and only after compilation!) which anyway has many similar things (e.g., all the __side_effect registers).

am I right that we were never hit by this because of LLVM's SSA form?

SSA certainly prevents this issue. But I'm 99% sure that we already considered this problem when writing several compiler mappings, for example, in LKMMXchg

... Register dummy = e.getFunction().newRegister(resultRegister.getType()); Load load = newRMWLoadExclusiveWithMo(dummy, address, ARMv8.extractLoadMoFromLKMo(mo)); Store store = newRMWStoreExclusiveWithMo(address, e.getValue(), true, ARMv8.extractStoreMoFromLKMo(mo)); ... return eventSequence( load, store, newLocal(resultRegister, dummy), ...

Is the only disadvantage of the "perform the load always into a fresh register r_fresh" option that we lose register name? That is a no issue a think: witnesses do not show register names anyway, so this is only visible by the printer (and only after compilation!) which anyway has many similar things (e.g., all the __side_effect registers).

Yes, we only lose register names. However, for llvm code, it doesn't matter too much because the register names are meaningless anyways (most of the time at least).
My concern was more about litmus code where we try to stay more truthful to the actul source code. I mean, we even put NOOPT everywhere for no other reason than making the code look like the original (though I think the NOOPT gets lost during compilation anyways...)
I know that there was a time where some optimizations were wrong on litmus code, possibly due to the lack of SSA form, and so we disabled them. But nowadays, everything we do should be sound.

Also witnesses do contain register names: they contain all local events. And even if they didn't they would still contain at least the load events which also assign to a register.

Also witnesses do contain register names: they contain all local events. And even if they didn't they would still contain at least the load events which also assign to a register.

Witnesses (to be clear, I refer to png file) contain local and load events, but we do not display the register name but the value it got in the execution.

I just generated a png witness and it has nodes like

bv32 _mem2Reg#2(1800) <- _mem2Reg#3 + bv(1500)

I mean, how would you even display the value of a local event without the register it assigns to? It would just be a single number connected to nothing?

Ok, you are right we show the register name for Local; it is for Load that we just show the value.

As we discussed, for LLVM register name is not that relevant. I just checked witnesses in litmus code and we have things like bool DUMMY_REG#4(true) <- bv64 DUMMY_REG#6 == bv64 DUMMY_REG#5 in the event.

I do not think things get worse by using r_fresh or r_old.

Then I will change the compilation scheme accordingly.

I changed the compilation and also fixed the one for Xchg.

…han generic ProgramBuilder). Remove A/Q tags from Loads into zero regs (XZR/WZR)

…r/parser support

…rch64.cat version)

Add support for AARCH64 CAS instructions

Added two new common Events RMWOp and RMWFetchOp

Signed-off-by: Hernan Ponce de Leon <hernanl.leon@huawei.com>

…rm8)

ThomasHaas mentioned this pull request Feb 12, 2026

Implement new C++26 atomic operations #984

Open

hernanponcedeleon reviewed Feb 12, 2026

View reviewed changes

benchmarks/mixed/store-to-load-forwarding1.c Outdated Show resolved Hide resolved

ThomasHaas changed the title ~~[DRAFT] Update aarch64.cat to newer version & add NoRet support~~ Update aarch64.cat to newer version & add NoRet support Feb 12, 2026

ThomasHaas force-pushed the update-arm8-support branch from 04f7250 to d3ee07a Compare February 15, 2026 21:02

hernanponcedeleon reviewed Feb 16, 2026

View reviewed changes

ThomasHaas commented Feb 16, 2026

View reviewed changes

hernanponcedeleon force-pushed the update-arm8-support branch 2 times, most recently from f7e311b to 6995f76 Compare February 18, 2026 14:07

ThomasHaas and others added 17 commits February 19, 2026 12:28

Update aarch64.cat

de4d82a

Add Armv8.NoRet tag to Loads into zero registers

c1b02c4

Move NoRet annotation code into right place (AARCH64 visitor rather t…

4632428

…han generic ProgramBuilder). Remove A/Q tags from Loads into zero regs (XZR/WZR)

Add new AARCH64 litmus tests: some are disabled due to missing gramma…

46c55c1

…r/parser support

Updated expected value of store-to-load-forwarding1.c (PASS on new aa…

32f53ae

…rch64.cat version)

Add reference to aarch64 patch that changed behavior of a benchmark.

59aa61c

Add new CAS event common for hardware models

006dde0

Add support for AARCH64 CAS instructions

Added support for parsing LDOP (Aarch64)

1d87202

Added two new common Events RMWOp and RMWFetchOp

Add STADD and variants (except for min/max)

4aa0062

Fixup after rebase

ea3bfa5

ADD SMIN/SMAX/UMIN/UMAX variants of LDXXX/STXXX

5ca2f9e

Minor renaming

363c72b

Delete broken assertionValue rule in LitmusAArch64.g4

cc1121e

Add more NoRet tests

81527c4

Signed-off-by: Hernan Ponce de Leon <hernanl.leon@huawei.com>

Fix

e0c60a6

Signed-off-by: Hernan Ponce de Leon <hernanl.leon@huawei.com>

Feedback

7437427

Changed compilation scheme of CAS/RMWOp and fixed Xchg compilation (A…

33f7ef0

…rm8)

ThomasHaas force-pushed the update-arm8-support branch from 794e645 to 33f7ef0 Compare February 19, 2026 11:28

hernanponcedeleon merged commit b35ef1a into development Feb 19, 2026
7 checks passed

hernanponcedeleon deleted the update-arm8-support branch February 19, 2026 12:53

ThomasHaas mentioned this pull request Feb 19, 2026

Add support for C26 atomic reductions (without compiler mappings) #985

Open

Conversation

ThomasHaas commented Feb 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ThomasHaas commented Feb 12, 2026

Uh oh!

Uh oh!

ThomasHaas commented Feb 12, 2026

Uh oh!

ThomasHaas commented Feb 15, 2026

Uh oh!

ThomasHaas commented Feb 15, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ThomasHaas Feb 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

hernanponcedeleon Feb 16, 2026

Choose a reason for hiding this comment

Uh oh!

ThomasHaas Feb 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

hernanponcedeleon Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

ThomasHaas Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

hernanponcedeleon Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

ThomasHaas Feb 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

hernanponcedeleon Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

ThomasHaas Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

ThomasHaas Feb 19, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ThomasHaas commented Feb 12, 2026 •

edited

Loading

ThomasHaas Feb 16, 2026 •

edited

Loading

ThomasHaas Feb 16, 2026 •

edited

Loading

ThomasHaas Feb 18, 2026 •

edited

Loading