ISLE opts for `x ± a == y ± b` #10489

scottmcm · 2025-03-30T01:00:02Z

Noticed these looking at niched discriminant calculations, because discr(a) == discr(b) sometimes expands to frobble(a) + OFFSET == frobble(b) + OFFSET, where it's quite helpful to simplify to frobble(a) == frobble(b) instead.

scottmcm · 2025-03-30T01:41:23Z

tests/disas/gc/drc/externref-globals.wat

-;;                                     v70 = iconst.i64 -1
-;; @003b                               v50 = iadd v49, v70  ; v70 = -1
-;;                                     v71 = iconst.i64 0
-;; @003b                               v51 = icmp eq v50, v71  ; v71 = 0
-;; @003b                               brif v51, block5, block6
+;;                                     v93 = iconst.i64 1
+;;                                     v83 = icmp eq v49, v93  ; v93 = 1
+;; @003b                               brif v83, block5, block6


A case in the wild, here: was a + -1 == 0, now a == 1.

cranelift/codegen/src/opts/icmp.isle

Thinking more, it doesn't seem worth it to just change `x + 1 == y` to `x == y - 1` *always*.

cfallin

Nice, thanks!

Allow matching on 3+ variant niche-encoded enums to optimize better While the two-variant case is most common (and already special-cased), it's pretty unusual to actually need the *fully-general* niche-decoding algorithm (that handles things like 200+ variants wrapping the encoding space and such). Layout puts the niche-encoded variants on one end of the natural values, so because enums don't have that many variants, it's quite common that there's no wrapping because the handful of variants just end up after the end of the `bool` or `char` or `newtype_index!` or whatever. This PR thus looks for those cases: situations where the tag's range doesn't actually wrap, and thus we can check for niche-vs-untag in one simple `icmp` without needing to adjust the tag value, and by picking between zero- and sign-extension based on *which* kind of non-wrapping it is, also help LLVM better understand by not forcing it to think about wrapping arithmetic either. It also emits the operations in a more optimization-friendly order. While the MIR Rvalue calculates a discriminant, so that's what we emit, code normally doesn't actually care about the actual discriminant for these niche-encoded enums. Rather, the discriminant is just getting passed to an equality check (for something like `matches!(foo, TerminatorKind::Goto { .. }`) or a `SwitchInt` (when it's being matched on). So while the old code would emit, roughly ```rust if is_niche { tag + ADJUSTMENT } else { UNTAGGED_DISCR } ``` this PR changes it instead to ```rust (if is_niche { tag } else { UNTAGGED_ADJ_DISCR }) + ADJUSTMENT ``` which on its own might seem odd, but it's actually easier to optimize because what we're actually doing is ```rust complicated_stuff() + ADJUSTMENT == 4 ``` or ```rust match complicated_stuff() + ADJUSTMENT { 0 =>…, 1 => …, 2 => …, _ => unreachable } ``` or in the generated `PartialEq` for enums with fieldless variants, ```rust complicated_stuff(a) + ADJUSTMENT == complicated_stuff(b) + ADJUSTMENT ``` and thus that's easy for the optimizer to eliminate the additions: ```rust complicated_stuff() == 2 ``` ```rust match complicated_stuff() { 7 => …, 8 => …, 9 => …, _ => unreachable } ``` ```rust complicated_stuff(a) == complicated_stuff(b) ``` For good measure I went and made sure that cranelift can do this optimization too 🙂 bytecodealliance/wasmtime#10489 r? WaffleLapkin Follow-up to rust-lang#139098 -- EDIT later: I happened to notice rust-lang#110197 (comment) -- it looks like there used to be some optimizations in this code, but they got removed for being wrong. I've added lots of tests here; let's hope I can avoid that fate 😬 (Certainly it would be possible to save some complexity by restricting this to the easy case, where it's unsigned-nowrap, the niches are after the natural payload, and all the variant indexes are small.)

scottmcm requested a review from a team as a code owner March 30, 2025 01:00

scottmcm requested review from abrown and removed request for a team March 30, 2025 01:00

ISLE opts for x ± a == y ± b

2df12fa

scottmcm force-pushed the eq-add branch from 2be8ae4 to 2df12fa Compare March 30, 2025 01:32

scottmcm requested a review from a team as a code owner March 30, 2025 01:32

scottmcm requested review from fitzgen and removed request for a team March 30, 2025 01:32

scottmcm commented Mar 30, 2025

View reviewed changes

cranelift/codegen/src/opts/icmp.isle Outdated Show resolved Hide resolved

scottmcm added 2 commits March 29, 2025 19:33

Don't reassociate across == without a reason

5ad157d

Thinking more, it doesn't seem worth it to just change `x + 1 == y` to `x == y - 1` *always*.

Subsume when we made it strictly better, and handle more orderings

ec68ada

cfallin approved these changes Mar 30, 2025

View reviewed changes

cfallin added this pull request to the merge queue Mar 30, 2025

Merged via the queue into bytecodealliance:main with commit 3468026 Mar 30, 2025
41 checks passed

scottmcm deleted the eq-add branch March 30, 2025 19:02

scottmcm mentioned this pull request Apr 13, 2025

Allow matching on 3+ variant niche-encoded enums to optimize better rust-lang/rust#139729

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ISLE opts for `x ± a == y ± b` #10489

ISLE opts for `x ± a == y ± b` #10489

scottmcm commented Mar 30, 2025

scottmcm Mar 30, 2025

cfallin left a comment

ISLE opts for x ± a == y ± b #10489

ISLE opts for x ± a == y ± b #10489

Conversation

scottmcm commented Mar 30, 2025

scottmcm Mar 30, 2025

Choose a reason for hiding this comment

cfallin left a comment

Choose a reason for hiding this comment

ISLE opts for `x ± a == y ± b` #10489

ISLE opts for `x ± a == y ± b` #10489