x64: Mask shift amounts for small types #4752

afonso360 · 2022-08-23T14:29:19Z

This was reported by @bjorn3 in https://github.com/bjorn3/rustc_codegen_cranelift/pull/1268#issuecomment-1223916783 when we tried to remove the explicit shift amount masking done by cg_clif.

As a consequence of this fix, #4699 is also fixed, which is nice!

This PR also does some other housekeeping:

Enables i128 shifts in the fuzzer
Enables the i128-shifts-small-types.clif runtests and moves them to i128-shifts.clif
Adds run tests that trigger the issue above (which is similar, but not the same as the i128 one)
Adds compile tests for all forms of ishl/sshr/ushr

Fixes: #4699
cc: @elliottt

github-actions · 2022-08-23T14:43:33Z

Subscribe to Label Action

cc @cfallin, @fitzgen

This issue or pull request has been labeled: "cranelift", "cranelift:area:x64", "isle"

Thus the following users have been cc'd because of the following labels:

cfallin: isle
fitzgen: isle

To subscribe or unsubscribe from this label, edit the .github/subscribe-to-label.json configuration file.


elliottt

This looks great to me!

cranelift/codegen/src/isa/x64/inst.isle

afonso360 · 2022-08-23T16:41:44Z

I think this is also going to invalidate all the fuzzer issues that were reported today, since it changes the input format for the fuzzer. I'm not sure how well OSS-Fuzz handles these situations.

jameysharp · 2022-08-23T16:44:37Z

You're right, Afonso. Could you remove the function-generator changes from this PR for now while we sort out all those issues?

They are fixed. But we had a bunch of fuzzgen issues come in, and we don't want to accidentally mark them as fixed.

jameysharp · 2022-08-23T17:07:21Z

The rustfmt CI failure looks like a transient network issue. Let's re-run it after the rest of the jobs complete.

jameysharp · 2022-08-23T18:39:14Z

cranelift/codegen/src/isa/x64/lower/isle.rs

-        if let Some(c) = inputs.constant {
-            let mask = 1_u64.checked_shl(ty.bits()).map_or(u64::MAX, |x| x - 1);
-            return Imm8Gpr::new(Imm8Reg::Imm8 {
-                imm: (c & mask) as u8,
-            })
-            .unwrap();
-        }
-
-        Imm8Gpr::new(Imm8Reg::Reg {
-            reg: self.put_in_regs(val).regs()[0],
-        })
-        .unwrap()


I want to see if I understand why this PR fixes this issue. Could you confirm or correct my interpretation?

It looks like there are two bugs in put_masked_in_imm8_gpr, and I gather that these bugs aren't present in the similar-looking ISLE rules.

If the shift amount is not defined by an iconst instruction, then this function doesn't mask it at all...? The only thing it does is, if the shift amount needs two registers because it's an i128, then it only takes the lower 64-bit register.

If it is a constant, then it's masked by (1 << ty.bits()) - 1. So for an 8-bit LHS, the shift amount is used modulo 256, but it's actually supposed to be modulo 8. For a 64-bit LHS the shift amount is effectively not masked at all. By contrast, the shift_mask function just returns ty.lane_bits() - 1 so it gets this right. (And also handles vector types correctly, I guess?)
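To make the constant case concrete, here is a minimal sketch rather than the actual Cranelift code. `old_const_mask` copies the expression from the removed lines above; `lane_shift_mask` is a hypothetical stand-in for what `shift_mask` is described as returning, and it only models scalar types.

```rust
/// Mask computed by the removed constant path: (1 << bits) - 1,
/// falling back to u64::MAX when the shift itself overflows.
fn old_const_mask(bits: u32) -> u64 {
    1_u64.checked_shl(bits).map_or(u64::MAX, |x| x - 1)
}

/// Hypothetical stand-in for `shift_mask`: keeps the amount modulo the lane width.
fn lane_shift_mask(lane_bits: u32) -> u64 {
    u64::from(lane_bits) - 1
}

fn main() {
    for bits in [8_u32, 16, 32, 64] {
        println!(
            "i{bits}: old mask = {:#x}, lane mask = {:#x}",
            old_const_mask(bits),
            lane_shift_mask(bits)
        );
    }
    // A shift amount of 9 on an 8-bit value should behave like a shift by 1.
    println!("9 & old = {}", 9 & old_const_mask(8));
    println!("9 & lane = {}", 9 & lane_shift_mask(8));
}
```

With the old mask an amount of 9 passes through unchanged for an 8-bit type, while the lane mask reduces it to 1.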

I guess the reason these bugs weren't obvious sooner is that wasm only uses i32 and i64 types, and x86 already masks 32-bit and 64-bit shift amounts to 5 and 6 bits, respectively. But narrower 8-bit and 16-bit shifts are also masked to 5 bits on x86, so those were wrong.
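The mismatch for narrow types can be illustrated with a small model. This is only a sketch under the assumptions stated in this comment: `x86_shl8` models an 8-bit x86 `shl` that keeps the low 5 bits of the count, and `masked_shl8` models the modulo-8 behavior the shift is supposed to have. Both function names are made up for illustration.

```rust
/// Model of an 8-bit x86 `shl`: the hardware keeps only the low 5 bits of the count.
fn x86_shl8(value: u8, count: u8) -> u8 {
    let count = u32::from(count & 0x1f);
    // Counts of 8..=31 shift every bit out of an 8-bit operand, leaving zero.
    value.checked_shl(count).unwrap_or(0)
}

/// Shift with the amount taken modulo 8, as expected for an 8-bit left-hand side.
fn masked_shl8(value: u8, amount: u8) -> u8 {
    value << (amount & 7)
}

fn main() {
    for amount in [3_u8, 9, 33] {
        println!(
            "1 << {amount}: hardware-only = {}, masked = {}",
            x86_shl8(1, amount),
            masked_shl8(1, amount)
        );
    }
}
```

The two models agree for amounts below 8 and diverge for amounts such as 9, which is the kind of input the new run tests are meant to cover.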

Now that I think I understand this, I'm wondering how hard it is in ISLE to avoid emitting the `and` instruction when the type is i32 or i64.

> If the shift amount is not defined by an iconst instruction, then this function doesn't mask it at all...?

Yes, the source of the issue I was trying to fix was exactly this.

> It looks like there are two bugs in put_masked_in_imm8_gpr, and I gather that these bugs aren't present in the similar-looking ISLE rules.

Wow! I was just trying to fix the unmasked non-const case and didn't realize that the const case didn't do the right thing either!

We should probably add some shift_imm test cases as well.

> Now that I think I understand this, I'm wondering how hard it is in ISLE to avoid emitting the `and` instruction when the type is i32 or i64.

I think that's a good improvement, I'll push it soon.
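One way to picture that improvement is a helper that decides whether an explicit `and` is needed at all. This is a hedged sketch in plain Rust, not the ISLE rule that was pushed; `explicit_shift_mask` is a hypothetical name, and it only covers the scalar widths discussed in this thread.

```rust
/// Returns the mask to apply with an explicit `and`, or `None` when the
/// hardware's own masking of the shift count already gives the right result.
fn explicit_shift_mask(lane_bits: u32) -> Option<u64> {
    match lane_bits {
        // Narrow types: hardware masks to 5 bits, so mask to 3 or 4 bits first.
        8 | 16 => Some(u64::from(lane_bits) - 1),
        // 32-bit and 64-bit shifts are already masked to 5 and 6 bits by x86.
        32 | 64 => None,
        other => panic!("lane width {other} is outside this sketch"),
    }
}

fn main() {
    for bits in [8_u32, 16, 32, 64] {
        match explicit_shift_mask(bits) {
            Some(mask) => println!("i{bits}: emit and with {mask:#x}"),
            None => println!("i{bits}: no explicit mask needed"),
        }
    }
}
```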

And it turns out the const case was also wrong. Great catch!

Now that `put_masked_in_imm8_gpr` works properly we can simplify rotl/rotr

afonso360 · 2022-08-24T10:23:47Z

const_to_type_masked_imm8 was also doing the wrong thing and is now fixed. I've also removed the special cases for rotl/rotr, which were using const_to_type_masked_imm8.

We now use the general case that is using put_masked_in_imm8_gpr and doing the right thing for both cases.

These were the only uses of const_to_type_masked_imm8 outside of put_masked_in_imm8_gpr.

jameysharp · 2022-08-24T17:31:14Z

Very nice work! Good catch on noticing that const_to_type_masked_imm8 had the same bug. I'm pleased to see so many of those `and` instructions disappear from the filetests. And it's great to see the ISLE rules and supporting Rust get both simpler and more correct at the same time!

github-actions bot added the cranelift, cranelift:area:x64, and isle labels Aug 23, 2022

afonso360 mentioned this pull request Aug 23, 2022

cranelift-fuzzgen fuzz bug: "assertion failed: (left == right)" #4755

Closed

elliottt approved these changes Aug 23, 2022


cranelift/codegen/src/isa/x64/inst.isle (outdated, resolved)

x64: Mask shift amounts for small types

94bd34b

afonso360 force-pushed the x64-fix-shifts branch from ab87ae9 to 94bd34b August 23, 2022 16:44

cranelift: Disable i128 shifts in fuzzer again

41012c9

They are fixed. But we had a bunch of fuzzgen issues come in, and we don't want to accidentally mark them as fixed.

jameysharp enabled auto-merge (squash) August 23, 2022 17:07

jameysharp reviewed Aug 23, 2022


jameysharp disabled auto-merge August 23, 2022 18:40

cranelift: Avoid masking shifts for 32 and 64 bit cases

df73ee6

afonso360 mentioned this pull request Aug 24, 2022

cranelift-icache fuzzbug: "internal error: entered unreachable code: Invalid OperandSize: 16" #4756

Closed

afonso360 added 2 commits August 24, 2022 10:56

cranelift: Add const shift tests and fix them

6d17d54

cranelift: Remove const rotl cases

086a9bd

Now that `put_masked_in_imm8_gpr` works properly we can simplify rotl/rotr

jameysharp merged commit d394edc into bytecodealliance:main Aug 24, 2022

afonso360 mentioned this pull request Aug 25, 2022

cranelift: Enable i128 shifts on fuzzer #4783

Merged

This was referenced Aug 26, 2022

cranelift-fuzzgen fuzzbug: "Floating-point-exception in cranelift_filetests::function_runner::CompiledFunction::call::h6386b90d4c398abf" #4760

Closed

Queued cranelift-fuzzgen enhancements #4798

Closed

afonso360 mentioned this pull request Sep 2, 2022

Cranelift: Wrong results in x86_64 and s390x when shifting values of type i16 and i8 #3075

Closed
