x64: Incorrect out-of-bounds access reported for a misaligned float load #4890

alexcrichton · 2022-09-09T18:13:38Z

This input module:

(module
  (func (param i32) (result f32)
    f32.const 0
    local.get 0
    f32.load offset=1
    f32.copysign
  )
  (memory 1)
  (export "" (func 0))
)

yields:

$ cargo run -q testcase0.shrunken.wat --invoke '' 0
warning: using `--invoke` with a function that takes arguments is experimental and may break in the future
Error: failed to run main module `testcase0.shrunken.wat`

Caused by:
    0: failed to invoke ``
    1: wasm trap: out of bounds memory access
       wasm backtrace:
           0:   0x2e - <unknown>!<wasm function 0>

but the trap reported here is incorrect because there is no out-of-bounds memory access in this module. Instead what's happening is that in the compile function:

0000000000000000 <_wasm_function_0>:
       0:       55                      push   %rbp
       1:       48 89 e5                mov    %rsp,%rbp
       4:       8b c2                   mov    %edx,%eax
       6:       48 8b 4f 50             mov    0x50(%rdi),%rcx
       a:       41 ba 00 00 00 80       mov    $0x80000000,%r10d
      10:       66 45 0f 6e da          movd   %r10d,%xmm11
      15:       66 41 0f 6f c3          movdqa %xmm11,%xmm0
      1a:       0f 55 05 0f 00 00 00    andnps 0xf(%rip),%xmm0        # 30 <_wasm_function_0+0x30>
      21:       44 0f 54 5c 01 01       andps  0x1(%rcx,%rax,1),%xmm11
      27:       41 0f 56 c3             orps   %xmm11,%xmm0
      2b:       48 89 ec                mov    %rbp,%rsp
      2e:       5d                      pop    %rbp
      2f:       c3                      retq

The instruction at 0x21 is segfaulting due to a misaligned address. The segfault is also registered as a trap point in Wasmtime since I believe this is a folding of the f32.load into the f32.copysign and so this could also segfault due to an out-of-bounds memory access.

Bisection reveals that this issue become a segfault in #4730 and then became a trap in #4790 (cc @elliottt). We'll want to be sure to backport the fix for this to the release-1.0.0 branch as well.

The text was updated successfully, but these errors were encountered:

elliottt · 2022-09-09T18:56:35Z

It seems like we want a stronger condition in put_in_xmm_mem, as we need to know that it's a mergeable load and 16-byte aligned to merge the load:

wasmtime/cranelift/codegen/src/isa/x64/lower/isle.rs

Lines 169 to 175 in 2986f6b

    
           if let InputSourceInst::UniqueUse(src_insn, 0) = inputs.inst { 
        
               if let Some((addr_input, offset)) = is_mergeable_load(self.lower_ctx, src_insn) { 
        
                   self.lower_ctx.sink_inst(src_insn); 
        
                   let amode = lower_to_amode(self.lower_ctx, addr_input, offset); 
        
                   return XmmMem::new(RegMem::mem(amode)).unwrap(); 
        
               } 
        
           }

cfallin · 2022-09-09T20:06:27Z

I sort of suspect that we will find most loads encountered by put_in_xmm_mem won't have any known alignment (greater than 1 byte); at least, in CLIF lowered from Wasm, that will be the case, since Wasm SIMD loads must work with unaligned addresses. The simplest fix here may just be to remove load-op merging in the XMM case, and always force to a register (i.e., delete exactly the lines you highlighted). Thoughts?

elliottt · 2022-09-09T20:17:36Z

I was just experimenting with that fix and was going to suggest it :) I'll make a PR!

bjorn3 · 2022-09-09T20:28:46Z

In cg_clif we do almost always know an alignment bigger than 1.

cfallin · 2022-09-09T20:52:23Z

In cg_clif we do almost always know an alignment bigger than 1.

Fair enough; we can eventually look at adding this back, while respecting alignment, but IMHO we should fix the fuzzbug first in the most straightforward way, especially given we want to cherry-pick this to the 1.0 branch too :-)

elliottt · 2022-09-09T20:52:34Z

Here's another option: in is_mergable_load we currently only require the address to be aligned if the type being loaded is a vector type. Why don't we extend this to floating point types as well?

wasmtime/cranelift/codegen/src/isa/x64/lower.rs

Lines 107 to 111 in 2986f6b

    
           // SIMD instructions can only be load-coalesced when the loaded value comes 
        
           // from an aligned address. 
        
           if load_ty.is_vector() && !insn_data.memflags().map_or(false, |f| f.aligned()) { 
        
               return None; 
        
           }

elliottt · 2022-09-09T20:55:14Z

We could also pass a flag to is_mergeable_load that indicates if the destination would be an xmm register, and use that in place of the check for load_ty.is_vector().

cfallin · 2022-09-09T20:56:05Z

Possibly, yeah, though it's less obviously clear to me that this covers all cases (e.g. what if some lowering were to somehow load an integer-typed value into an XMM register?). The proximate cause here is that loads into XMM registers merged into SSE instructions need to be aligned, so I'd prefer to take the more risk-averse option for the initial fix and turn that path off altogether, I think...

elliottt · 2022-09-09T20:57:53Z

For now, never merging the loads is obviously correct, and we can always revisit this later :)

Do not merge loads for xmm registers, as alignment requirements currently aren't satisfied with clif lowered from wasm. Fixes #4890

abrown · 2022-09-12T17:19:32Z

Perhaps there should be an issue to revisit this later? @elliottt brought up what seemed to me like good avenues for fixing this.

alexcrichton · 2022-09-12T18:17:02Z

@elliottt also would you be up for backporting this to the release-1.0.0 branch? If not no worries and I can do that as well.

elliottt · 2022-09-12T18:20:42Z

Absolutely, I'll make a PR today and send it to you for review.

Do not merge loads for xmm registers, as alignment requirements currently aren't satisfied with clif lowered from wasm. Fixes bytecodealliance#4890

alexcrichton added the fuzz-bug Bugs found by a fuzzer label Sep 9, 2022

elliottt mentioned this issue Sep 9, 2022

Don't merge loads for xmm registers #4891

Merged

elliottt closed this as completed in #4891 Sep 12, 2022

elliottt added a commit that referenced this issue Sep 12, 2022

Don't merge loads for xmm registers (#4891)

ad09c27

Do not merge loads for xmm registers, as alignment requirements currently aren't satisfied with clif lowered from wasm. Fixes #4890

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

x64: Incorrect out-of-bounds access reported for a misaligned float load #4890

x64: Incorrect out-of-bounds access reported for a misaligned float load #4890

alexcrichton commented Sep 9, 2022

elliottt commented Sep 9, 2022

cfallin commented Sep 9, 2022

elliottt commented Sep 9, 2022

bjorn3 commented Sep 9, 2022

cfallin commented Sep 9, 2022

elliottt commented Sep 9, 2022 •

edited

Loading

elliottt commented Sep 9, 2022

cfallin commented Sep 9, 2022

elliottt commented Sep 9, 2022

abrown commented Sep 12, 2022

alexcrichton commented Sep 12, 2022

elliottt commented Sep 12, 2022

x64: Incorrect out-of-bounds access reported for a misaligned float load #4890

x64: Incorrect out-of-bounds access reported for a misaligned float load #4890

Comments

alexcrichton commented Sep 9, 2022

elliottt commented Sep 9, 2022

cfallin commented Sep 9, 2022

elliottt commented Sep 9, 2022

bjorn3 commented Sep 9, 2022

cfallin commented Sep 9, 2022

elliottt commented Sep 9, 2022 • edited Loading

elliottt commented Sep 9, 2022

cfallin commented Sep 9, 2022

elliottt commented Sep 9, 2022

abrown commented Sep 12, 2022

alexcrichton commented Sep 12, 2022

elliottt commented Sep 12, 2022

elliottt commented Sep 9, 2022 •

edited

Loading