Allow out-of-range lane indices in swizzle and shuffle instructions #11

stoklund · 2017-04-25T19:24:57Z

As part of #8 I had the opportunity to research the availability and performance of general-purpose shuffle instructions like pshufb. These instructions are widely available and behave like a v8x16.swizzle where the lane indices are provided as an i8x16 vector register instead of as immediate operands. Lanes with an out-of-range selector become 0 in the output vector.

The WebAssembly shuffle and swizzle instructions proposed in #1 can be extended to allow for immediate lane indices that are too large. The corresponding lanes in the output vector would be 0.

Having the possibility of zeroed lanes in the output makes it simpler to combine shuffle results with other vectors using v128.or.

We shouldn't add this feature without examples of code where it is useful.

The text was updated successfully, but these errors were encountered:

gnzlbg · 2018-08-08T11:04:39Z

PR #30 adds shuffleVar/permuteVar which appear to be equivalent to swizzle (but it also implements it for all other vector lane combinations) . Why was swizzle removed ?

This change adds a variable shuffle instruction to SIMD proposal. When indices are out of range, the result is specified as 0 for each lane. This matches hardware behavior on ARM and RISCV architectures. On x86_64 and MIPS, the hardware provides instructions that can select 0 when the high bit is set to 1 (x86_64) or any of the two high bits are set to 1 (MIPS). On these architectures, the backend is expected to emit a pair of instructions, saturating add (saturate(x + (128 - 16)) for x86_64) and permute, to emulate the proposed behavior. To distinguish variable shuffles with immediate shuffles, existing v8x16.shuffle instruction is renamed to v8x16.shuffle2_imm to be explicit about the fact that it shuffles two vectors with an immediate argument. This naming scheme allows for adding variants like v8x16.shuffle2 and v8x16.shuffle1_imm in the future. Fixes #68. Contributes to #24. Fixes #11.

dtig added the post SIMD MVP label Feb 26, 2019

penzn mentioned this issue Mar 8, 2019

[RFC] Dynamic shuffle #68

Closed

zeux mentioned this issue Mar 12, 2019

Add v8x16.shuffle1 instruction and rename v8x16.shuffle to v8x16.shuffle2_imm #71

Merged

dtig closed this as completed in #71 Mar 27, 2019

abrown mentioned this issue Aug 12, 2019

Inefficient x64 codegen for swizzle #93

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow out-of-range lane indices in swizzle and shuffle instructions #11

Allow out-of-range lane indices in swizzle and shuffle instructions #11

stoklund commented Apr 25, 2017

gnzlbg commented Aug 8, 2018

Allow out-of-range lane indices in swizzle and shuffle instructions #11

Allow out-of-range lane indices in swizzle and shuffle instructions #11

Comments

stoklund commented Apr 25, 2017

gnzlbg commented Aug 8, 2018