Slice::contains generates suboptimal assembly code

Given `val: u8`, `slice: &[u8; 8]` and `arr: [u8; 8]`, I expected the following statements to compile down to the same thing : 
```rust
// a
slice.contains(&val);

// b
slice[0] == val
  || slice[1] == val
  || slice[2] == val
  || slice[3] == val
  || slice[4] == val
  || slice[5] == val
  || slice[6] == val
  || slice[7] == val;
  
// c
arr.contains(&val);

// d
arr[0] == val
  || arr[1] == val
  || arr[2] == val
  || arr[3] == val
  || arr[4] == val
  || arr[5] == val
  || arr[6] == val
  || arr[7] == val;
  ```
  
  However, the resulting assembly differs quite a lot:
  * The `a` statement compiles down to a loop, checking one element at a time, except for `T = u8|i8` and `N < 16` where it instead call fall on the fast path of [`memchr`](https://github.com/rust-lang/rust/blob/master/library/core/src/slice/memchr.rs#L43) which gets optimized a little bit better.
  * The `b` statement compiles down to a unrolled-loop, checking one element at a time in a branchless fashion. Most of the time it doesn't give any SIMD instructions.
  * The `c` statement always compiles down to a loop, checking one element at a time, except for `T = u8|i8` and `N >= 16` where it instead call [`memchr_general_case`](https://github.com/rust-lang/rust/blob/master/library/core/src/slice/memchr.rs#L50)
  * The `d` statement always compiles down to a few branchless SIMD instructions for any primitive type used and any array size.
  
Because the slice/array size is known at compile-time and the type checker guarantees that it will be respected by any calling function, I expected the compiler to take this into account while optimizing the resulting assembly. However, this information seems to be lost at some point when using the `contains` method.

`arr.contains(&val)` and `slice.contains(&val)` are simplified as `arr.as_ref().iter().any(|e| *e == val)` and `slice.iter().any(|e| *e == val)` if I'm not mistaken (which is wierd because for some N and T, they don't yield the same assembly). The compiler does not seem to be able to unroll this case.

godbolt links for
[T=u8; N=8](https://godbolt.org/z/bTYnod7hc)
[T=u16; N=8](https://godbolt.org/z/9cc7jKsEz)
[T=u32; N=8](https://godbolt.org/z/9c3KoGj17)
[T=u64; N=8](https://godbolt.org/z/4naeK4Too)

[T=u8; N=16](https://godbolt.org/z/G15cWdfYG)
[T=u16; N=16](https://godbolt.org/z/qKWWPxT67)
[T=u32; N=16](https://godbolt.org/z/rq8ha7xMs)
[T=u64; N=16](https://godbolt.org/z/4naeK4Too)
 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Slice::contains generates suboptimal assembly code #88204

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Slice::contains generates suboptimal assembly code #88204

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions