Specialize equality for [T] and comparison for [u8] to use memcmp when possible #32699

bluss · 2016-04-03T14:17:42Z

Specialize equality for [T] and comparison for [u8] to use memcmp when possible

Where T is a type that can be compared for equality bytewise, we can use
memcmp. We can also use memcmp for PartialOrd, Ord for [u8].

Use specialization to call memcmp in PartialEq for slices for certain element types. This PR does not change the user visible API since the implementation uses an intermediate trait. See commit messages for more information.

The memcmp signature was changed from *const i8 to *const u8 which is in line with how the memcmp function is defined in C (taking const void * arguments, interpreting the values as unsigned bytes for purposes of the comparison).

rust-highfive · 2016-04-03T14:17:53Z

r? @brson

(rust_highfive has picked a reviewer for you, use r? to override)

bluss · 2016-04-03T14:18:05Z

Previous specialization PR: #32586

alexcrichton · 2016-04-03T17:52:08Z

Could we avoid the addition of a new sys module and just move the definition? The eq_slice lang item seems like it should be able to just call as_bytes() == as_bytes() now, right?

bluss · 2016-04-03T18:29:19Z

I have a completely unsubstantiated fear that messing with eq_slice will result in ICEs. We can see what happens if we try.

nagisa · 2016-04-04T00:40:29Z

src/libcore/slice.rs

Shouldn’t we be able to implement this ∀T: Copy?

for example f32 and &i32 are two copy types that don't compare for equality byte by byte

Something like this would be helpful for fast comparisons/hashing.

Where T is a type that can be compared for equality bytewise, we can use memcmp. We can also use memcmp for PartialOrd, Ord for [u8] and by extension &str. This is an improvement for example for the comparison [u8] == [u8] that used to emit a loop that compared the slices byte by byte. One worry here could be that this introduces function calls to memcmp in contexts where it should really inline the comparison or even optimize it out, but llvm takes care of recognizing memcmp specifically.

The old test for Ord used no asserts, and appeared to have a wrong test. (!).

bluss · 2016-04-05T12:07:42Z

I have updated (rebased) the branch. Moving memcmp into slice.rs required specializing PartialOrd, Ord too, so that &str could use that. I'll let travis run the full testsuite on this.

erickt · 2016-04-05T13:34:32Z

src/libcore/slice.rs

 }
+
+impl_marker_for!(BytewiseEquality,
+                 u8 i8 u16 i16 u32 i32 u64 i64 usize isize char bool);


There is room to eek out more performance here if we specialize for [u16], [u32] and etc. I did some experimentation with trying to speed up memcmp a year or so ago, and most implementations just check a byte at a time until the slices align on a usize. For these larger types, we could just skip some of these alignment checks since we already know they're aligned.

Interesting! That sounds like exactly the motivation needed to make memcmp(*const u8, *const u8, usize) into an llvm intrinsic, so that it uses the alignment information from the pointer (like memcpy already does). http://llvm.org/docs/LangRef.html#llvm-memcpy-intrinsic

@bluss: Exactly. There was a few questions a number of years ago (1, 2) to add one, but it didn't get any traction.

PS: I think I found when I was talking about this on #rust-internals with you, @bluss, back in 2015 :) I can't say that this would be a big win, it might just shave off a few conditionals, which may or may not really matter in real code. I also found my old benchmarks, which I've uploaded to https://github.com/erickt/rust-memcmp-benches.

alexcrichton · 2016-04-05T16:46:30Z

@bors: r+ 28c4d12

bors · 2016-04-06T05:34:09Z

⌛ Testing commit 28c4d12 with merge 523ae98...

Manishearth · 2016-04-06T05:40:59Z

http://buildbot.rust-lang.org/builders/auto-win-msvc-64-opt-rustbuild/builds/597/steps/test/logs/stdio

Linkcheck stage2 (x86_64-pc-windows-msvc)
std\primitive.bool.html:110: broken link - core\slice\trait.BytewiseEquality.html
std\primitive.char.html:746: broken link - core\slice\trait.BytewiseEquality.html
std\primitive.i16.html:826: broken link - core\slice\trait.BytewiseEquality.html
std\primitive.i32.html:826: broken link - core\slice\trait.BytewiseEquality.html
std\primitive.i64.html:826: broken link - core\slice\trait.BytewiseEquality.html
std\primitive.i8.html:826: broken link - core\slice\trait.BytewiseEquality.html
std\primitive.isize.html:826: broken link - core\slice\trait.BytewiseEquality.html
std\primitive.slice.html:664: broken link - core\slice\trait.SliceOrd.html
std\primitive.slice.html:664: broken link - core\slice\trait.SliceOrd.html
std\primitive.slice.html:665: broken link - core\slice\trait.SliceOrd.html
std\primitive.slice.html:665: broken link - core\slice\trait.SliceOrd.html
std\primitive.slice.html:666: broken link - core\slice\trait.SlicePartialOrd.html
std\primitive.slice.html:666: broken link - core\slice\trait.SlicePartialOrd.html
std\primitive.slice.html:667: broken link - core\slice\trait.SlicePartialOrd.html
std\primitive.slice.html:667: broken link - core\slice\trait.SlicePartialOrd.html
std\primitive.slice.html:668: broken link - core\slice\trait.SlicePartialEq.html
std\primitive.slice.html:668: broken link - core\slice\trait.BytewiseEquality.html
std\primitive.slice.html:668: broken link - core\slice\trait.SlicePartialEq.html
std\primitive.slice.html:669: broken link - core\slice\trait.SlicePartialEq.html
std\primitive.slice.html:670: broken link - core\slice\trait.SlicePartialEq.html
std\primitive.slice.html:670: broken link - core\slice\trait.SlicePartialEq.html
std\primitive.slice.html:671: broken link - core\slice\trait.SlicePartialEq.html
std\primitive.u16.html:751: broken link - core\slice\trait.BytewiseEquality.html
std\primitive.u32.html:751: broken link - core\slice\trait.BytewiseEquality.html
std\primitive.u64.html:751: broken link - core\slice\trait.BytewiseEquality.html
std\primitive.u8.html:751: broken link - core\slice\trait.BytewiseEquality.html
std\primitive.usize.html:751: broken link - core\slice\trait.BytewiseEquality.html


command did not execute successfully: "C:\\bot\\slave\\auto-win-msvc-64-opt-rustbuild\\build\\obj\\build\\x86_64-pc-windows-msvc\\stage2-rustc\\x86_64-pc-windows-msvc\\release\\linkchecker.exe" "C:\\bot\\slave\\auto-win-msvc-64-opt-rustbuild\\build\\obj\\build\\x86_64-pc-windows-msvc\\doc"
expected success, got: exit code: 101

bors · 2016-04-06T05:41:09Z

⛄ The build was interrupted to prioritize another pull request.

Manishearth · 2016-04-06T05:41:32Z

@bors r-

(feel free to re-r when the bug is fixed)

bluss · 2016-04-06T10:18:07Z

oh, thanks Manishearth.

I guess we need doc(hidden) on the traits, even if they are private?

This should avoid the trait impls showing up in rustdoc.

bluss · 2016-04-06T10:31:50Z

ok, all other private traits seem to use doc(hidden). resubmitting with that fixed.

@bors r=alexcrichton

bors · 2016-04-06T10:31:50Z

📌 Commit a6c27be has been approved by alexcrichton

Specialize equality for [T] and comparison for [u8] to use memcmp when possible Specialize equality for [T] and comparison for [u8] to use memcmp when possible Where T is a type that can be compared for equality bytewise, we can use memcmp. We can also use memcmp for PartialOrd, Ord for [u8]. Use specialization to call memcmp in PartialEq for slices for certain element types. This PR does not change the user visible API since the implementation uses an intermediate trait. See commit messages for more information. The memcmp signature was changed from `*const i8` to `*const u8` which is in line with how the memcmp function is defined in C (taking const void * arguments, interpreting the values as unsigned bytes for purposes of the comparison).

Rollup of 11 pull requests - Successful merges: #32016, #32583, #32699, #32729, #32731, #32738, #32741, #32745, #32748, #32757, #32786 - Failed merges: #32773

Specialize equality for [T] and comparison for [u8] to use memcmp when possible Specialize equality for [T] and comparison for [u8] to use memcmp when possible Where T is a type that can be compared for equality bytewise, we can use memcmp. We can also use memcmp for PartialOrd, Ord for [u8]. Use specialization to call memcmp in PartialEq for slices for certain element types. This PR does not change the user visible API since the implementation uses an intermediate trait. See commit messages for more information. The memcmp signature was changed from `*const i8` to `*const u8` which is in line with how the memcmp function is defined in C (taking const void * arguments, interpreting the values as unsigned bytes for purposes of the comparison).

Rollup of 7 pull requests - Successful merges: #32674, #32699, #32711, #32745, #32748, #32757, #32789 - Failed merges:

core: check for pointer equality when comparing Eq slices Because `Eq` types must be reflexively equal, an equal-length slice to the same memory location must be equal. This is related to rust-lang#33892 (and rust-lang#32699) answering this comment from that PR: > Great! One more easy question: why does this optimization not apply in the non-BytewiseEquality implementation directly above? Because slices of non-reflexively equal types (like `f64`) are not equal even if it's the same slice. But if the types are `Eq`, we can use this same-address optimization, which this PR implements. Obviously this changes behavior if types violate the reflexivity condition of `Eq`, because their impls of `PartialEq` will no longer be called per-item, but 🤷‍♂ . It's not clear how often this optimization comes up in the real world outside of the same-`&str` case covered by rust-lang#33892, so **I'm requesting a perf run** (on MacOS today, so can't run `rustc_perf` myself). I'm going ahead and making the PR on the basis of being surprised things didn't already work this way. This is my first time hacking rust itself, so as a perf sanity check I ran `./x.py bench --stage 0 src/lib{std,alloc}`, but the differences were noisy. To make the existing specialization for `BytewiseEquality` explicit, it's now a supertrait of `Eq + Copy`. `Eq` should be sufficient, but `Copy` was included for clarity.

rust-highfive assigned brson Apr 3, 2016

bluss mentioned this pull request Apr 3, 2016

specialize ToString for str #32586

Merged

nagisa reviewed Apr 4, 2016
View reviewed changes

bluss added 2 commits April 5, 2016 14:06

Add test for [u8]'s Ord (and fix the old test for ord)

28c4d12

The old test for Ord used no asserts, and appeared to have a wrong test. (!).

bluss force-pushed the slice-memcmp branch from ff67e37 to 28c4d12 Compare April 5, 2016 12:06

bluss changed the title ~~Specialize &[u8] equality (and other primitive types) to use memcmp~~ Specialize equality for [T] and comparison for [u8] to use memcmp when possible Apr 5, 2016

erickt reviewed Apr 5, 2016
View reviewed changes

Manishearth mentioned this pull request Apr 6, 2016

Rollup of 13 pull requests #32759

Closed

slice: Use doc(hidden) on private traits

a6c27be

This should avoid the trait impls showing up in rustdoc.

Manishearth mentioned this pull request Apr 7, 2016

Rollup of 11 pull requests #32787

Closed

bors added a commit that referenced this pull request Apr 7, 2016

Auto merge of #32787 - Manishearth:rollup, r=Manishearth

6a55f4f

Rollup of 11 pull requests - Successful merges: #32016, #32583, #32699, #32729, #32731, #32738, #32741, #32745, #32748, #32757, #32786 - Failed merges: #32773

Manishearth mentioned this pull request Apr 7, 2016

Rollup of 7 pull requests #32794

Merged

bors added a commit that referenced this pull request Apr 7, 2016

Auto merge of #32794 - Manishearth:rollup, r=Manishearth

470ca1c

Rollup of 7 pull requests - Successful merges: #32674, #32699, #32711, #32745, #32748, #32757, #32789 - Failed merges:

bors merged commit a6c27be into rust-lang:master Apr 7, 2016

bluss added the relnotes Marks issues that should be documented in the release notes of the next release. label Apr 9, 2016

aschampion mentioned this pull request Jun 8, 2019

core: check for pointer equality when comparing Eq slices #61665

Merged

Specialize equality for [T] and comparison for [u8] to use memcmp when possible #32699

Specialize equality for [T] and comparison for [u8] to use memcmp when possible #32699

Uh oh!

Conversation

bluss commented Apr 3, 2016

Uh oh!

rust-highfive commented Apr 3, 2016

Uh oh!

bluss commented Apr 3, 2016

Uh oh!

alexcrichton commented Apr 3, 2016

Uh oh!

bluss commented Apr 3, 2016

Uh oh!

nagisa Apr 4, 2016

Choose a reason for hiding this comment

Uh oh!

bluss Apr 4, 2016

Choose a reason for hiding this comment

Uh oh!

petrochenkov Apr 4, 2016

Choose a reason for hiding this comment

Uh oh!

bluss commented Apr 5, 2016

Uh oh!

erickt Apr 5, 2016

Choose a reason for hiding this comment

Uh oh!

bluss Apr 5, 2016

Choose a reason for hiding this comment

Uh oh!

erickt Apr 5, 2016

Choose a reason for hiding this comment

Uh oh!

alexcrichton commented Apr 5, 2016

Uh oh!

bors commented Apr 6, 2016

Uh oh!

Manishearth commented Apr 6, 2016

Uh oh!

bors commented Apr 6, 2016

Uh oh!

Manishearth commented Apr 6, 2016

Uh oh!

bluss commented Apr 6, 2016

Uh oh!

bluss commented Apr 6, 2016

Uh oh!

bors commented Apr 6, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

9 participants