
Hash up to 8 bytes at once with FxHasher #1

Merged: 3 commits into master, May 28, 2018
Conversation

@Zoxc (Contributor) commented May 27, 2018

@kennytm (Member) commented May 27, 2018:

r? @michaelwoerister

bytes = &bytes[2..];
}
if (size_of::<usize>() > 1) && bytes.len() >= 1 {
hash.add_to_hash(bytes[0] as usize);
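For context, the hunk above is the tail handling of the new batched `write` path. A minimal self-contained sketch of the whole scheme follows; the seed constant is assumed, and the crate's unsafe unaligned reads are replaced here with safe `from_ne_bytes` calls:

```rust
use std::mem::size_of;

// Assumed seed constant; the real crate uses a fixed pseudorandom value.
const SEED: usize = 0x517c_c1b7_2722_0a95_u64 as usize;

struct FxHasher {
    hash: usize,
}

impl FxHasher {
    fn add_to_hash(&mut self, i: usize) {
        self.hash = (self.hash.rotate_left(5) ^ i).wrapping_mul(SEED);
    }

    fn write(&mut self, mut bytes: &[u8]) {
        const N: usize = size_of::<usize>();
        // Main loop: consume a full usize worth of bytes per mix.
        while bytes.len() >= N {
            let mut word = [0u8; N];
            word.copy_from_slice(&bytes[..N]);
            self.add_to_hash(usize::from_ne_bytes(word));
            bytes = &bytes[N..];
        }
        // Tail: 4-, 2-, then 1-byte chunks, guarded so targets with a
        // smaller usize skip branches that can never fire.
        if N > 4 && bytes.len() >= 4 {
            let mut w = [0u8; 4];
            w.copy_from_slice(&bytes[..4]);
            self.add_to_hash(u32::from_ne_bytes(w) as usize);
            bytes = &bytes[4..];
        }
        if N > 2 && bytes.len() >= 2 {
            self.add_to_hash(u16::from_ne_bytes([bytes[0], bytes[1]]) as usize);
            bytes = &bytes[2..];
        }
        if N > 1 && !bytes.is_empty() {
            self.add_to_hash(bytes[0] as usize);
        }
    }
}

fn main() {
    let mut h = FxHasher { hash: 0 };
    h.write(b"hello world");
    println!("{:x}", h.hash);
}
```

Note that the chunk boundaries depend on where each `write` call starts, which is exactly what the review comment below is about.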
A reviewer (Member) commented on this hunk:

Doesn't all of this mean that splitting a write call changes the hash? IIRC it shouldn't.
Could a union { bytes: [u8; size_of::<usize>()], usize: usize } buffer be used instead?
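The buffering idea above can be sketched to show how it would restore split-invariance. This is a deliberately naive, byte-at-a-time version with a hypothetical name, using a safe array in place of the suggested union and `u64` state for portability:

```rust
// Hypothetical buffered variant: bytes accumulate in a word-sized buffer
// and are only mixed once a full word is available, so
// write(b"ab") followed by write(b"cd") sees the same byte stream as
// write(b"abcd").
struct BufferedFxHasher {
    hash: u64,
    buf: [u8; 8],
    len: usize,
}

impl BufferedFxHasher {
    fn new() -> Self {
        BufferedFxHasher { hash: 0, buf: [0; 8], len: 0 }
    }

    fn mix(&mut self, word: u64) {
        self.hash = (self.hash.rotate_left(5) ^ word).wrapping_mul(0x517c_c1b7_2722_0a95);
    }

    fn write(&mut self, bytes: &[u8]) {
        for &b in bytes {
            self.buf[self.len] = b;
            self.len += 1;
            if self.len == 8 {
                self.mix(u64::from_ne_bytes(self.buf));
                self.len = 0;
            }
        }
    }

    fn finish(&self) -> u64 {
        // Flush any partial word, zero-padded, without mutating state.
        if self.len == 0 {
            return self.hash;
        }
        let mut tail = [0u8; 8];
        tail[..self.len].copy_from_slice(&self.buf[..self.len]);
        (self.hash.rotate_left(5) ^ u64::from_ne_bytes(tail)).wrapping_mul(0x517c_c1b7_2722_0a95)
    }
}

fn main() {
    let mut h = BufferedFxHasher::new();
    h.write(b"hello");
    h.write(b" world");
    println!("{:x}", h.finish());
}
```

The byte-at-a-time loop gives up the throughput this PR is after, and the zero-padded tail means trailing NUL bytes are not distinguished; a real implementation would mix the length in as well.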

@Zoxc (Contributor, Author) replied:

That doesn't seem to be either a documented or a useful property.

A reviewer (Member) replied:

cc @michaelwoerister @gankro I remember discussions about this property

Note that buffering the values is potentially useful if you're mostly writing byte-sized values one at a time, e.g. the discriminants of nested enums.

A reviewer (Member) replied:

Since the FxHasher is only used with hash tables, I don't think the hash needs to be stable. As long as it is deterministic for our use cases, it's fine, I think. It already treats (u8, u8) differently from u16, where a similar argument could be made.

My view is: FxHasher should be the absolute fastest for small keys and it should do whatever it can get away with in practice.
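The (u8, u8) vs. u16 point is about which `Hasher` entry points std's `Hash` impls call: a tuple of two `u8`s reaches the hasher as two `write_u8` calls, while a `u16` arrives as a single `write_u16`. A sketch with a hypothetical tracing hasher makes this visible:

```rust
use std::hash::{Hash, Hasher};

// Hypothetical hasher that records which write_* methods Hash impls call.
#[derive(Default)]
struct TraceHasher {
    calls: Vec<&'static str>,
}

impl Hasher for TraceHasher {
    fn finish(&self) -> u64 {
        0
    }
    fn write(&mut self, _bytes: &[u8]) {
        self.calls.push("write");
    }
    fn write_u8(&mut self, _: u8) {
        self.calls.push("write_u8");
    }
    fn write_u16(&mut self, _: u16) {
        self.calls.push("write_u16");
    }
}

fn main() {
    let mut pair = TraceHasher::default();
    (1u8, 2u8).hash(&mut pair); // one write_u8 per field
    let mut word = TraceHasher::default();
    0x0102u16.hash(&mut word); // a single write_u16
    println!("{:?} vs {:?}", pair.calls, word.calls);
}
```

So even before this PR, two values containing the same bytes could hash differently depending on how the `Hash` impl slices them up.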

@eddyb (Member) commented May 27, 2018:

I still think we should try and bench this against some buffering scheme, especially if it can all be inlined down to a few applications of the usize "block" function.

EDIT: never mind, all the leaves I was thinking of go through the write_uN methods below, so those would also need to be buffered somehow to observe a benefit.

A reviewer (Member) replied:

Yeah, we don't need to do this in this PR. The benchmarks showed that it's an improvement.

As a sidenote, using perf.rlo is a lot more complicated when testing out-of-tree crates...

@michaelwoerister (Member) commented:

The only problem I see here doesn't have to do with the PR directly: since this is a standalone crate now, it should have tests and integrate with Travis. Seeing all tests pass makes approving a PR much simpler.

Cargo.lock (Outdated)
[[package]]
name = "rustc-hash"
version = "0.1.0"
A reviewer (Member) commented:

Why does a library crate have a Cargo.lock?

@michaelwoerister (Member) commented:

I think the version number needs to be bumped so we can publish on crates.io.

@michaelwoerister (Member) commented:

I'll merge this because it was already tested as part of rustc_data_structures. The next PR will have to add tests and CI integration though.

@michaelwoerister michaelwoerister merged commit 1e61258 into master May 28, 2018
@Zoxc Zoxc deleted the batch-bytes branch May 28, 2018 18:42