Document seemingly unneeded self.clone in write #34

krtab · 2024-03-25T21:41:01Z

I don't think the clone is needed.

compiler-errors · 2024-03-25T23:28:27Z

src/lib.rs

@@ -116,20 +116,19 @@ impl Hasher for FxHasher {
        const _: () = assert!(size_of::<usize>() <= size_of::<u64>());
        // Ensure no bytes are discarded by casting to usize
        const _: () = assert!(size_of::<u32>() <= size_of::<usize>());
-        let mut state = self.clone();


@calebsander: Why did you add this clone in #12?

In case they don't answer, my two guesses are:

It is a leftover of the refactoring that is indeed superfluous

It is supposed to prevent committing "partially hashed" slices of bytes to the hasher in case of a panic in add_to_hash. But a) I don't see currently a way for this to panic and b) I don't think it would make much sense to try to recover from a panic anyway, especially as this invariant is not documented anywhere.

Thanks for reviewing

Please check the generated assembly. If I remember correctly, cloning (which is a single 64-bit load, and later a 64-bit store to save the state) allowed the hash accumulation to happen in registers. Accumulating directly on self meant the bitwise operations and multiplication were applied to a memory location. I figured avoiding memory accesses in hot loops (even if they are very cache-friendly) was a good idea. But it's also likely that the Hasher methods will be inlined anyways, with the caller storing the hash state in a local variable, so the state may well be in a register rather than memory in the first place.
It's entirely possible that the compiler no longer behaves this way, or that it doesn't really matter for performance. Adding a benchmark to give some concrete data would be much better than my speculating.

krtab · 2024-03-27T10:45:35Z

@calebsander Thanks for the explanation. It seems that the codegen is indeed suboptimal without the clone. I've added a comment explaining why the clone is here and opened rust-lang/rust#123129 about the suboptimal codegen.

compiler-errors reviewed Mar 25, 2024

View reviewed changes

krtab mentioned this pull request Mar 27, 2024

Suboptimal register allocation across loops rust-lang/rust#123129

Open

Add comment explaining why the state is cloned in FxHasher::write

524e2ab

krtab force-pushed the del_extra_self branch from 7baf494 to 524e2ab Compare March 27, 2024 10:44

krtab changed the title ~~Delete extraneous self clone~~ Document seemingly unneeded self.clone in write Mar 27, 2024

calebsander approved these changes Mar 27, 2024

View reviewed changes

WaffleLapkin approved these changes Mar 30, 2024

View reviewed changes

WaffleLapkin merged commit e155548 into rust-lang:master Mar 30, 2024
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Document seemingly unneeded self.clone in write #34

Document seemingly unneeded self.clone in write #34

krtab commented Mar 25, 2024

compiler-errors Mar 25, 2024

krtab Mar 26, 2024

calebsander Mar 26, 2024

krtab commented Mar 27, 2024

Document seemingly unneeded self.clone in write #34

Document seemingly unneeded self.clone in write #34

Conversation

krtab commented Mar 25, 2024

compiler-errors Mar 25, 2024

Choose a reason for hiding this comment

krtab Mar 26, 2024

Choose a reason for hiding this comment

calebsander Mar 26, 2024

Choose a reason for hiding this comment

krtab commented Mar 27, 2024