Improve RDRAND implementation #24

josephlr · 2019-06-10T06:17:52Z

This change makes a few improvements to be closer to Intel's recommendations

The value of RETRY_LIMIT is the one recommended by Intel.
Use chunks_exact_mut to elide most calls to memcpy. See generated assembly.
Remove all unsafe code except that used to invoke the RDRAND intrinsic.
Adds Travis build check for x86_64-fortanix-unknown-sgx

Stylistically, the code is now much easier to read. The file has also been renamed to rdrand.rs to better reflect what it does. This diff makes it easier to see what has actually changed.

newpavlov

Overall looks good to me except two small nitpicks.

src/rdrand.rs

josephlr · 2019-06-11T00:33:04Z

@newpavlov this should be ready to merge, provided you're OK with an implementation that needs up to two calls to memcpy at the beginning and end of dest.

I have an alternative implementation that only calls memcpy on small slices. On large slices, it calls write_unaligned at the beginning and end of dest (writing to some of the memory twice). While the alternative implementation also gives a smaller binary, it has to use much more unsafe code to stop the compiler from emitting slice_index_len_fail or panic hooks.

newpavlov · 2019-06-11T09:09:47Z

Personally I prefer the alternative implementation. We even could drop 0 => {} branch to make binary even smaller, as I guess calls ofgetrandom with zero-length slices will be really rare.

If we'll decide to keep the current implementation, then you can make get_rand_unaligned slightly more idiomatic, while keeping the efficiency:

fn get_rand_unaligned(dest: &mut [u8]) -> Result<(), Error> {
    for chunk in dest.chunks_mut(mem::size_of::<u64>()) {
        let data = get_rand_u64()?;
        let n = chunk.len();
        chunk.copy_from_slice(&data.to_ne_bytes()[..n]);
    }
    Ok(())
}

@dhardy
What do you think?

josephlr · 2019-06-11T19:18:30Z

If we'll decide to keep the current implementation, then you can make get_rand_unaligned slightly more idiomatic, while keeping the efficiency:

Oh that's much nicer, I just updated this PR to use essentially that code. The generated assembly still gives what we expect.

Personally I prefer the alternative implementation. We even could drop 0 => {} branch to make binary even smaller, as I guess calls of getrandom with zero-length slices will be really rare.

I'm still trying to figure out if there is a way to get the alternative implementation working without as much unsafe code. I'll let you know my progress.

josephlr · 2019-06-12T03:30:45Z

So after looking at this some more, it’s probably not worth it to hyper-optimize this to maximize the number of aligned accesses. On x86 the same code gets emitted for aligned/unaligned accesses anyway (unaligned ones just happen to be slower).

I’ll rewrite this to just use chunks_mut_exact, should make everything fairly short.

Note: as most of the time this is being used to fill a freshly allocated 128/256 but key, everything will usually be properly aligned regardless.

josephlr · 2019-06-12T07:15:32Z

@dhardy @newpavlov this should be ready for final review/merging, see the updated PR description.

src/rdrand.rs

dhardy requested a review from newpavlov June 10, 2019 06:38

newpavlov reviewed Jun 10, 2019

View reviewed changes

src/rdrand.rs Outdated Show resolved Hide resolved

src/rdrand.rs Outdated Show resolved Hide resolved

josephlr force-pushed the rdrand-ext branch from 93719c0 to e82576b Compare June 11, 2019 00:19

Move sgx.rs to rdrand.rs

5302a81

josephlr force-pushed the rdrand-ext branch from e82576b to ff7f9ba Compare June 11, 2019 00:54

josephlr force-pushed the rdrand-ext branch from d93a059 to 91db682 Compare June 12, 2019 07:01

newpavlov reviewed Jun 12, 2019

View reviewed changes

src/rdrand.rs Outdated Show resolved Hide resolved

Improve RDRAND implementation

0558adf

josephlr force-pushed the rdrand-ext branch from 91db682 to 0558adf Compare June 12, 2019 10:24

newpavlov merged commit 9e64082 into rust-random:master Jun 12, 2019

josephlr deleted the rdrand-ext branch June 12, 2019 10:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve RDRAND implementation #24

Improve RDRAND implementation #24

josephlr commented Jun 10, 2019 •

edited

Loading

newpavlov left a comment

josephlr commented Jun 11, 2019

newpavlov commented Jun 11, 2019 •

edited

Loading

josephlr commented Jun 11, 2019

josephlr commented Jun 12, 2019

josephlr commented Jun 12, 2019

Improve RDRAND implementation #24

Improve RDRAND implementation #24

Conversation

josephlr commented Jun 10, 2019 • edited Loading

newpavlov left a comment

Choose a reason for hiding this comment

josephlr commented Jun 11, 2019

newpavlov commented Jun 11, 2019 • edited Loading

josephlr commented Jun 11, 2019

josephlr commented Jun 12, 2019

josephlr commented Jun 12, 2019

josephlr commented Jun 10, 2019 •

edited

Loading

newpavlov commented Jun 11, 2019 •

edited

Loading