Improve codegen for align_offset
#75600
Conversation
At opt-levels <= 1, methods such as `wrapping_mul` are not inlined, which significantly bloats and slows down the implementation at these optimisation levels. By calling the corresponding intrinsics directly, the codegen of this function at -Copt-level=1 becomes the same as at -Copt-level=3.
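As a rough illustration of the idea (not the actual patch), calling the intrinsic directly sidesteps the method wrapper entirely; this sketch only compiles on nightly with the `core_intrinsics` feature:

```rust
#![feature(core_intrinsics)]
// Nightly-only sketch: `core::intrinsics::wrapping_mul` is the primitive
// that `usize::wrapping_mul` forwards to. Calling it directly does not
// depend on the method wrapper being inlined at -Copt-level=1.
use std::intrinsics;

fn mul_wrapping(a: usize, b: usize) -> usize {
    // `wrapping_mul` is on the compiler's list of intrinsics that are
    // safe to call, so no `unsafe` block is needed here.
    intrinsics::wrapping_mul(a, b)
}

fn main() {
    assert_eq!(mul_wrapping(usize::MAX, 2), usize::MAX - 1);
}
```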
Previously, checking for `pmoda == 0` first would get LLVM to generate branchy code, when, for `stride = 1`, the offset can be computed without such a branch by doing, effectively, a `-p % a`. For well-known (constant) alignments, with the new ordering of these conditionals, we end up generating 2 to 3 cheap instructions on x86_64:

```asm
movq %rdi, %rax
negl %eax
andl $7, %eax
```

instead of 5+ as previously. For unknown alignments the new code also generates just 3 instructions:

```asm
negq %rdi
leaq -1(%rsi), %rax
andq %rdi, %rax
```
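For reference, a minimal sketch of the branch-free computation described above (an illustration, not the library code): for a power-of-two alignment `a`, the aligning byte offset of an address `p` is `-p mod a`, which reduces to a negate and a mask:

```rust
/// Branch-free `-p % a` for a power-of-two `a` (illustrative helper,
/// not the actual `align_offset` implementation).
fn offset_to_align(p: usize, a: usize) -> usize {
    debug_assert!(a.is_power_of_two());
    // `a - 1` masks the low bits; `p.wrapping_neg() & (a - 1)` is
    // `-p mod a`, i.e. how many bytes until the next multiple of `a`.
    p.wrapping_neg() & (a - 1)
}

fn main() {
    assert_eq!(offset_to_align(0x1003, 8), 5); // 0x1003 + 5 == 0x1008
    assert_eq!(offset_to_align(0x1000, 8), 0); // already aligned
}
```

With a constant `a = 8` this is exactly the `negl`/`andl $7` pair shown above; with an unknown alignment, computing `a - 1` accounts for the extra `leaq -1(%rsi), %rax`.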
r? @KodrAus (rust_highfive has picked a reviewer for you, use r? to override)
@bors try @rust-timer queue (I don't think we use this much so probably not important, but worth checking)
Awaiting bors try build completion
⌛ Trying commit 5d22b18 with merge 5c6f0b9c5f37b21b2a1dcf91bc2a50a5d6209672...
☀️ Try build successful - checks-actions, checks-azure
Queued 5c6f0b9c5f37b21b2a1dcf91bc2a50a5d6209672 with parent 009551f, future comparison URL.
Finished benchmarking try commit (5c6f0b9c5f37b21b2a1dcf91bc2a50a5d6209672): comparison url. Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. Please note that if the perf results are neutral, you should likely undo the rollup=never given below. Importantly, though, if the results of this run are non-neutral, do not roll this PR up -- it will mask other regressions or improvements in the roll up. @bors rollup=never
Looks good to me! The performance results seem to suggest we do get a slight improvement in those builds with low optimization levels. @bors r+
📌 Commit 5d22b18 has been approved by `KodrAus`
☀️ Test successful - checks-actions, checks-azure
In this PR the `align_offset` implementation is changed/improved to produce better code in certain scenarios, such as when the pointee type has a stride of 1 or when building at low optimisation levels. While these changes do not achieve the "ideal" codegen referenced in #75579, they get significantly closer to it. I'm not sure the codegen can be much better with this function returning the offset, rather than the aligned pointer.
See the individual commit descriptions for further information.
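For context, a small usage example of the stable API whose codegen this PR improves (illustrative only):

```rust
fn main() {
    let buf = [0u8; 64];
    let p = buf.as_ptr();
    // `align_offset` returns how many *elements* (here: bytes, since the
    // stride of `u8` is 1) the pointer must advance to become 8-aligned.
    let off = p.align_offset(8);
    assert!(off < 8);
    assert_eq!((p as usize).wrapping_add(off) % 8, 0);
}
```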