Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use assume to inform the optimiser about refcount invariants #21418

Merged
merged 3 commits into from
Jan 22, 2015

Conversation

Aatch
Copy link
Contributor

@Aatch Aatch commented Jan 20, 2015

The reference count can never be 0, unless we're about to drop the data
completely. Using the assume intrinsic allows us to inform LLVM about
that invariant, meaning it can avoid unnecessary drops.


Before and after IR: https://gist.github.com/Aatch/3786d20df2edaad6a0e8

Generated from the example in #13018

Fixes #13018

The reference count can never be 0, unless we're about to drop the data
completely. Using the `assume` intrinsic allows us to inform LLVM about
that invariant, meaning it can avoid unnecessary drops.
@rust-highfive
Copy link
Contributor

Thanks for the pull request, and welcome! The Rust team is excited to review your changes, and you should hear from @alexcrichton (or someone else) soon.

If any changes to this PR are deemed necessary, please add them as extra commits. This ensures that the reviewer can see what has changed since they last reviewed the code. The way Github handles out-of-date commits, this should also make it reasonably obvious what issues have or haven't been addressed. Large or tricky changes may require several passes of review and changes.

Please see CONTRIBUTING.md for more information.

@alexcrichton
Copy link
Member

Nice win! It seems like there's still room for improvement as there's still a check for null, so perhaps this shouldn't close #13018 but rather just update the IR?

@Aatch
Copy link
Contributor Author

Aatch commented Jan 20, 2015

@alexcrichton Oh, I missed that. I should be able to tell it to assume that the pointer is non-null.

@Aatch
Copy link
Contributor Author

Aatch commented Jan 20, 2015

Ok, so I updated this to add assumptions that pointer is non-null in a couple places.

New IR: https://gist.github.com/Aatch/3786d20df2edaad6a0e8#file-rc-after2-ll

That has some junk that isn't required because it's only there for the assumes, so the assembly looks like this: https://gist.github.com/Aatch/3786d20df2edaad6a0e8#file-rc-after2-s

@alexcrichton
Copy link
Member

r=me with some comments on each assume explaining why it's there and what optimizations it's gaining us

Nice find!

@Aatch
Copy link
Contributor Author

Aatch commented Jan 20, 2015

@bors r=alexcrichton a7525bc

alexcrichton added a commit to alexcrichton/rust that referenced this pull request Jan 21, 2015
The reference count can never be 0, unless we're about to drop the data
completely. Using the `assume` intrinsic allows us to inform LLVM about
that invariant, meaning it can avoid unnecessary drops.

---

Before and after IR: https://gist.github.com/Aatch/3786d20df2edaad6a0e8

Generated from the example in rust-lang#13018

Fixes rust-lang#13018
@alexcrichton
Copy link
Member

If the rollup bounces again I may have to back this out, it looks like we've taken a serious hit in compile times based on this patch. I did a perf record of building stage0 rustc and got: https://gist.github.com/alexcrichton/479ddc775aec259fd04a

Spending 10% of our time in computing knowledge from assumptions seems like quite a lot!

@alexcrichton
Copy link
Member

To put this into concrete numbers:

# stage0 liballoc with assumptions compiled in
$ time ./x86_64-unknown-linux-gnu/stage0/bin/rustc -O ./src/librustc/lib.rs --out-dir foo
./x86_64-unknown-linux-gnu/stage0/bin/rustc -O ./src/librustc/lib.rs --out-di  610.62s user 2.01s system 99% cpu 10:12.68 total

# stage0 liballoc without assumptions compiled in
$ time ./x86_64-unknown-linux-gnu/stage0/bin/rustc -O ./src/librustc/lib.rs --out-dir foo
./x86_64-unknown-linux-gnu/stage0/bin/rustc -O ./src/librustc/lib.rs --out-di  202.90s user 1.70s system 99% cpu 3:24.62 total

@Aatch
Copy link
Contributor Author

Aatch commented Jan 21, 2015

Ugh, ouch. That's surprising, but not entirely unexpected. The assumption tracker framework is relatively new in LLVM, so it's not too surprising that it hasn't been tuned as much as it could be. I'm fine with pulling the PR if it has that much of a negative effect on compile times.

@Aatch Aatch closed this Jan 21, 2015
@Aatch Aatch reopened this Jan 21, 2015
@alexcrichton
Copy link
Member

Could you measure the impact on just having the assumptions on reference counting? The assumptions on deref may be overloading the compiler, but perhaps deref happens an order of magnitude more often than clone?

@bors bors merged commit a7525bc into rust-lang:master Jan 22, 2015
@alexcrichton
Copy link
Member

Ah sorry this didn't actually merge (had to revert the commits). Feel free to make a new PR though!

bors added a commit that referenced this pull request Feb 5, 2015
This is half of what @Aatch implemented in #21418. The non-null assumption is later canonicalized to !nonnull metadata and doesn't cause any slowdowns (in fact the build is slightly faster with this change). I left out the other half of #21418 because that still causes a ~16% increase in compile times (30m -> 35m).
bors added a commit that referenced this pull request Aug 21, 2018
Change `Rc::inc_{weak,strong}` to better hint optimization to LLVM

As discussed in #13018, `Rc::inc_strong` and `Rc::inc_weak` are changed to allow compositions of `clone` and `drop` to be better optimized. Almost entirely as in [this comment](#13018 (comment)), except that `abort` on zero is added so that a `drop(t.clone())` does not produce a zero check followed by conditional deallocation.

This is different from #21418 in that it doesn't rely on `assume`, avoiding the prohibitive compilation slowdown.

[Before and after IR](https://gist.github.com/hermord/266e55451b7fe0bb8caa6e35d17c86e1).
bors added a commit that referenced this pull request Aug 21, 2018
Change `Rc::inc_{weak,strong}` to better hint optimization to LLVM

As discussed in #13018, `Rc::inc_strong` and `Rc::inc_weak` are changed to allow compositions of `clone` and `drop` to be better optimized. Almost entirely as in [this comment](#13018 (comment)), except that `abort` on zero is added so that a `drop(t.clone())` does not produce a zero check followed by conditional deallocation.

This is different from #21418 in that it doesn't rely on `assume`, avoiding the prohibitive compilation slowdown.

[Before and after IR](https://gist.github.com/hermord/266e55451b7fe0bb8caa6e35d17c86e1).
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Room for optimization improvement on reference counting
4 participants