Make io::Error use 64 bits on targets with 64 bit pointers. #87869

thomcc · 2021-08-08T17:10:18Z

I've wanted this for a long time, but didn't see a good way to do it without having extra allocation. When looking at it yesterday, it was more clear what to do for some reason.

This approach avoids any additional allocations, and reduces the size by half (8 bytes, down from 16). AFAICT it doesn't come additional runtime cost, and the compiler seems to do a better job with code using it.

Additionally, this io::Error has a niche (still), so io::Result<()> is also 64 bits (8 bytes, down from 16), and io::Result<usize> (used for lots of io trait functions) is 2x64 bits (16 bytes, down from 24 — this means on x86_64 it can use the nice rax/rdx 2-reg struct return). More generally, it shaves a whole 64 bit integer register off of the size of basically any io::Result<()>.

(For clarity: Improving io::Result (rather than io::Error) was most of the motivation for this)

On 32 bit (or other non-64bit) targets we still use something equivalent the old repr — I don't think think there's improving it, since one of the fields it stores is a i32, so we can't get below that, and it's already about as close as we can get to it.

Isn't Pointer Tagging Dodgy?

The details of the layout, and why its implemented the way it is, are explained in the header comment of library/std/src/io/error/repr_bitpacked.rs. There's probably more details than there need to be, but I didn't trim it down that much, since there's a lot of stuff I did deliberately, that might have not seemed that way.

There's actually only one variant holding a pointer which gets tagged. This one is the (holder for the) user-provided error.

I believe the scheme used to tag it is not UB, and that it preserves pointer provenance (even though often pointer tagging does not) because the tagging operation is just core::ptr::add, and untagging is core::ptr::sub. The result of both operations lands inside the original allocation, so it would follow the safety contract of core::ptr::{add,sub}.

The other pointer this had to encode is not tagged — or rather, the tagged repr is equivalent to untagged (it's tagged with 0b00, and has >=4b alignment, so we can reuse the bottom bits). And the other variants we encode are just integers, which (which can be untagged using bitwise operations without worry — they're integers).

CC @RalfJung for the stuff in repr_bitpacked.rs, as my comments are informed by a lot of the UCG work, but it's possible I missed something or got it wrong (even if the implementation is okay, there are parts of the header comment that says things like "We can't do $x" which could be false).

Why So Many Changes?

The repr change was mostly internal, but changed one widely used API: I had to switch how io::Error::new_const works.

This required switching io::Error::new_const to take the full message data (including the kind) as a &'static, rather than just the string. This would have been really tedious, but I made a macro that made it much simpler, but it was a wide change since io::Error::new_const is used everywhere.

This included changing files for a lot of targets I don't have easy access to (SGX? Haiku? Windows? Who has heard of these things), so I expect there to be spottiness in CI initially, unless luck is on my side.

Anyway this large only tangentially-related change is all in the first commit (although that commit also pulls the previous repr out into its own file), whereas the packing stuff is all in commit 2.

P.S. I haven't looked at all of this since writing it, and will do a pass over it again later, sorry for any obvious typos or w/e. I also definitely repeat myself in comments and such.

(It probably could use more tests too. I did some basic testing, and made it so we debug_assert! in cases the decode isn't what we encoded, but I don't know the degree which I can assume libstd's testing of IO would exercise this. That is: it wouldn't be surprising to me if libstds IO testing were minimal, especially around error cases, although I have no idea).

rust-highfive · 2021-08-08T17:10:20Z

r? @yaahc

(rust-highfive has picked a reviewer for you, use r? to override)

library/std/src/io/error/repr_bitpacked.rs

RalfJung

I only read the comments in repr_bitpacked carefully, and modulo some small comments that all sounds very sensible. :)

library/std/src/io/error/repr_bitpacked.rs

library/std/src/io/error.rs

library/std/src/io/error/repr_bitpacked.rs

bors · 2021-09-02T06:05:15Z

☔ The latest upstream changes (presumably #83342) made this pull request unmergeable. Please resolve the merge conflicts.

JohnCSimon · 2021-09-20T02:33:51Z

Ping from triage:

@thomcc Can you please address the merge conflict?

thomcc · 2021-09-20T02:43:11Z

~~Sure, although it might not be until the start of October.~~ I got to it, but it was a bit later >_>

bors · 2021-11-27T11:17:01Z

☔ The latest upstream changes (presumably #90846) made this pull request unmergeable. Please resolve the merge conflicts.

kornelski · 2021-12-02T14:29:34Z

Maybe to help with the merge conflicts, first land a PR that only wraps the new_const function in a macro call, with no other changes?

JohnCSimon · 2022-01-30T04:07:04Z

Ping from triage:
@thomcc I see a lot of merge conflicts but the PR hasnt been touched in three months. Closing as inactive, feel free to reopen.

@rustbot label: +S-inactive

thomcc · 2022-01-30T04:18:41Z

I think this was actually waiting on the reviewer — they hadn't updated since oct. The merge errors are because it modified all creation sites of errors in libstd, so they accumulate very easy, even though they're mostly mechanical. Had I gotten any indication it would be reviewed I'd have updated it.

Anyway, I'll try and rebase it in a week or so.

JohnCSimon · 2022-01-30T04:55:47Z

I think this was actually waiting on the reviewer

Sorry @thomcc it was marked as S-waiting-on-author
I'll switch it over.

@rustbot label: -S-waiting-on-author +S-waiting-on-review

yaahc · 2022-01-30T20:54:01Z

Oops, sorry about that 😅, no idea why I swapped it to waiting on author, I'll take a look at this next week.

Co-authored-by: Ralf Jung <post@ralfj.de>

Co-authored-by: the8472 <the8472@users.noreply.github.com>

thomcc · 2022-02-05T06:55:47Z

Updated to address tests, and address some comments that came outside of this issue.

Also, I noticed the NonNull::new_unchecked safety doc in Repr::new_custom had gotten stale (it said "see above", referring to the safety comment for ptr::add, which I had made more hand-wavey after switching to wrapping_add).

I also added a few more assertions to ensure that none of the tags use bits outside TAG_MASK, and replace some 4's with TAG_MASK + 1 since it seems likely we will add more variants in the future, which would require another tag bit (which won't be a problem).

…n't quite accurate anymore

yaahc · 2022-02-07T20:24:33Z

I think this all looks good to go to me! Thanks again for the amazing work @thomcc

@bors r+ rollup=never

bors · 2022-02-07T20:24:35Z

📌 Commit 9cbe994 has been approved by yaahc

bors · 2022-02-07T20:33:04Z

⌛ Testing commit 9cbe994 with merge 734368a...

bors · 2022-02-07T23:40:20Z

☀️ Test successful - checks-actions
Approved by: yaahc
Pushing 734368a to master...

rust-timer · 2022-02-08T01:12:17Z

Finished benchmarking commit (734368a): comparison url.

Summary: This benchmark run shows 22 relevant improvements 🎉 but 13 relevant regressions 😿 to instruction counts.

Average relevant regression: 3.4%
Average relevant improvement: -1.1%
Largest improvement in instruction counts: -4.3% on full builds of piston-image opt
Largest regression in instruction counts: 11.2% on full builds of issue-46449 opt

If you disagree with this performance assessment, please file an issue in rust-lang/rustc-perf.

Next Steps: If you can justify the regressions found in this perf run, please indicate this with @rustbot label: +perf-regression-triaged along with sufficient written justification. If you cannot justify the regressions please open an issue or create a new PR that fixes the regressions, add a comment linking to the newly created issue or PR, and then add the perf-regression-triaged label to this PR.

@rustbot label: +perf-regression

thomcc · 2022-02-08T06:21:14Z

Hmm, interesting. The regressions seem to be in layout computation (conservative_is_privately_uninhabited, layout_of, etc, if I'm reading it right). I guess that makes sense, that was the point and stuff like Result<T, io::Error> is pretty common (same reason that the change seemed worthwhile).

Hmmmmm...

thomcc · 2022-02-08T07:09:37Z

Looking more closely, the regressions are... almost all versions of issue-46449 with different build flags. This seems fairly synthetic (while it was derived from real code, it's pretty far from it now, and AFAICT that code had performance problems for reasons wholly unrelated to layout).

Thankfully, the benches with perf deltas that seem to more realistic crates, seem to be more consistently green -- it's not 100% green in the real usage, but its a heck of a lot more consistent, and for the most part seems like the code that uses io::Error more benefits more.

(Honestly... if you ignore the issue-46449 family, it's feels like a pretty consistent (but not 100%) win for compile times, and mostly in checks of real code. But OTOH, perhaps you can always say "if you ignore the thing that my code makes worse, you'll find that my code makes everything better")

Anyway, it kinda seems fine to regress a synthetic benchmark like that in this case (where there are runtime benefits in very commonly used types), especially given how many of the non-synthetic benchmarks had a compile-time improvement. But it may be unsurprising that I'm in favor of my own patch.

(That said, I aslo I can't quite wrap my head around what I'd poke in the compiler improve this anyway (not more specifically than ✨ layout ✨, at least), so I'd appreciate pointers if it turns out I'm responsible for un-regressing it).

Mark-Simulacrum · 2022-02-08T17:06:41Z

I'm going to go ahead and mark this as triaged -- I tend to agree that this is likely not something to be directly addressed, and is limited to just one benchmark.

rust-highfive assigned yaahc Aug 8, 2021

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Aug 8, 2021

This comment has been minimized.

Sign in to view

nvzqz reviewed Aug 8, 2021

View reviewed changes

library/std/src/io/error/repr_bitpacked.rs Outdated Show resolved Hide resolved

jfrimmel reviewed Aug 9, 2021

View reviewed changes

library/std/src/io/error/repr_bitpacked.rs Outdated Show resolved Hide resolved

RalfJung reviewed Aug 10, 2021

View reviewed changes

the8472 reviewed Aug 10, 2021

View reviewed changes

library/std/src/io/error.rs Outdated Show resolved Hide resolved

library/std/src/io/error/repr_bitpacked.rs Outdated Show resolved Hide resolved

library/std/src/io/error/repr_bitpacked.rs Show resolved Hide resolved

yaahc added the A-error-handling Area: Error handling label Aug 16, 2021

thomcc mentioned this pull request Aug 18, 2021

Make Rust Errors more idiomatic build-trust/ockam#1655

Closed

inquisitivecrystal added the T-libs Relevant to the library team, which will review and decide on the PR/issue. label Aug 24, 2021

yaahc added the PG-error-handling Project group: Error handling (https://github.com/rust-lang/project-error-handling) label Sep 27, 2021

yaahc added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Oct 12, 2021

thomcc force-pushed the skinny-io-error branch from 1c66cff to decba7f Compare October 30, 2021 02:22

JohnCSimon closed this Jan 30, 2022

rustbot added the S-inactive Status: Inactive and waiting on the author. This is often applied to closed PRs. label Jan 30, 2022

JohnCSimon reopened this Jan 30, 2022

JohnCSimon removed the S-inactive Status: Inactive and waiting on the author. This is often applied to closed PRs. label Jan 30, 2022

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. and removed S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. labels Jan 30, 2022

thomcc and others added 5 commits February 4, 2022 18:47

Fix comment typos noticed by code review.

9f7eb7d

Co-authored-by: Ralf Jung <post@ralfj.de>

Update library/std/src/io/error/repr_bitpacked.rs

06edf08

Co-authored-by: the8472 <the8472@users.noreply.github.com>

Elaborate some in the documentation and respond to some review comments

f950edb

Use wrapping pointer arithmetic in the bitpacked io::Error

e98c7f7

Update documentation somewhat

a17a896

thomcc force-pushed the skinny-io-error branch from 0bb5b76 to 1c1286e Compare February 5, 2022 06:38

Add more tests for io::Error packing, and fix some comments that were…

9cbe994

…n't quite accurate anymore

thomcc force-pushed the skinny-io-error branch from 1c1286e to 9cbe994 Compare February 5, 2022 07:15

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Feb 7, 2022

bors added the merged-by-bors This PR was explicitly merged by bors. label Feb 7, 2022

bors merged commit 734368a into rust-lang:master Feb 7, 2022

rustbot added this to the 1.60.0 milestone Feb 7, 2022

bors mentioned this pull request Feb 7, 2022

Add ReadBufRef #93359

Closed

rustbot added the perf-regression Performance regression. label Feb 8, 2022

ChrisDenton mentioned this pull request Feb 8, 2022

std::path::absolute #91673

Merged

Mark-Simulacrum added the perf-regression-triaged The performance regression has been triaged. label Feb 8, 2022

compiler-errors mentioned this pull request Mar 10, 2022

macros in static-assertions crate no longer compile #94791

Closed

compiler-errors mentioned this pull request Mar 23, 2022

io::Error is UnwindSafe, but only on 64-bit platforms #95203

Closed

thomcc deleted the skinny-io-error branch March 24, 2022 00:09

jonhoo mentioned this pull request Sep 14, 2022

Error::source and Error::cause do not expose immediate error in custom io::Error #101817

Open

Make io::Error use 64 bits on targets with 64 bit pointers. #87869

Make io::Error use 64 bits on targets with 64 bit pointers. #87869

Uh oh!

Conversation

thomcc commented Aug 8, 2021

Isn't Pointer Tagging Dodgy?

Why So Many Changes?

Uh oh!

rust-highfive commented Aug 8, 2021

Uh oh!

This comment has been minimized.

Uh oh!

Uh oh!

RalfJung left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

bors commented Sep 2, 2021

Uh oh!

JohnCSimon commented Sep 20, 2021

Uh oh!

thomcc commented Sep 20, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bors commented Nov 27, 2021

Uh oh!

kornelski commented Dec 2, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

JohnCSimon commented Jan 30, 2022

Uh oh!

thomcc commented Jan 30, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

JohnCSimon commented Jan 30, 2022

Uh oh!

yaahc commented Jan 30, 2022

Uh oh!

thomcc commented Feb 5, 2022

Uh oh!

yaahc commented Feb 7, 2022

Uh oh!

bors commented Feb 7, 2022

Uh oh!

bors commented Feb 7, 2022

Uh oh!

bors commented Feb 7, 2022

Uh oh!

rust-timer commented Feb 8, 2022

Uh oh!

thomcc commented Feb 8, 2022

Uh oh!

thomcc commented Feb 8, 2022

Uh oh!

Mark-Simulacrum commented Feb 8, 2022

Uh oh!

Uh oh!

thomcc commented Sep 20, 2021 •

edited

Loading

kornelski commented Dec 2, 2021 •

edited

Loading

thomcc commented Jan 30, 2022 •

edited

Loading