Make `Rc<T>::deref` and `Arc<T>::deref` zero-cost #132553

EFanZh · 2024-11-03T07:48:34Z

Currently, Rc<T> and Arc<T> store pointers to RcInner<T> and ArcInner<T>. This PR changes the pointers so that they point to T directly instead.

This is based on the assumption that we access the T value more frequently than accessing reference counts. With this change, accessing the data can be done without offsetting pointers from RcInner<T> and ArcInner<T> to their contained data. This change might also enables some possibly useful future optimizations, such as:

Convert &[Rc<T>] into &[&T] within O(1) time.
Convert &[Rc<T>] into Vec<&T> utilizing memcpy.
Convert &Option<Rc<T>> into Option<&T> without branching.
Make Rc<T> and Arc<T> FFI compatible types where T: Sized.

rustbot · 2024-11-03T07:48:42Z

r? @jhpratt

rustbot has assigned @jhpratt.
They will have a look at your PR within the next two weeks and either review your PR or reassign to another reviewer.

Use r? to explicitly pick a reviewer

marmeladema · 2024-11-03T09:22:09Z

Would it potentially enable those types to have an ffi compatible ABI? So that they could be returned and passed directly from /to ffi function, like Box?

EFanZh · 2024-11-03T10:36:08Z

Would it potentially enable those types to have an ffi compatible ABI? So that they could be returned and passed directly from /to ffi function, like Box?

I think in theory it is possible, at least for sized types, but I am not familiar with how to formally make it so.

jhpratt · 2024-11-03T23:07:24Z

r? libs

joboet · 2024-11-07T16:17:52Z

@EFanZh Is this ready for review? If so, please un-draft the PR.

EFanZh · 2024-11-07T17:12:22Z

@joboet: The source code part is mostly done, but I haven’t finished updating LLDB and CDB pretty printers. The CI doesn’t seem to run those tests.

joboet · 2024-11-08T12:56:42Z

No worries! I just didn't want to keep you waiting in case you had forgotten to change the state.
@rustbot author

EFanZh · 2025-03-29T15:31:25Z

But the largest thing that stands out to me is that there's a lot of tiny functions. Although they have descriptive names and are documented, it made it somewhat harder to follow for me. For the ones used only once, it might make sense to inline them for readability.

@thaliaarchi: Do you mean the allocation functions? I have checked those functions, most functions are used more than once, and the function that are used once are used for unifying non-generic codes used in generic functions.

@rustbot ready

bors · 2025-03-29T23:36:36Z

☔ The latest upstream changes (presumably #133572) made this pull request unmergeable. Please resolve the merge conflicts.

bors · 2025-04-28T05:41:11Z

☔ The latest upstream changes (presumably #140378) made this pull request unmergeable. Please resolve the merge conflicts.

bors · 2025-05-07T04:43:54Z

☔ The latest upstream changes (presumably #140726) made this pull request unmergeable. Please resolve the merge conflicts.

joboet · 2025-05-15T10:38:24Z

This PR is way too large for me to review. Is there any way you could split this up? E.g. by moving the Arc changes to a different PR...

EFanZh · 2025-05-15T12:25:02Z

@joboet: The majority codes are in alloc::raw_rc module, which has to be submitted with at least one of alloc::rc and alloc::sync changes, otherwise they will be unused codes. Do you accept that I split this PR into these two PRs?

PR containing alloc::raw_rc + alloc::rc changes: this one will still contain a large chunk of code, which I don’t have a good way to further split.
PR containing alloc::sync changes.

thaliaarchi · 2025-05-15T23:23:28Z

Alternatively, an approach to consider, if splitting horizontally isn’t feasible, you could split it vertically: Split it into a commit history of discrete, atomic steps progressing towards the goal of the PR. Even if it needs to be merged all at once, this can aid reviewing.

I know the Rust project prefers smaller PRs, though, so that should be preferred if possible, but the approaches could be combined.

joboet · 2025-05-16T12:29:55Z

@joboet: The majority codes are in alloc::raw_rc module, which has to be submitted with at least one of alloc::rc and alloc::sync changes, otherwise they will be unused codes. Do you accept that I split this PR into these two PRs?
1. PR containing `alloc::raw_rc` + `alloc::rc` changes: this one will still contain a large chunk of code, which I don’t have a good way to further split.

2. PR containing `alloc::sync` changes.

That sounds very reasonable!

EFanZh · 2025-05-24T14:31:32Z

@joboet: I have split this PR into two commits:

The first commit contains alloc::raw_rc and alloc::rc changes, which has also be submitted separately in Make Rc<T>::deref zero-cost #141348.
The second commit contains alloc::sync changes.

Make `Rc<T>::deref` zero-cost This PR makes `Rc::deref` zero-cost by changing the internal pointer so that it points to the value directly instead of the allocation. This is split out from #132553, which will also make `Arc::deref` zero-cost.

bors · 2025-06-27T04:13:49Z

☔ The latest upstream changes (presumably #143074) made this pull request unmergeable. Please resolve the merge conflicts.

rustbot assigned jhpratt Nov 3, 2024

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. T-libs Relevant to the library team, which will review and decide on the PR/issue. labels Nov 3, 2024

EFanZh force-pushed the zero-cost-rc-arc-deref branch from b283c44 to ae36f44 Compare November 3, 2024 09:14