rustc_metadata: replace Entry table with one table for each of its fields (AoS -> SoA). #59953
Conversation
r? @zackmdavis (rust_highfive has picked a reviewer for you, use r? to override)
r? @michaelwoerister cc @Zoxc @nnethercote @bors try
rustc_metadata: replace Entry table with one table for each of its fields (AoS -> SoA). In #59789 (comment) I noticed that for many cross-crate queries (e.g. `predicates_of(def_id)`), we were deserializing the `rustc_metadata::schema::Entry` for `def_id` *only* to read one field (i.e. `predicates`). But there are several such queries, and `Entry` is not particularly small (in terms of number of fields; the encoding itself is quite compact), so there is a large (and unnecessary) constant factor. This PR replaces the (random-access) array¹ of `Entry` structures ("AoS") with many separate arrays¹, one for each field that used to be in `Entry` ("SoA"), resulting in the ability to read individual fields separately, with negligible time overhead (in theory) and some size overhead (as these arrays are not sparse). For stage1 `libcore`'s metadata blob, the size overhead is `8.44%`, and I have another commit (not initially included in this PR because I want to do perf runs with both) that brings it down to `5.88%`. ¹(in the source, these arrays are called "tables", but perhaps they could use a better name)
☀️ Try build successful - checks-travis
@rust-timer build 3919374
Success: Queued 3919374 with parent 0085672, comparison URL.
Finished benchmarking try commit 3919374
Performance looks good except for
Can you please be more specific with your performance comments? Instruction counts for everything look great, including
@Zoxc Oh, wow, that's a huge gap between instructions and cycles. Also, I've pushed the extra commit I mentioned in the PR description, let's see what that does! @bors try
rustc_metadata: replace Entry table with one table for each of its fields (AoS -> SoA). *Based on top of #59887* In #59789 (comment) I noticed that for many cross-crate queries (e.g. `predicates_of(def_id)`), we were deserializing the `rustc_metadata::schema::Entry` for `def_id` *only* to read one field (i.e. `predicates`). But there are several such queries, and `Entry` is not particularly small (in terms of number of fields; the encoding itself is quite compact), so there is a large (and unnecessary) constant factor. This PR replaces the (random-access) array¹ of `Entry` structures ("AoS") with many separate arrays¹, one for each field that used to be in `Entry` ("SoA"), resulting in the ability to read individual fields separately, with negligible time overhead (in theory) and some size overhead (as these arrays are not sparse). In a way, the new approach is closer to incremental on-disk caches, which store each query's cached results separately, but it would take significantly more work to unify the two. For stage1 `libcore`'s metadata blob, the size overhead is `8.44%`, and I have another commit (not initially included in this PR because I want to do perf runs with both) that brings it down to `5.88%`. ¹(in the source, these arrays are called "tables", but perhaps they could use a better name)
☀️ Try build successful - checks-travis
@rust-timer build 0403760
Success: Queued 0403760 with parent 60076bb, comparison URL.
Finished benchmarking try commit 0403760
Thanks a lot for the PR, @eddyb! Looks like a nice improvement.
Keep in mind most commits are refactors that don't change the encoded metadata (or do so without changing its size).
@Zoxc looks like the latest numbers are better?
It's possible,
Ugh, these numbers are poisoned by the huge delta in 0085672...60076bb. I'll have to open another PR for the version without the last commit, and start the try builds at the same time...
⌛ Trying commit d89dddc with merge b2a5ec95e0c4044dafb3cc99fcf71c2db186bb42...
☀️ Try build successful - checks-azure
@rust-timer build b2a5ec95e0c4044dafb3cc99fcf71c2db186bb42
Queued b2a5ec95e0c4044dafb3cc99fcf71c2db186bb42 with parent 437ca55, future comparison URL.
Finished benchmarking try commit b2a5ec95e0c4044dafb3cc99fcf71c2db186bb42, comparison URL.
(Note that the total is compressed with XZ, but also includes
📌 Commit d89dddc has been approved by
@bors rollup=never
rustc_metadata: replace Entry table with one table for each of its fields (AoS -> SoA). In #59789 (comment) I noticed that for many cross-crate queries (e.g. `predicates_of(def_id)`), we were deserializing the `rustc_metadata::schema::Entry` for `def_id` *only* to read one field (i.e. `predicates`). But there are several such queries, and `Entry` is not particularly small (in terms of number of fields; the encoding itself is quite compact), so there is a large (and unnecessary) constant factor. This PR replaces the (random-access) array¹ of `Entry` structures ("AoS") with many separate arrays¹, one for each field that used to be in `Entry` ("SoA"), resulting in the ability to read individual fields separately, with negligible time overhead (in theory) and some size overhead (as these arrays are not sparse). In a way, the new approach is closer to incremental on-disk caches, which store each query's cached results separately, but it would take significantly more work to unify the two. For stage1 `libcore`'s metadata blob, the size overhead is `8.44%`, and I have another commit (~~not initially included because I want to do perf runs with both~~ **EDIT**: added it now) that brings it down to `5.88%`. ¹(in the source, these arrays are called "tables", but perhaps they could use a better name)
☀️ Test successful - checks-azure |
…ables, r=michaelwoerister rustc_metadata: use a table for super_predicates, fn_sig, impl_trait_ref. This is an attempt at a part of rust-lang#65407, i.e. moving parts of cross-crate "metadata" into tables that match queries more closely. Three new tables should be enough to see some perf/metadata size changes. (need to do something similar to rust-lang#59953 (comment)) There are other bits of data that could be made into tables, but they can be more compact so the impact would likely be not as bad, and they're also more work to set up.
…imulacrum rustc_metadata: simplify the interactions between Lazy and Table. These are small post-rust-lang#59953 cleanups (including undoing some contrivances from that PR). r? @michaelwoerister