[DO NOT LAND] Regress the `token-stream-stress` benchmark. #67248

nnethercote · 2019-12-12T02:32:09Z

I have a suspicion that there is a bug in rustc-perf or rust-timer
causing the wrong revisions to be measured by CI. See #66405 and #67079
for more details.

This commit deliberately causes a massive regression to the
token-stream-stress benchmark. On my machine, the instruction count
goes from 313M to 6084M, an 1843.4% regression. I want to see if a CI
run replicates that.
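
For reference, the percentage quoted above is the relative increase in instruction count. The sketch below is only an illustration: the function name `percent_change` is hypothetical, and the inputs are the rounded figures from this description (313M and 6084M), so the result agrees with the reported 1843.4% only approximately.

```python
def percent_change(before: float, after: float) -> float:
    """Relative change from `before` to `after`, as a percentage."""
    if before == 0:
        raise ValueError("baseline must be non-zero")
    return (after - before) / before * 100.0


# Rounded instruction counts taken from the description above.
baseline = 313e6
regressed = 6084e6

change = percent_change(baseline, regressed)
print(f"{change:.1f}% regression")
```

Because the inputs are rounded to the nearest million, a small gap between this value and the figure measured on the unrounded counts is expected.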

cc @Mark-Simulacrum
r? @ghost

nnethercote · 2019-12-12T02:32:22Z

@bors try @rust-timer queue

rust-timer · 2019-12-12T02:32:24Z

Awaiting bors try build completion

bors · 2019-12-12T02:32:33Z

⌛ Trying commit 2d5843d with merge 5e1e02e...

@Mark-Simulacrum

nnethercote · 2019-12-12T03:18:33Z

Here are the top 10 check-clean entries from a local run:

token-stream-stress-check
        avg: 1843.4%    min: 1843.4%    max: 1843.4%
helloworld-check
        avg: -0.1%      min: -0.1%      max: -0.1%
issue-46449-check
        avg: -0.1%      min: -0.1%      max: -0.1%
unify-linearly-check
        avg: -0.0%      min: -0.0%      max: -0.0%
html5ever-check
        avg: 0.0%       min: 0.0%       max: 0.0%
deeply-nested-check
        avg: -0.0%      min: -0.0%      max: -0.0%
await-call-tree-check
        avg: -0.0%      min: -0.0%      max: -0.0%
coercions-check
        avg: -0.0%?     min: -0.0%?     max: -0.0%?
serde-check
        avg: 0.0%       min: 0.0%       max: 0.0%
syn-check
        avg: -0.0%      min: -0.0%      max: -0.0%

It shows a huge regression for token-stream-stress, and negligible other changes. Let's see if the CI run matches that.

bors · 2019-12-12T05:04:25Z

☀️ Try build successful - checks-azure
Build commit: 5e1e02e (5e1e02e73da308be8b0908637628027265f123a7)

rust-timer · 2019-12-12T05:04:27Z

Queued 5e1e02e with parent de0abf7, future comparison URL.

rust-timer · 2019-12-12T07:23:33Z

Finished benchmarking try commit 5e1e02e, comparison URL.

nnethercote · 2019-12-12T09:59:41Z

CI results match my local results. Top 10 check-clean results:

token-stream-stress-check
        avg: 1695.9%    min: 1695.9%    max: 1695.9%
helloworld-check
        avg: 0.4%       min: 0.4%       max: 0.4%
issue-46449-check
        avg: 0.2%       min: 0.2%       max: 0.2%
await-call-tree-check
        avg: 0.2%       min: 0.2%       max: 0.2%
unify-linearly-check
        avg: 0.2%       min: 0.2%       max: 0.2%
deeply-nested-check
        avg: 0.2%       min: 0.2%       max: 0.2%
ripgrep-check
        avg: 0.1%       min: 0.1%       max: 0.1%
futures-check
        avg: 0.1%       min: 0.1%       max: 0.1%
regression-31157-check
        avg: 0.1%       min: 0.1%       max: 0.1%
regex-check
        avg: 0.0%       min: 0.0%       max: 0.0%

There is a bit more variation among the barely-changing benchmarks, but the result disproves the theory that CI is measuring the wrong revisions.
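
The comparison between the local and CI runs can also be checked mechanically. The following is a minimal sketch, not part of rustc-perf: `parse_summary`, `outliers`, and the 1.0% noise threshold are all assumptions made for illustration, and the inputs are shortened excerpts of the two listings above.

```python
import re

# Matches a benchmark name on one line followed by an indented "avg: N%" line.
ENTRY = re.compile(r"^(\S+)\n\s+avg:\s*(-?\d+(?:\.\d+)?)%", re.MULTILINE)


def parse_summary(text: str) -> dict[str, float]:
    """Map each benchmark name to its average percentage change."""
    return {name: float(avg) for name, avg in ENTRY.findall(text)}


def outliers(summary: dict[str, float], threshold: float = 1.0) -> list[str]:
    """Benchmarks whose average change exceeds the (assumed) noise threshold."""
    return sorted(name for name, avg in summary.items() if abs(avg) > threshold)


# Excerpts of the local and CI results quoted in this thread.
local = """token-stream-stress-check
        avg: 1843.4%    min: 1843.4%    max: 1843.4%
helloworld-check
        avg: -0.1%      min: -0.1%      max: -0.1%
"""
ci = """token-stream-stress-check
        avg: 1695.9%    min: 1695.9%    max: 1695.9%
helloworld-check
        avg: 0.4%       min: 0.4%       max: 0.4%
"""

# Both runs should flag the same single benchmark as regressed.
print(outliers(parse_summary(local)) == outliers(parse_summary(ci)))
```

Comparing the set of outliers rather than the exact percentages tolerates the difference between 1843.4% and 1695.9%, which comes from measuring on different machines.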

nnethercote closed this Dec 12, 2019

nnethercote deleted the regress-token-stream-stress branch December 12, 2019 09:59

nnethercote mentioned this pull request Dec 12, 2019

Optimize shallow_resolve_changed #67079

Merged

nnethercote mentioned this pull request Dec 21, 2019

Revert parts of #66405. #67471

Merged
