Switch self profile to use HW counters instead of walltime #1647

Kobzol · 2023-07-08T15:24:12Z

This PR changes the gathered self profile to use instructions as a metric, instead of wall time. There are some problems with this:

This PR simply switches the interpretation of the data to instructions. However, old artifacts will have values stored in the DB as walltime. We could record the metric (instructions/walltime) in the DB, so that we show e.g. the correct units on the detailed query page table, but this does not change the fact that there will be some pairs of artifacts where one side will have walltime, and the other instructions, and these will be basically uncomparable.
Instructions will be added together for all threads, which might be a bit misleading where parallelism is used.

Fixes: #1345

Mark-Simulacrum

I think this is broadly reasonable - I'm not too worried about the N commits that happen in the transit time, but maybe we can temporarily add a note: message to the HTML noting that? I don't think we have easy ways of tying that back to the deployed version of rustc-perf (though I seem to recall the information being persisted into the DB), so it would just be unconditional.

Adding together across threads also seems like best we can do and is already done for the general view.

Kobzol · 2023-07-18T08:24:18Z

I added the note to the detailed query page and added more documentation about this to the code. I also changed the formatting of values in the self profile table.

Kobzol · 2023-08-03T11:23:52Z

I tried locally that combining perf stat -e instructions and measureme using instructions doesn't seem to affect the results (I'm not sure if it's OK in general to "nest" PMU counter recording).

@Mark-Simulacrum I don't have any more ideas on how to test this further locally. I guess that we'll just have to merge it and see what happens :) Feel free to do that.

bjorn3 · 2023-08-03T12:16:41Z

https://perf.wiki.kernel.org/index.php/Tutorial

Furthermore, the perf_events interface allows multiple tools to measure the same thread or CPU at the same time.

Kobzol · 2023-08-03T12:33:06Z

Yeah it definitely seems like it's supported, I just wanted to be sure that when I measure the same event (instructions) twice, it won't take two counter slots (to avoid possible multiplexing).

Events are currently managed in round-robin fashion. Therefore each event will eventually get a chance to run. If there are N counters, then up to the first N events on the round-robin list are programmed into the PMU. In certain situations it may be less than that because some events may not be measured together or they compete for the same counter. Furthermore, the perf_events interface allows multiple tools to measure the same thread or CPU at the same time. Each event is added to the same round-robin list. There is no guarantee that all events of a tool are stored sequentially in the list.

It's not clear to me whether this (counter sharing/multiplexing) can happen when two processes measure the exact same event.

Kobzol requested a review from Mark-Simulacrum July 8, 2023 15:24

Mark-Simulacrum approved these changes Jul 16, 2023

View reviewed changes

Kobzol force-pushed the self-profile-hw-counters branch from 2fcbe78 to 5367565 Compare July 17, 2023 19:30

Switch self profile to use HW counters instead of walltime

0b72313

Kobzol force-pushed the self-profile-hw-counters branch from 5367565 to 0b72313 Compare July 18, 2023 07:20

Kobzol marked this pull request as ready for review July 18, 2023 07:39

Kobzol added 2 commits July 18, 2023 09:46

Document the switch to HW counters

7162f86

Change formatting of self-profile metrics

47e20b8

Kobzol force-pushed the self-profile-hw-counters branch from 07fa96c to 47e20b8 Compare July 18, 2023 08:24

Kobzol merged commit f174358 into rust-lang:master Aug 12, 2023

Kobzol deleted the self-profile-hw-counters branch August 12, 2023 11:56

This was referenced Aug 12, 2023

Revert "Switch self profile to use HW counters instead of walltime" #1700

Merged

Use hardware performance counter data for the detailed/self-profile data view #1345

Open

Kobzol mentioned this pull request Oct 4, 2024

Switch self profile to use HW counters instead of walltime (attempt 2) #1984

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Switch self profile to use HW counters instead of walltime #1647

Switch self profile to use HW counters instead of walltime #1647

Uh oh!

Kobzol commented Jul 8, 2023 •

edited

Loading

Uh oh!

Mark-Simulacrum left a comment

Uh oh!

Kobzol commented Jul 18, 2023

Uh oh!

Kobzol commented Aug 3, 2023

Uh oh!

bjorn3 commented Aug 3, 2023

Uh oh!

Kobzol commented Aug 3, 2023

Uh oh!

Uh oh!

Switch self profile to use HW counters instead of walltime #1647

Switch self profile to use HW counters instead of walltime #1647

Uh oh!

Conversation

Kobzol commented Jul 8, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Mark-Simulacrum left a comment

Choose a reason for hiding this comment

Uh oh!

Kobzol commented Jul 18, 2023

Uh oh!

Kobzol commented Aug 3, 2023

Uh oh!

bjorn3 commented Aug 3, 2023

Uh oh!

Kobzol commented Aug 3, 2023

Uh oh!

Uh oh!

Kobzol commented Jul 8, 2023 •

edited

Loading