Apply some obvious rich debug info optimizations #73373

jakobbotsch · 2022-08-04T13:52:33Z

When rich debug info is enabled we send MethodDetails events for all inlinees as otherwise no information for those may have been sent (in case they have not been jitted). During rundown this was sending a lot of duplicate events, so use a hash set to avoid this. On Avalonia.ILSpy which has around ~33k methods jitted, we go from ~94k MethodDetails events to ~43k MethodDetails events on rundown.

In addition, compress rich debug info when stored in the runtime. For the above scenario the memory overhead of rich debug info goes from 7 MB -> 2.1 MB.

Also optimize the format we use for the ETW events. While we do not use the same delta compression as the runtime storage, we now avoid sending out padding and also use a smaller type for the offset mappings source type. The average size of a rich debug info ETW event goes from 259 bytes to 219 bytes for the above scenario.

The net result is the following for two rundowns of Avalonia.ILSpy (I clicked around a bit and did a few actions before enabling the rundown. EventID(189) is the rich debug info event)
Before:

After:

There should not be any effect of this PR except when rich debug info is enabled.

I would like to get this in .NET 7. There is more work to be done here for .NET 8, e.g. by avoiding storing method handles when we can get it based on IL offset, but this fixes the most glaring issues.

When rich debug info is enabled we send MethodDetails events for all inlinees as otherwise no information for those may have been sent (in case they have not been jitted). During rundown this was sending a lot of duplicate events.

* Compress rich debug info when stored in the runtime. For Avalonia.ILSpy which has ~33k methods, the memory overhead of rich debug info goes from 7 MB -> 2.1 MB * ETW events do not use the delta compression, but still optimize them a little by avoiding sending out the padding and unnecessarily large types. The average size of a rich debug info ETW event goes from 259 bytes to 219 bytes for ILSpy

jakobbotsch · 2022-08-04T14:01:58Z

I'll make sure to run the diagnostics tests on this and will verify manually that some basic debugging scenarios work as well.

davidwrighton · 2022-08-04T18:21:11Z

src/coreclr/vm/eventtrace.cpp

+ WriteToBuffer(&pBuffer, numMappings);
+ for (uint32_t i = 0; i < numInlineTree; i++)
+ {
+ WriteToBuffer(&pBuffer, inlineTree[i].Method);


Use of WriteToBuffer is a concern for me.

The definition of these types does not have comments/description that the field's types are part of a contract that is never permitted to change.

This is an inefficient encoding. For instance, ILOffset is unlikely to use the full 32 bits that is being sent here, and the Child/Sibling are highly unlikely to be large as well.

We discussed offline. I'm ok with the current set of changes.

The inline ordinals may not be monotonically increasing and the existing DoEncodedDeltaU32 is unnecessarily inefficient for the cases where it isn't (plus, will assert).

Gets us an extra ~10% size reduction in my tests

jakobbotsch · 2022-08-05T12:35:36Z

I added a DoEncodedDeltaU32NonMonotonic as I realized the existing DoEncodedDeltaU32 assumes monotonicity, and I was hitting asserts when testing with checked builds since inlinee IDs may not be monotonic (this would translate to inefficient compression in release builds). I also made the IL offset encoding use it, which nets us another ~10% reduction in memory usage for the debug info.

I've verified that some basic debugging/stepping scenarios work, and that the diagnostics tests pass, with both rich debug info on and off.

jakobbotsch · 2022-08-06T11:42:38Z

Failure is #73247

jakobbotsch added 2 commits August 4, 2022 15:32

dotnet-issue-labeler bot added the area-VM-coreclr label Aug 4, 2022

jakobbotsch added this to the 7.0.0 milestone Aug 4, 2022

ghost assigned jakobbotsch Aug 4, 2022

jakobbotsch requested a review from davidwrighton August 4, 2022 13:52

jakobbotsch added 2 commits August 4, 2022 15:56

Nit

4d8fe49

Another nit

4e7b387

jakobbotsch added 2 commits August 4, 2022 16:30

Fix x86 build

e657855

Fix checked build

244c222

davidwrighton reviewed Aug 4, 2022

View reviewed changes

davidwrighton approved these changes Aug 4, 2022

View reviewed changes

jakobbotsch added 3 commits August 5, 2022 11:52

Add a contract

e67c05f

Add DoEncodedDeltaU32NonMonotonic and use it

b3cf722

The inline ordinals may not be monotonically increasing and the existing DoEncodedDeltaU32 is unnecessarily inefficient for the cases where it isn't (plus, will assert).

Use non-monotonic delta compression for IL offsets

5e43ca0

Gets us an extra ~10% size reduction in my tests

This was referenced Aug 5, 2022

Infra improvements for Helix #68176

Closed

GC/API/GC/GetGCMemoryInfo/GetGCMemoryInfo.sh test failing intermittently on CoreCLR Linux ARM32 #73247

Closed

jakobbotsch merged commit 2db51aa into dotnet:main Aug 6, 2022

jakobbotsch deleted the optimize-rich-debug-info branch August 6, 2022 11:43

ghost locked as resolved and limited conversation to collaborators Sep 5, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Apply some obvious rich debug info optimizations #73373

Apply some obvious rich debug info optimizations #73373

jakobbotsch commented Aug 4, 2022

jakobbotsch commented Aug 4, 2022

davidwrighton Aug 4, 2022

davidwrighton Aug 4, 2022

jakobbotsch commented Aug 5, 2022

jakobbotsch commented Aug 6, 2022

Apply some obvious rich debug info optimizations #73373

Apply some obvious rich debug info optimizations #73373

Conversation

jakobbotsch commented Aug 4, 2022

jakobbotsch commented Aug 4, 2022

davidwrighton Aug 4, 2022

Choose a reason for hiding this comment

davidwrighton Aug 4, 2022

Choose a reason for hiding this comment

jakobbotsch commented Aug 5, 2022

jakobbotsch commented Aug 6, 2022