ChildSyntaxList.ItemInternal optimization #73650
Conversation
@jaredpar -- Who would be appropriate compiler reviewers?
For context, the "line walk" simulates how 'syntactic classification' works, where the editor calls into us to classify lines in view (including as the view is scrolled). So we get N calls to classify certain subspans of the document, and each of those N calls needs to walk the tree to find the tokens intersecting that subspan. Improvements to the tree walk make a big difference. Here, it's about 66% better (from 1,962.582ms to 629.993ms). This is because, to walk the tree, we're constantly hitting nodes and calling .ChildNodesAndTokens on them. Speeding that up from N^2 to linear for each node we hit makes a significant difference. We were looking at this because we were seeing expenses in Roslyn in classification/scrolling higher than what we wanted. This alone drops our current syntactic classification time by about 50%.
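To illustrate the span walk described above (a minimal Python sketch, not Roslyn code; all names here are hypothetical): classification descends only into children whose span intersects the requested span, and every level of that descent enumerates a node's children, which is why the cost of child enumeration dominates.

```python
# Hypothetical model of a span-pruned token walk. In Roslyn, the inner loop
# over `node.children` corresponds to enumerating .ChildNodesAndTokens, which
# is the operation the PR speeds up.
from dataclasses import dataclass, field

@dataclass
class Node:
    start: int
    length: int
    children: list = field(default_factory=list)  # empty list => a token

    @property
    def end(self):
        return self.start + self.length

def tokens_in_span(node, span_start, span_end):
    """Yield tokens whose span intersects [span_start, span_end)."""
    if node.end <= span_start or node.start >= span_end:
        return  # no overlap: prune this entire subtree
    if not node.children:
        yield node  # a token
        return
    for child in node.children:  # child enumeration happens at every level
        yield from tokens_in_span(child, span_start, span_end)
```

Each call to classify a visible subspan repeats a walk like this, so making the per-node child enumeration linear instead of quadratic compounds across the whole view.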
Note: while classification was the core scenario we cared about (as it is such a hot spot), this shows up everywhere, since we're constantly walking trees. Everything benefits from this :)
src/Compilers/Core/Portable/Syntax/ChildSyntaxList.Enumerator.cs
With my tests, I get this speedup on the walking algorithm that classification uses. Before:
After:
So basically 50% faster. Also, the classification walk is much better than .DescendantTokens with respect to memory:
That's 20x less memory. And nothing in gen0/gen1.
Refresh my memory: it was pretty invasive to try to get the DescendantTokens code to have an allocation profile similar to the manual walk done in WalkClassification, right?
I think it wouldn't be too hard to do. We'd want to potentially invest in some dedicated tests to ensure the same behavior as before (especially around zero-length tokens).
@dotnet/roslyn-compiler -- ptal, need 2nd review
/azp run
Azure Pipelines successfully started running 2 pipeline(s). |
@dotnet-policy-service rerun |
ChildSyntaxList.ItemInternal has shown up in many profiles that I've looked at. I finally dug into it a bit, and I think I've identified a potential optimization opportunity.
This optimization takes advantage of the fact that the first loop was previously always iterating through all the slots until it had covered `index` items. However, as this method is commonly called from inside the ChildSyntaxList.Enumerator, it can use knowledge from previous calls to start that first loop at a more appropriate location.
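The idea can be sketched as follows (a hypothetical Python model, not the Roslyn source): children live in "slots", and a slot may expand to several items, so mapping an item index to a slot requires a scan. Rescanning from slot 0 on every indexed access makes a full enumeration quadratic; letting the enumerator resume from the slot it found last time makes it linear overall.

```python
# Hypothetical model of the slot scan in ChildSyntaxList.ItemInternal.
# slot_sizes[k] = number of items slot k expands to.

class ChildList:
    def __init__(self, slot_sizes):
        self.slot_sizes = slot_sizes

    def item_naive(self, index):
        """Old behavior: scan from slot 0 until `index` items are covered."""
        covered = 0
        for slot, size in enumerate(self.slot_sizes):
            if covered + size > index:
                return (slot, index - covered)  # (slot, offset within slot)
            covered += size
        raise IndexError(index)

    def item_from(self, index, slot, covered):
        """New behavior: resume from a remembered (slot, covered) position."""
        while covered + self.slot_sizes[slot] <= index:
            covered += self.slot_sizes[slot]
            slot += 1
        return (slot, index - covered, covered)

class Enumerator:
    """Caches the last slot position so sequential access is linear."""
    def __init__(self, children):
        self.children = children
        self.index = -1
        self.slot = 0
        self.covered = 0

    def move_next(self):
        self.index += 1
        if self.index >= sum(self.children.slot_sizes):
            return False
        self.slot, _, self.covered = self.children.item_from(
            self.index, self.slot, self.covered)
        return True
```

With the naive version, enumerating n items scans O(n) slots per item; the cached cursor advances each slot at most once across the whole enumeration.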
Running the BenchmarkDotNet tests in the PR yields the following output. Essentially, the tree walk is ~10% faster, while the line walk is much closer to 50% faster (depending on the file size).
*** without changes ***
Run 1:
Run 2:
*** with changes ***
Run 1:
Run 2: