Reduce complexity overhead (and avoid one copy) when obtaining character SSDQs in `SpriteTextDrawNode` #5883

peppy · 2023-07-03T08:17:39Z

Of note, this doesn't show a huge improvement in benchmarks, but in an edge case noticed in osu!, the AddRange operation can add a perceivable overhead:

Before	After

We believe this may be a dotnet runtime quirk.

Even though this change can't be shown in isolated benchmarks, I'd argue the code quality improvement is worth it.

Of note, the caching of the SSDQs at SpriteText was a bit redundant as it was being invalidated by basically everything. So it makes sense to just fetch it every time, regardless, in the draw node itself.

The only potential saving we could obtain with the previous logic would be to shift the load to the Update frame. But this wasn't being done. Rather than investigating whether that has any benefits, I'd rather focus on getting SpriteText rewritten to use an optimised shader.

master

Method	Mean	Error	StdDev	Allocated
TestStaticText	984.4 us	1.30 us	1.22 us	6 B
TestMovingText	2,701.5 us	47.68 us	42.27 us	117 B
TestChangingText	11,381.6 us	161.87 us	143.49 us	64182 B

this pr (pooled at 2967626)

Method	Mean	Error	StdDev	Allocated
TestStaticText	985.3 us	2.46 us	2.30 us	6 B
TestMovingText	2,928.0 us	57.80 us	75.16 us	119 B
TestChangingText	11,426.4 us	186.17 us	155.46 us	64184 B

this pr (list at 90a2e28)

Method	Mean	Error	StdDev	Allocated
TestStaticText	985.5 us	1.80 us	1.68 us	6 B
TestMovingText	2,476.1 us	47.54 us	50.86 us	110 B
TestChangingText	10,793.3 us	211.44 us	217.14 us	64180 B

…ter SSDQs in `SpriteTextDrawNode` Of note, this doesn't show a huge improvement in benchmarks, but in an edge case noticed in osu!, the `AddRange` operation can add a perceivable overhead. We believe this may be a dotnet runtime quirk. Even though this change can't be shown in isolated benchmarks, I'd argue the code quality improvement is worth it. Of note, the caching of the SSDQs at `SpriteText` was a bit redundant as it was being invalidated by basically everything. So it makes sense to just fetch it every time, regardless, in the draw node itself. The only potential saving we could obtain with the previous logic would be to shift the load to the `Update` frame. But this wasn't being done. Rather than investigating whether that has any benefits, I'd rather focus on getting `SpriteText` rewritten to use an optimised shader.

bdach

Would like @smoogipoo's take on the invalidation stuff if possible.

bdach · 2023-07-03T21:47:29Z

osu.Framework/Graphics/Sprites/SpriteText_DrawNode.cs

+            {
+                int partCount = Source.characters.Count;
+
+                parts ??= new List<ScreenSpaceCharacterPart>(partCount);


This is the optimisation, correct? The lazy list init and the partCount spec?

It's not what I would call "avoid one copy" exactly because it's not directly avoiding copies, it's avoiding list reallocs due to having to expand the collection to match incoming count (which in turn will cause struct copies due to how list-of-struct works), which makes me mildly confused. Strictly speaking it would save to the order of O(n) copies if I'm not mistaken?

The copy I meant was the fact that items were previously added once to a list in SpriteText then subsequently copied to a second list in DrawNode.

The main goal was to remove the weird InsertRange overhead, and this was enough to do that.

osu.Framework/Graphics/Sprites/SpriteText_DrawNode.cs

peppy · 2023-07-03T22:33:56Z

Would like @smoogipoo's take on the invalidation stuff if possible.

I double-checked with him IRL on the invalidation part when making this change, so he should be on board with it 😄

peppy · 2023-07-04T04:02:09Z

With EnsureCapacity (wouldn't expect a change, so just confirming it doesn't get worse):

Method	Mean	Error	StdDev	Allocated
TestStaticText	985.8 us	0.99 us	0.93 us	6 B
TestMovingText	2,529.1 us	49.84 us	48.95 us	108 B
TestChangingText	10,811.6 us	79.74 us	74.59 us	64166 B

smoogipoo · 2023-07-05T06:53:14Z

osu.Framework/Graphics/Sprites/SpriteText.cs

-        private readonly LayoutValue parentScreenSpaceCache = new LayoutValue(Invalidation.DrawSize | Invalidation.Presence | Invalidation.DrawInfo, InvalidationSource.Parent);
-        private readonly LayoutValue localScreenSpaceCache = new LayoutValue(Invalidation.MiscGeometry, InvalidationSource.Self);


I suppose one of the advantages of this is that the overhead only occurred once per invalidation, rather than 3 times (once for each DrawNode) per invalidation.

The lookup was already lazy, so I'm not sure if this would ever be the case, but maybe.

peppy added 3 commits July 3, 2023 15:51

Add SpriteText benchmark

00c570f

Don't use ArrayPool (list is faster)

90a2e28

peppy added the type:performance label Jul 3, 2023

pull-request-size bot added the size/L label Jul 3, 2023

bdach reviewed Jul 3, 2023

View reviewed changes

bdach requested a review from smoogipoo July 3, 2023 22:03

Use EnsureCapcity to potentially avoid multiple copies

3092bd6

bdach removed the request for review from smoogipoo July 4, 2023 19:53

Merge branch 'master' into sprite-text-draw-node-perf

8eb1b55

bdach approved these changes Jul 4, 2023

View reviewed changes

bdach enabled auto-merge July 4, 2023 20:11

bdach merged commit 050857d into ppy:master Jul 4, 2023

smoogipoo reviewed Jul 5, 2023

View reviewed changes

peppy deleted the sprite-text-draw-node-perf branch September 14, 2023 09:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce complexity overhead (and avoid one copy) when obtaining character SSDQs in `SpriteTextDrawNode` #5883

Reduce complexity overhead (and avoid one copy) when obtaining character SSDQs in `SpriteTextDrawNode` #5883

peppy commented Jul 3, 2023

bdach left a comment

bdach Jul 3, 2023

peppy Jul 3, 2023 •

edited

Loading

peppy commented Jul 3, 2023

peppy commented Jul 4, 2023

smoogipoo Jul 5, 2023

peppy Jul 5, 2023

		private readonly LayoutValue parentScreenSpaceCache = new LayoutValue(Invalidation.DrawSize \| Invalidation.Presence \| Invalidation.DrawInfo, InvalidationSource.Parent);
		private readonly LayoutValue localScreenSpaceCache = new LayoutValue(Invalidation.MiscGeometry, InvalidationSource.Self);

Reduce complexity overhead (and avoid one copy) when obtaining character SSDQs in SpriteTextDrawNode #5883

Reduce complexity overhead (and avoid one copy) when obtaining character SSDQs in SpriteTextDrawNode #5883

Conversation

peppy commented Jul 3, 2023

bdach left a comment

Choose a reason for hiding this comment

bdach Jul 3, 2023

Choose a reason for hiding this comment

peppy Jul 3, 2023 • edited Loading

Choose a reason for hiding this comment

peppy commented Jul 3, 2023

peppy commented Jul 4, 2023

smoogipoo Jul 5, 2023

Choose a reason for hiding this comment

peppy Jul 5, 2023

Choose a reason for hiding this comment

Reduce complexity overhead (and avoid one copy) when obtaining character SSDQs in `SpriteTextDrawNode` #5883

Reduce complexity overhead (and avoid one copy) when obtaining character SSDQs in `SpriteTextDrawNode` #5883

peppy Jul 3, 2023 •

edited

Loading