
[Perf -29%] PerfLabTests.CastingPerf (2) #37803

Closed

DrewScoggins opened this issue Jun 12, 2020 · 7 comments

@DrewScoggins (Member)
Run Information

Architecture x64
OS Windows 10.0.18362
Changes diff

Regressions in PerfLabTests.CastingPerf

Benchmark                    | Baseline  | Test      | Test/Base | Modality | Baseline Outlier
CheckObjIsInterfaceNo        | 218.29 μs | 280.64 μs | 1.29      |          | False
CheckIsInstAnyIsInterfaceNo  | 218.29 μs | 280.65 μs | 1.29      |          | False

[graphs omitted]
Historical Data in Reporting System

Repro

git clone https://github.com/dotnet/performance.git
py .\performance\scripts\benchmarks_ci.py -f netcoreapp5.0 --filter 'PerfLabTests.CastingPerf*';

Histogram

PerfLabTests.CastingPerf.CheckObjIsInterfaceNo

[217888.775 ; 232960.149) | @@@@@@@@@@@@
[232960.149 ; 252981.511) | @@@@@@@@@@@@@@@@@@@@@@@@@@@@
[252981.511 ; 270375.304) | 
[270375.304 ; 285446.677) | @@@@@@@@@@@@@@@@@@@

PerfLabTests.CastingPerf.CheckIsInstAnyIsInterfaceNo

[209794.928 ; 226755.346) | @@@@@@@@@@@@@
[226755.346 ; 237218.013) | 
[237218.013 ; 254170.131) | @@@@@@@@@@@@@@@@@@@@@@@@@
[254170.131 ; 269951.518) | 
[269951.518 ; 286903.637) | @@@@@@@@@@@@@@@@@@@@
[286903.637 ; 303354.976) | 
[303354.976 ; 320307.095) | @

Docs

Profiling workflow for dotnet/runtime repository
Benchmarking workflow for dotnet/runtime repository

@Dotnet-GitSync-Bot Dotnet-GitSync-Bot added area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI untriaged New issue has not been triaged by the area owner labels Jun 12, 2020
@DrewScoggins DrewScoggins added the tenet-performance-benchmarks Issue from performance benchmark label Jun 12, 2020
@DrewScoggins (Member, Author)

@adamsitnik @VSadov

@jkotas jkotas added area-VM-coreclr and removed area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI labels Jun 12, 2020
@mangod9 mangod9 removed the untriaged New issue has not been triaged by the area owner label Jun 22, 2020
@mangod9 mangod9 added this to the 5.0.0 milestone Jun 22, 2020
@DrewScoggins DrewScoggins added tenet-performance Performance related issue tenet-performance-benchmarks Issue from performance benchmark os-windows arch-x64 and removed tenet-performance-benchmarks Issue from performance benchmark labels Jul 7, 2020
@VSadov VSadov self-assigned this Jul 21, 2020
VSadov (Member) commented Jul 21, 2020

This could be the cost of tiered-compilation indirection showing up on a fast cast. The native implementation did not have that expense.

The cost of the tiering indirection could easily be avoided by opting the casting helper out of tiering. That would mean jitting it at startup time. We decided against that, since these are "fast" casts and generally not a perf concern.

I will take a look to be sure.

mangod9 (Member) commented Aug 6, 2020

@VSadov, the last time we chatted I believe you mentioned this perf was reasonable. Is there anything actionable here?

VSadov (Member) commented Aug 6, 2020

@mangod9 Right. I just looked at this again to be sure. The code that we run here is the same as in the native implementation: these are simple cases that just compare the target interface against the implemented interfaces (which in these benchmarks is a list of length 1).

The difference is that the managed implementation is called via an extra indirection due to the tiering callsite, which is a trade-off for startup latency.
The only reason we see the impact of an extra jmp is that the operation is extremely fast. The measurements are for loops that do 100,000 casts.

I think we can accept the current performance. It is unlikely to cause issues in real world scenarios.

mangod9 (Member) commented Aug 6, 2020

Thanks for the follow-up. Will close this out. @DrewScoggins, is there anything required to track further regressions in the future?

VSadov (Member) commented Aug 6, 2020

BTW, I like that we have tests sensitive to this.

VSadov (Member) commented Aug 6, 2020

Also, I think I can resolve #1994, which asked about the impact of tiering on casting; that is exactly what we are looking at here.

5 participants