Cached interface dispatch for coreclr #111771

davidwrighton · 2025-01-24T00:18:20Z

Enabling cached interface dispatch as an options for CoreCLR (should reduce memory usage/remove RWX pages, at the cost of reducing performance)

Current implementation is only enabled in release builds for Apple platforms with restrictions on code generation
On Debug/Checked builds of X64/Arm64 platforms it is possible to enable the feature by setting the DOTNET_UseCachedInterfaceDispatch environment variable to 1. (NOTE: Enabling this feature requires running on a processor which supports 128 bit compare and swap, which has implications on Linux X64 builds, and would have implications for Loongarch/RiscV if we enable the code there.)
The strategy is to re-use the existing VirtualCallStubManager infrastructure for all non-code-generation driven lookups, but to replace the stub generation logic with the CachedInterfaceDispatch paths from NativeAOT.
In addition, to support this, we need to extend the size of a Dispatch cell embedded in R2R images, so various parts of that logic are now capable of generating double pointer aligned dispatch cells when commanded. Infrastructure to set the right behavior for targetting apple platforms has not yet been implemented although the general purpose support is in place.

Known issues addressed before making a non-draft PR

… on shared things, and parts that are not shared

…ally in place

…an be switched between

… not yet supported

…rface dispatch

…ce dispatch or virtual stub dispatch

dotnet-policy-service · 2025-01-24T00:19:23Z

Tagging subscribers to this area: @mangod9
See info in area-owners.md if you want to be subscribed.

…te that this requires adding the -mcx16 switch to clang, so that cmpxchg16b instruction gets generated, which is an increase in the baseline CPU required by CoreCLR on Linux, and isn't likely to be OK for shipping publicly

…veAOT cached interface dispatch implementation (as it isn't actually used) Update IsIPinVirtualStub to check the AVLocations, not the stub entry points

…e_dispatch_for_coreclr

…hook up the VTable offset logic and such (vtable paths are untested)

- Enable generating double pointer indirection cells in R2R files using command line switch. - Fix VTableOffset calculation - Add logic in ExternalMethodFixupWorker to handle the double pointer indirection cells.

src/coreclr/vm/prestub.cpp

src/coreclr/vm/virtualcallstub.cpp

src/coreclr/vm/prestub.cpp

src/coreclr/vm/virtualcallstub.cpp

kg · 2025-03-01T00:53:22Z

src/coreclr/vm/virtualcallstub.cpp

+    }
+
+    MethodDesc *pTargetMD = COMDelegate::GetMethodDescForOpenVirtualDelegate(delegateObj);
+    pSDFrame->SetFunction(pTargetMD);


If pSDFrame is &frame why are we indirecting through the pointer instead of using frame directly?

This is due to historical reasons. Until recently (3 weeks ago), the frame was FrameWithCookie<StubDispatchFrame> and not just StubDispatchFrame. So the StubDispatchFrame * pSDFrame = &frame was used so that on every access we don't need to do casting. It isn't needed anymore.

src/coreclr/vm/virtualcallstub.h

src/tests/Loader/CollectibleAssemblies/Statics/CollectibleTLSStaticCollection.cs

…interface_dispatch_for_coreclr

am11 · 2025-03-05T22:53:29Z

@davidwrighton is it supposed to exclude riscv64 and loongarch64? Test build has started to break:

ld.lld : error : undefined symbol: RhpVTableOffsetDispatchAVLocation [/runtime/src/tests/nativeaot/GenerateUnmanagedEntryPoints/GenerateUnmanagedEntryPoints.csproj] [/runtime/src/tests/build.proj]

MichalStrehovsky · 2025-03-05T23:02:21Z

@davidwrighton is it supposed to exclude riscv64 and loongarch64? Test build has started to break:
ld.lld : error : undefined symbol: RhpVTableOffsetDispatchAVLocation [/runtime/src/tests/nativeaot/GenerateUnmanagedEntryPoints/GenerateUnmanagedEntryPoints.csproj] [/runtime/src/tests/build.proj]

I think I have a fix in #113179

davidwrighton added 10 commits January 8, 2025 15:47

Move the cached interface dispatch code into a shared region

2b2ca52

Split cached interface dispatch up into a component which is focussed…

4892674

… on shared things, and parts that are not shared

It builds for X64, VTable stuff isn't probably correct, but its basic…

d69528f

…ally in place

Add indirection cell helper so that VSD and CachedInterfaceDispatch c…

5f1f2b5

…an be switched between

Ready to try running things. R2R not yet supported. Virtual delegates…

976bf83

… not yet supported

Initialize CachedInterfaceDispatch at startup

652930c

AMD64 seems to work

39a2574

Arm64 Windows assembly written and factored amd64 to be similar

4c0865c

Allow there to be flavors of the build which do not build cached inte…

645c487

…rface dispatch

Make it possible for some OS/Architecture sets to have cached interfa…

921631a

…ce dispatch or virtual stub dispatch

dotnet-issue-labeler bot added the area-VM-coreclr label Jan 24, 2025

dotnet-policy-service bot assigned davidwrighton Jan 24, 2025

build-analysis bot mentioned this pull request Jan 24, 2025

The Operation will be canceled. The next steps may not contain expected logs. dotnet/dnceng#3008

Open

3 tasks

davidwrighton closed this Jan 24, 2025

davidwrighton added 5 commits January 24, 2025 09:46

Fix X86 build

73b0b26

Get Linux Arm64 and Amd64 into a possibly good state

2cdd955

Enable building cached interface dispatch for Linux arm64

df393d9

Add AVLocation for the VTable helper which wasn't present in the Nati…

cce3bcb

…veAOT cached interface dispatch implementation (as it isn't actually used) Update IsIPinVirtualStub to check the AVLocations, not the stub entry points

davidwrighton reopened this Jan 24, 2025

davidwrighton added 7 commits January 24, 2025 23:50

Merge branch 'main' of github.com:dotnet/runtime into cached_interfac…

4bbcdaa

…e_dispatch_for_coreclr

Fix musl build failure

c320e1d

Handle missed RhpVTableOffsetDispatchAVLocation case

361588a

Move RiscV stub dispatch logic to the same place as everything else

24e78b2

Fix assertion issue with collectible assemblies

5b0e5ac

Reduce InterfaceDispatchCell size from 4 pointers to 2, and actually …

fa7826a

…hook up the VTable offset logic and such (vtable paths are untested)

Use the isCachedInterfaceDispatchStubAVLocation helper where appropriate

f1c2c65

build-analysis bot mentioned this pull request Jan 28, 2025

slow macOS - "##[error]The job running on agent Azure Pipelines 9 ran longer than the maximum time of 60 minutes." dotnet/dnceng#1883

Open

3 tasks

Enable using cached interface dispatch in R2R

36c9cc0

- Enable generating double pointer indirection cells in R2R files using command line switch. - Fix VTableOffset calculation - Add logic in ExternalMethodFixupWorker to handle the double pointer indirection cells.