[RISC-V][x64] WiP: Passing empty struct fields #101796

tomeksowi · 2024-05-02T13:42:23Z

This PR should be viewed as proof of concept, it will be upstreamed in smaller chunks so it can be reviewed with some confidence.

RISC-V and LoongArch

Small structs containing one or two fields, at least one of them floating-point, can be passed (or returned) according to hardware FP calling convention: each field occupying one register. The existing implementation worked only for the narrow case when the fields were naturally aligned. However, the ABIs on both platforms when enregistering fields disregard placement hints such as manual alignment, packing attributes, or padding with empty structs (when they are sized 1 byte like in C++ or .NET). This means additional information on field offsets and sizes needs to be passed wherever registers<->memory copying of such structs happens.

RISC-V only: Unlike LoongArch's, RISC-V's ABI does not bound the size of such structs to 16 bytes. This means, among other things, that we can no longer rule out struct's eligibility for passing according to hardware FP calling convention by simply checking size > 16, which is assumed in many places.

System V x86-64

The current implementation barred a struct containing empty struct fields from enregistration. This did not match the System V ABI which says "NO_CLASS This class is used as initializer in the algorithms. It will be used for padding and empty structures and unions". It also does not match the behavior of GCC & Clang on Linux.

Part of #84834, cc @dotnet/samsung

dotnet-policy-service · 2024-05-02T13:42:49Z

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

The current implementation barred a struct containing empty struct fields from enregistration. This did not match the [System V ABI](https://refspecs.linuxbase.org/elf/x86_64-abi-0.99.pdf) which says "NO_CLASS This class is used as initializer in the algorithms. It will be used for padding and **empty structures** and unions". It also does not match the behavior of GCC & Clang on Linux.

…s on ARM 32

tomeksowi · 2024-05-09T13:53:44Z

src/tests/JIT/Directed/StructABI/StructABI.cs

+[StructLayout(LayoutKind.Explicit, Pack=1)]
+struct ExplicitFloatLong
+{
+	[FieldOffset(1)] float FieldF;
+	[FieldOffset(5)] long FieldL;
+


For the record, I don't think Pack should be necessary. From what I dug in the codebase, it's a known problem:

runtime/src/coreclr/vm/jitinterface.cpp

Lines 2102 to 2112 in ce2364c

if (result < 8 && pMT->RequiresAlign8())

{

// If the structure contains 64-bit primitive fields and the platform requires 8-byte alignment for

// such fields then make sure we return at least 8-byte alignment. Note that it's technically possible

// to create unmanaged APIs that take unaligned structures containing such fields and this

// unconditional alignment bump would cause us to get the calling convention wrong on platforms such

// as ARM. If we see such cases in the future we'd need to add another control (such as an alignment

// property for the StructLayout attribute or a marshaling directive attribute for p/invoke arguments)

// that allows more precise control. For now we'll go with the likely scenario.

result = 8;

}

IMHO the fix would be something like:

In CalculateSizeAndFieldOffsets amend calculating alignmentRequirement to:
GCD(min(alignmentRequirement, packingSize), placementInfo->m_offset)

Amend this condition to take explicit offset into consideration:

runtime/src/coreclr/vm/methodtablebuilder.cpp

Lines 4532 to 4540 in ce2364c

// For types with layout we drop any 64-bit alignment requirement if the packing size was less than 8

// bytes (this mimics what the native compiler does and ensures we match up calling conventions during

// interop).

// We don't do this for types that are marked as sequential but end up with auto-layout due to containing pointers,

// as auto-layout ignores any Pack directives.

if (HasLayout() && (HasExplicitFieldOffsetLayout() || IsManagedSequential()) && GetLayoutInfo()->GetPackingSize() < 8)

{

fFieldRequiresAlign8 = false;

}

But since this doesn't have much to do with empty structs, I'll leave it for now to keep this PR focused.

…alling convention, in ArgIterator::GetNextOffset and RiscV64Classifier

…n buffer; always calculate GetRiscV64PassStructInRegisterFlags in ComputeReturnFlags\(\)

…. We still need to look at GetRiscV64PassStructInRegisterFlags to rule out passing in registers according to hw FP call conv

…e native version for VM

…atment)

…o don't calculate GetRiscV64PassStructInRegisterFlags if struct fits in 16 bytes because we don't care whether it's passed in registers according to integer or hardware floating-point calling convention

…get(Arg|Return)TypeForStruct

Rework CallDescrWorkerInternal to do simple register saving into CallDescrWorker::returnValue rather than try to reconstruct the struct there. Reconstruction respecting the actual field layout is handled then by CopyReturnedFpStructFromRegisters. This fixes most reflection call tests in JIT/Directed/StructABI/StructABI.

…an argument is passed by ref by looking at m_hasArgLocDescForStructInRegs in ArgIteratorTemplate::IsArgPassedByRef()

…ntion to final destination in MethodDescCallSite::CallTargetWorker

jakobbotsch · 2024-06-18T13:44:31Z

src/coreclr/jit/codegencommon.cpp

+#elif defined(TARGET_RISCV64) || defined(TARGET_LOONGARCH64)
+            // On RISC-V/LoongArch struct { struct{} e1,e2,e3; byte b; float f; } is passed in 2 registers so the
+            // load/store instruction for 'b' needs to be exact in size or it will overlap 'f'.
+            return seg.GetRegisterType();


Can you share some information about the case this fixes? Given that promoted struct fields are normalize-on-load, this special case should not be necessary unless there is a bug elsewhere in the RISCV64/LA64 backends.

It was for cases like this:

runtime/src/tests/JIT/Directed/StructABI/StructABI.cs

Lines 691 to 707 in 7551d35

public struct EmptyFloatEmpty5Byte

{

public Empty e;

public float FieldF;

public Empty e0, e1, e2, e3, e4;

public sbyte FieldB;

public static EmptyFloatEmpty5Byte Get()

{

return new EmptyFloatEmpty5Byte { FieldF = 3.14159f, FieldB = -123 };

}

public bool Equals(EmptyFloatEmpty5Byte other)

{

return FieldF.Equals(other.FieldF) && FieldB == other.FieldB;

}

}

Homing the argument in the EmptyFloatEmpty5Byte:Equals prolog trashed the this pointer:

IN0016: 000000 addi sp, sp, -48 IN0017: 000004 sd fp, 32(sp) IN0018: 000008 sd ra, 40(sp) IN0019: 00000C addi fp, sp, 32 IN001a: 000010 sd a0, -8(fp) // store 'this' IN001b: 000014 fsw f10, -20(fp) IN001c: 000018 sw a1, -11(fp) // stomp on 'this'

I saw native compilers home arguments with appropriately-sized stores at original struct offsets so I fixed it the same way.

Full JitDump of EmptyFloatEmpty5Byte:Equals if interested

Got it, I think the fix makes sense. The comment doesn't seem fully accurate then; the real problem seems to be that since the enregistered layout does not match the memory layout of the struct, rounding up the size would potentially extend outside the stack slot.

Right, I'll reword the comment.

Oh, and please view this PR as a proof of concept. I need to upstream it in smaller chunks so it can be reviewed with some confidence.

…ckType

…lar to eeGetSystemVAmd64PassStructInRegisterDescriptor

jakobbotsch · 2024-06-25T08:21:44Z

src/coreclr/vm/methodtable.cpp

+static FpStructInRegistersInfo GetRiscV64PassFpStructInRegistersInfoImpl(TypeHandle th)
+{
+    FpStructInRegistersInfo info = {};
+    int nFields = 0;
+    if (!FlattenFields(th, 0, info, nFields DEBUG_ARG(0)))
+        return FpStructInRegistersInfo{};
+
+    using namespace FpStruct;
+    if ((info.flags & (FloatInt | IntFloat)) == 0)
+    {
+        LOG((LF_JIT, LL_EVERYTHING, "FpStructInRegistersInfo: struct %s (%u bytes) has no floating fields\n",
+            (!th.IsTypeDesc() ? th.AsMethodTable() : th.AsNativeValueType())->GetDebugClassName(), th.GetSize()));
+        return FpStructInRegistersInfo{};
+    }


As I alluded to in some other PRs we have the notion of significant padding where it can essentially be considered that all the padding of a struct is covered by fields. If you look at the getTypeLayout implementation you can see how it is computed. The JIT takes care to preserve values in bytes not covered by fields in those cases, and the ABI classification will need to do the same.

For example, a structure declaration like

private unsafe struct S { public fixed byte Foo[15]; public float Bar; }

results in underlying metadata that looks like

private unsafe struct S { public FooStruct Foo; public float Bar; } [StructLayout(LayoutKind.Sequential, Size = 15)] private struct FooStruct { public byte FixedElementField; }

I'm curious how this code ends up classifying a struct like this one for passing.

It seems like RISC-V/LA64 are going to end up with some potentially surprising user behavior here because of these ABI differences when padding is involved, and because it is somewhat ambiguous to the VM/JIT what is (ignorable) padding (or at least not totally obvious to the user).

Here's a log from a similar case in JIT/Directed/StructABI/StructABI/StructABI.cs:

TID 3f2277: FpStructInRegistersInfo: flattening InlineArray1 (managed, 1 fields) TID 3f2277: FpStructInRegistersInfo: flattening <Array>e__FixedBuffer (managed, 1 fields) TID 3f2277: FpStructInRegistersInfo: * found field FixedElementField [0..1), type: Byte TID 3f2277: FpStructInRegistersInfo: * array has too many elements: 16

So it stops the field flattening because there's too many fields (max 2 fields to get passed according to FP calling convention) and returns an empty FpStructInRegistersInfo which means pass according to integer calling convention where there is no notion of "fields", structs are a lump of bits laid out in registers as in memory. But it this case struct S is bigger than 16 bytes so it's passed by implicit ref.

I see now the handling for HasImpliedRepeatedField; that might have to be generalized somewhat. What about the following example?

[StructLayout(LayoutKind.Explicit, Size = 20)] struct S { [FieldOffset(0)] public byte FirstByteOfArray; [FieldOffset(16)] public float FloatField; }

Maybe SysV classification needs some generalization too (I can see it uses HasImpliedRepeatedFields too). Or perhaps it is the significant padding computation in getTypeLayout that is overly conservative.

I think as expected:

TID 3f849f: FpStructInRegistersInfo: flattening S (managed, 2 fields) TID 3f849f: FpStructInRegistersInfo: * found field FirstByteOfArray [0..1), type: Byte TID 3f849f: FpStructInRegistersInfo: * found field FloatField [12..16), type: Single TID 3f849f: FpStructInRegistersInfo: struct S (16 bytes) can be passed with floating-point calling convention, flags=0x88; IntFloat, sizes={1, 4}, offsets={0, 12}, IntFieldKindMask=Integer

Note: I downsized to 16 bytes as I'm on #103945 branch where the condition has not been relaxed for RISC-V. But I think it answers your question (padding via explicit layout).

There are problems with handling fixed buffers as the comment in HasImpliedRepeatedField implies. But as far as the classification for RISC-V goes I think it's ok.

getTypeLayout considers the above to be equivalent to a struct struct S {public fixed byte Array[16]; public float FloatField; }. So in that view I don't think the RISC-V classification is ok; IIUC it will silently drop parts of the struct that may contain user data when it gets passed as an argument.

The ABI classification here should match what getTypeLayout decides, one way or the other. I'm not sure if it is possible for us to change what it considers significant padding, since that has been in the JIT for a long time now.

How would the user write data to e.g. FirstByteOfArray[5] with standard language features?

Using unsafe code. I am not sure of the historical details of how things evolved this way, but I would guess C++/CLI is a large part of it.

Native compilers in general don't consider padding to be preserved while passing, definitely not the RISC-V ABI. That's the whole point of passing a struct according to FP calling convention, as two fields each in one register.

The difference in .NET is simply what we consider to be the discardable padding. We also have discardable padding, like the last 4 bytes of Span<T>. But ExplicitLayout/explicit size automatically promote the padding of the struct to be considered as "must be preserved".
I believe most of your example test cases have discardable padding since they do not have ExplicitLayout/explicit size, so there I think the ABI classification done here makes sense. But for the example given above things are different, where the ABI classification should essentially be done as if the padding was replaced by explicit char arrays of the right size.

BTW what would be the .NET equivalent for this C/C++ struct?

struct { char i; alignas(16) float f; };

I am not sure we have an equivalent. I think there are both .NET structs (like the ones with ExplicitLayout/Size) that are not representable in C/C++; and C/C++ structs (like your example) that are not representable in .NET metadata. @AaronRobinsonMSFT, @jkoritzinsky or @jkotas should know more about the interop story here...

That's news to me:/ I don't see getTypeLayout used for parameter classification neither on System V nor on RISC-V/LoongArch.

I mention getTypeLayout because it has the current source of truth of when we consider padding in structs to be significant/required to be preserved. I do not know if the rules in there could be relaxed; just that changing the rules would be breaking, and that users are potentially relying on the values in padding of such structures to be preserved. So in that sense it becomes a problem if the ABI does not match with these rules.

I believe all of our existing ABIs come with rules that will preserve this padding, and thus we have not needed to pay any special attention to it before. I could be wrong about this on SysV, in which case I would consider it a bug.

Ultimately I think getTypeLayout and the ABI classification need to agree on what padding in a struct is significant and needs to be preserved, but I do not know whether it would be possible to relax the rules in getTypeLayout or not.

Are there any rules defined for what padding is considered significant or just whatever is in the code for getTypeLayout? Becuase treating padding as an array does influence the classification for passing so the user would need to be aware of it. The remarks in StructLayout documentation say it's for controlling the layout to pass a type to unmanaged code, i.e. match the layout of the unmanaged type, and it doesn't mention any of it.

I don't think we explicitly document this part of the rules anywhere (and as you can see in #71711, the semantics were not clear even to ourselves for a long time). I wouldn't be surprised if CUSTOMLAYOUT came to exist quite organically when some issue was noticed where the JIT discarded padding that was expected to be preserved.

I believe most of your example test cases have discardable padding since they do not have ExplicitLayout/explicit size, so there I think the ABI classification done here makes sense. But for the example given above things are different, where the ABI classification should essentially be done as if the padding was replaced by explicit char arrays of the right size.

Right, the bulk of this PR is about empty struct fields, I'll leave out the ExplicitLayout test cases until we hammer out what to do with it, then I'll address it in a dedicated PR.

At any rate, the condition for which padding is preservable needs to be communicated loud and clear because it may change how the argument is passed. I think the best course of action would be to specify something like "When <condition for significant padding>, the field with its padding until the next field is treated as a fixed array of type same as the field" and then let the platform ABI decide how to pass a struct with that additional array so we're not inventing a custom ABI.

I think there are both .NET structs (like the ones with ExplicitLayout/Size) that are not representable in C/C++; and C/C++ structs (like your example) that are not representable in .NET metadata.

Yes, that sounds about right.

I think there are both .NET structs (like the ones with ExplicitLayout/Size) that are not representable in C/C++; and C/C++ structs (like your example) that are not representable in .NET metadata.

Agree.

dotnet-policy-service · 2024-07-25T14:26:42Z

Draft Pull Request was automatically closed for 30 days of inactivity. Please let us know if you'd like to reopen it.

First, add a bunch of failing tests

c2c2bba

dotnet-issue-labeler bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label May 2, 2024

dotnet-policy-service bot added the community-contribution Indicates that the PR has been added by a community member label May 2, 2024

clamp03 assigned tomeksowi May 3, 2024

clamp03 added the arch-riscv Related to the RISC-V architecture label May 3, 2024

tomeksowi added 4 commits May 6, 2024 09:07

Reduce empty megabyte field to 32k as msvc caps size of arguments at 64k

03d4b23

Don't stop calculating flags if struct size > 16 bytes

4403c4b

Quell C-linkage warnings just in case

60ad834

tomeksowi changed the title ~~[RISC-V] WiP: Passing empty struct fields~~ [RISC-V][x64] WiP: Passing empty struct fields May 8, 2024

This was referenced May 8, 2024

Test failure in System.Numerics.Tensors.Tests.SingleGenericTensorPrimitives.SpanDestinationFunctions_SpecialValues #101731

Closed

arm32 fails in CI with "/lib/arm-linux-gnueabihf/libc.so.6: version `GLIBC_2.34' not found" #102030

Closed

tomeksowi mentioned this pull request May 9, 2024

[LoongArch64] Simplify flags for passing struct in registers. #102041

Merged

Add Pack=1 to bypass a known problem with field alignment requirement…

9064784

…s on ARM 32

tomeksowi commented May 9, 2024

View reviewed changes

tomeksowi added 9 commits May 10, 2024 09:02

Merge branch 'main' into empty-struct-passing

002c4b2

Check size > 16 only when passing parameters according to integer c…

b02c913

…alling convention, in ArgIterator::GetNextOffset and RiscV64Classifier

Don't assume struct size > 16 means return by implicit ref to a retur…

0fde22d

…n buffer; always calculate GetRiscV64PassStructInRegisterFlags in ComputeReturnFlags\(\)

Don't assume if struct size > 16, IsArgPassedByRef should return true…

f89a68d

…. We still need to look at GetRiscV64PassStructInRegisterFlags to rule out passing in registers according to hw FP call conv

Adjust ArgIterator.GetNextOffset() for crossgen2 to be the same as th…

f5e3930

…e native version for VM

Adjust crossgen2 version of ComputeReturnFlags (ComputeReturnValueTre…

0a5ce05

…atment)

Adjust IsArgPassedByRef for crossgen2 with native version for VM. Als…

6a3118b

…o don't calculate GetRiscV64PassStructInRegisterFlags if struct fits in 16 bytes because we don't care whether it's passed in registers according to integer or hardware floating-point calling convention

Don't assume struct (size > 16) means pass by reference in Compiler::…

0a1a7ff

…get(Arg|Return)TypeForStruct

Add a test for a single-float struct padded with empty struct field

9aa2d10

tomeksowi added 5 commits June 12, 2024 11:32

Fix the rest of the reflection tests by properly determining whether …

c7e8a11

…an argument is passed by ref by looking at m_hasArgLocDescForStructInRegs in ArgIteratorTemplate::IsArgPassedByRef()

Update crossgen2 C# version of ArgIterator to changes on the native side

16b549f

Fix copying structs returned by hardware floating-point calling conve…

3389bcf

…ntion to final destination in MethodDescCallSite::CallTargetWorker

Merge branch 'main' into empty-struct-passing

d4f81be

build-analysis bot mentioned this pull request Jun 14, 2024

Checkout failure: "Git fetch failed with exit code 128" dotnet/arcade#9009

Open

2 tasks

Merge branch 'main' into empty-struct-passing

7551d35

jakobbotsch reviewed Jun 18, 2024

View reviewed changes

build-analysis bot mentioned this pull request Jun 18, 2024

GC/Regressions/v2.0-beta2/452950 failed in CI #103494

Closed

tomeksowi added 8 commits June 21, 2024 09:05

Add signedness to integer field FpStructPassInRegistersInfo

4d2e2cb

Better flag names

907ac95

Improve explanation why RISC-V can't use genActualType in genParamSta…

892ec22

…ckType

Take signedness into account in CopyStructToRegisters

e73450e

Remove IsSize(1st|2nd)8 because they weren't used much

717cd59

Improve getter names in FpStructInRegistersInfo

64de193

Add comment to fix JIT to AssignClassifiedEightByteTypes

d2d1841

Add helpers for IntKind and flag names to FpStructInRegistersInfo

47f9d87

tomeksowi mentioned this pull request Jun 21, 2024

[x64][SysV] Classify empty structs for passing like padding #103799

Merged

tomeksowi added 4 commits June 21, 2024 10:34

Merge branch 'main' into empty-struct-passing

09c0b38

Fix C# build: Enum values should be on separate lines

8d6a102

Merge branch 'main' into empty-struct-passing

5ebd2af

Make a logging wrapper Compiler::GetPassFpStructInRegistersInfo, simi…

7036dc0

…lar to eeGetSystemVAmd64PassStructInRegisterDescriptor

tomeksowi mentioned this pull request Jun 25, 2024

[RISC-V][LoongArch64] New passing info for floating-point structs #103945

Merged

jakobbotsch reviewed Jun 25, 2024

View reviewed changes

tomeksowi mentioned this pull request Jul 1, 2024

[RISC-V][LoongArch64] JIT: pass structs according to floating-point calling convention properly #104237

Merged

dotnet-policy-service bot closed this Jul 25, 2024

tomeksowi mentioned this pull request Aug 1, 2024

[RISC-V][LoongArch64] Pass FP struct fields at arbitrary offsets in ArgIterator and CallDescrWorker #105800

Merged

tomeksowi mentioned this pull request Aug 12, 2024

[RISC-V][LoongArch64] Pass structs containing empty struct arrays according to integer calling convention #106266

Merged

github-actions bot locked and limited conversation to collaborators Aug 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RISC-V][x64] WiP: Passing empty struct fields #101796

[RISC-V][x64] WiP: Passing empty struct fields #101796

tomeksowi commented May 2, 2024 •

edited

Loading

dotnet-policy-service bot commented May 2, 2024

tomeksowi May 9, 2024

jakobbotsch Jun 18, 2024

tomeksowi Jun 19, 2024

jakobbotsch Jun 19, 2024

tomeksowi Jun 19, 2024

jakobbotsch Jun 25, 2024

tomeksowi Jun 25, 2024

jakobbotsch Jun 25, 2024

tomeksowi Jun 25, 2024 •

edited

Loading

jakobbotsch Jun 25, 2024

jakobbotsch Jun 25, 2024

jakobbotsch Jun 25, 2024

tomeksowi Jun 25, 2024 •

edited

Loading

jkotas Jun 25, 2024

AaronRobinsonMSFT Jun 25, 2024

dotnet-policy-service bot commented Jul 25, 2024

	if (result < 8 && pMT->RequiresAlign8())
	{
	// If the structure contains 64-bit primitive fields and the platform requires 8-byte alignment for
	// such fields then make sure we return at least 8-byte alignment. Note that it's technically possible
	// to create unmanaged APIs that take unaligned structures containing such fields and this
	// unconditional alignment bump would cause us to get the calling convention wrong on platforms such
	// as ARM. If we see such cases in the future we'd need to add another control (such as an alignment
	// property for the StructLayout attribute or a marshaling directive attribute for p/invoke arguments)
	// that allows more precise control. For now we'll go with the likely scenario.
	result = 8;
	}

	// For types with layout we drop any 64-bit alignment requirement if the packing size was less than 8
	// bytes (this mimics what the native compiler does and ensures we match up calling conventions during
	// interop).
	// We don't do this for types that are marked as sequential but end up with auto-layout due to containing pointers,
	// as auto-layout ignores any Pack directives.
	if (HasLayout() && (HasExplicitFieldOffsetLayout() \|\| IsManagedSequential()) && GetLayoutInfo()->GetPackingSize() < 8)
	{
	fFieldRequiresAlign8 = false;
	}

	public struct EmptyFloatEmpty5Byte
	{
	public Empty e;
	public float FieldF;
	public Empty e0, e1, e2, e3, e4;
	public sbyte FieldB;

	public static EmptyFloatEmpty5Byte Get()
	{
	return new EmptyFloatEmpty5Byte { FieldF = 3.14159f, FieldB = -123 };
	}

	public bool Equals(EmptyFloatEmpty5Byte other)
	{
	return FieldF.Equals(other.FieldF) && FieldB == other.FieldB;
	}
	}

[RISC-V][x64] WiP: Passing empty struct fields #101796

[RISC-V][x64] WiP: Passing empty struct fields #101796

Conversation

tomeksowi commented May 2, 2024 • edited Loading

RISC-V and LoongArch

System V x86-64

dotnet-policy-service bot commented May 2, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tomeksowi Jun 25, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tomeksowi Jun 25, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

dotnet-policy-service bot commented Jul 25, 2024

tomeksowi commented May 2, 2024 •

edited

Loading

tomeksowi Jun 25, 2024 •

edited

Loading

tomeksowi Jun 25, 2024 •

edited

Loading