Enforce scatter/gather file I/O Windows API requirements et. al. #57424

teo-tsirpanis · 2021-08-15T00:07:33Z

The Windows APIs that perform scatter/gather file I/O have more usage restrictions than their Unix counterparts. Besides requiring the file handle to be opened with buffering disabled, they also require each buffer segment to be aligned at page size boundaries, and will read/write one page from/to each segment.

Until now, the gather/scatter RandomAccess API Windows implementation only enforces the first requirement, allowing buffers containing segments without the proper alignment or length to be passed to the APIs, leading to potential undefined behavior like buffer overflows or underflows. For example, code around

runtime/src/libraries/System.Private.CoreLib/src/System/IO/RandomAccess.Windows.cs

Line 468 in 0ebc7ec

MemoryHandle memoryHandle = buffer.Pin();

which was pinning the segments did not check the Memory<byte>.Length property at all, which is what raised my suspicions and led us here.

This PR refactors it to additionally check whether each buffer segment is aligned and is exactly as long as a page. The previous check that the total buffer size fits in 32 bits is consolidated with the other two, in a common function for both scatter and gather, that also got the responsibility to pin the segments while checking their alignment (to prevent the GC from potentially moving them around in between). If these requirements are found to be violated, the buffers are unpinned and the operation will be performed with multiple read/write syscalls (like before when the file handle was opened synchronously or with buffering enabled).

Furthermore the RandomAccess APIs on Windows now special-case buffers with only one or zero (new in this PR) segments earlier in the call chain, and there were some other smaller changes.

… method.

And use pinned GCHandles and IntPtrs instead of MemoryHandles when passing the segment array to the bottom-most method.

ghost · 2021-08-15T00:07:39Z

Tagging subscribers to this area: @dotnet/area-system-io
See info in area-owners.md if you want to be subscribed.

Issue Details

The Windows APIs that perform scatter/gather file I/O have more usage restrictions than their Unix counterparts. Besides requiring the file handle to be opened with buffering disabled, they also require each buffer segment to be aligned at page size boundaries, and will read/write one page from/to each segment.

Until now, the gather/scatter RandomAccess API Windows implementation only enforces the first requirement, allowing buffers containing segments without the proper alignment or length to be passed to the APIs, leading to potential undefined behavior like buffer overflows or underflows. For example, code around

runtime/src/libraries/System.Private.CoreLib/src/System/IO/RandomAccess.Windows.cs

Line 468 in 0ebc7ec

MemoryHandle memoryHandle = buffer.Pin();

which was pinning the segments did not check the Memory<byte>.Length property at all, which is what raised my suspicions and led us here.

This PR refactors it to additionally check whether each buffer segment is aligned and is exactly as long as a page. The previous check that the total buffer size fits in 32 bits is consolidated with the other two, in a common function for both scatter and gather, that also got the responsibility to pin the segments while checking their alignment (to prevent the GC from potentially moving them around in between). If these requirements are found to be violated, the buffers are unpinned and the operation will be performed with multiple read/write syscalls (like before when the file handle was opened synchronously or with buffering enabled).

Furthermore the RandomAccess APIs on Windows now special-case buffers with only one or zero (new in this PR) segments earlier in the call chain, and there were some other smaller changes.

Author:	teo-tsirpanis
Assignees:	-
Labels:	`area-System.IO`, `community-contribution`
Milestone:	-

src/libraries/System.Private.CoreLib/src/System/IO/RandomAccess.Windows.cs

…ddrOfPinnedObject.

src/libraries/System.Private.CoreLib/src/System/IO/RandomAccess.Windows.cs

…rGatherBuffers.

…DOs.

adamsitnik

Please provide a single unit test, where you create a buffer that does not meet ReadFileScatter|WriteFileGather size and alignment requirements, but meets ReadFile|WriteFile size and alignment requirements for file handles opened with NO_BUFFERING.

teo-tsirpanis · 2021-08-18T16:51:01Z

I added the test @adamsitnik.

I also noticed in lines like https://github.com/dotnet/runtime/blob/bae5e4ec0ebbc93542779800a2c5a64e0c8dc620/src/libraries/System.IO.FileSystem/tests/RandomAccess/NoBuffering.Windows.cs#L147 that we always open the handle with FileOptions.Asynchronous, even when we test synchronous I/O. Is that intentional or should I fix it as well?

src/libraries/System.IO.FileSystem/tests/RandomAccess/NoBuffering.Windows.cs

src/libraries/System.Private.CoreLib/src/System/IO/RandomAccess.Windows.cs

adamsitnik · 2021-08-19T07:53:13Z

src/libraries/System.Private.CoreLib/src/System/IO/RandomAccess.Windows.cs

-
-            MemoryHandle[] memoryHandles = new MemoryHandle[buffersCount];
-            MemoryHandle pinnedSegments = fileSegments.AsMemory().Pin();
+            long* segmentsArray = (long*) NativeMemory.Alloc((nuint)(buffersCount + 1), sizeof(long));


this change needs to be benchmarked as I would expect managed allocator to be faster than native for small arrays.

It was @jkotas' idea. Allocating a managed array has less overhead, but it also has to be pinned for the duration of the async operation.

@teo-tsirpanis could you please benchmark your changes? The easiest way would be to contribute some new RandomAccess type benchmarks to dotnet/performance repo similar to FileStream benchmarks and run them using two CoreRun.exe: with and without your changes https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md#dotnet-runtime-prerequisites

dotnet run -c Release -f net6.0 --filter *RandomAccess* --corerun $pathToCoreRunWithoutYourChanges $pathToCoreRunWithYourChanges

I tried but my machine ran out of space. 😕

I tried but my machine ran out of space

@teo-tsirpanis could you please share the benchmark source code? I am would be very happy to help

@teo-tsirpanis I've implemented some benchmarks (dotnet/performance#1967) and run them against your fork.

This is what is going to be required to run them once dotnet/performance#1967 is merged (I am sharing that so you can run some benchmarks in the future):

git clone https://github.com/teo-tsirpanis/dotnet-runtime.git runtime cd .\runtime\ .\build.cmd -c Release -subset clr+libs robocopy .\artifacts\bin\testhost\net6.0-windows-Release-x64\shared\Microsoft.NETCore.App\7.0.0\ .\artifacts\bin\testhost\net6.0-windows-Release-x64\shared\Microsoft.NETCore.App\before /E git checkout windows-vectored-io-refactor taskkill /IM "dotnet.exe" /F .\build.cmd -c Release -subset clr.corelib+clr.nativecorelib+libs.PreTest robocopy .\artifacts\bin\testhost\net6.0-windows-Release-x64\shared\Microsoft.NETCore.App\7.0.0\ .\artifacts\bin\testhost\net6.0-windows-Release-x64\shared\Microsoft.NETCore.App\after /E cd .. git clone https://github.com/dotnet/performance.git performance py .\performance\scripts\benchmarks_ci.py -f net6.0 --filter *Perf_RandomAccess_NoBuffering* --corerun .\runtime\artifacts\bin\testhost\net6.0-windows-Release-x64\shared\Microsoft.NETCore.App\before\corerun.exe .\runtime\artifacts\bin\testhost\net6.0-windows-Release-x64\shared\Microsoft.NETCore.App\after\corerun.exe

src/libraries/System.Private.CoreLib/src/System/IO/RandomAccess.Windows.cs

src/libraries/System.IO.FileSystem/tests/RandomAccess/NoBuffering.Windows.cs

adamsitnik · 2021-08-19T08:10:48Z

that we always open the handle with FileOptions.Asynchronous, even when we test synchronous I/O. Is that intentional or should I fix it as well?

In this particular file where we test NO_BUFFERING support it's expected, as it's ReadFileScatter requirement:

The file handle must be created with the GENERIC_READ right, and the FILE_FLAG_OVERLAPPED and FILE_FLAG_NO_BUFFERING flags. For more information, see File Security and Access Rights.

FILE_FLAG_OVERLAPPED == FileOptions.Asynchronous (this is the tricky part)

But we should most probably extend the tests and cover all possible scenarios:

sync operations on async handle
async operations on async handle
sync operations on sync handle
async operations on sync handle

The test has been added

adamsitnik · 2021-08-19T08:41:03Z

But we should most probably extend the tests and cover all possible scenarios:

@teo-tsirpanis I've sent #57717 to address that

adamsitnik

The code LGTM, the perf also. @teo-tsirpanis once again thank you!

adamsitnik · 2021-08-31T08:30:05Z

/backport to release/6.0

github-actions · 2021-08-31T08:30:20Z

Started backporting to release/6.0: https://github.com/dotnet/runtime/actions/runs/1185379030

src/libraries/System.Private.CoreLib/src/System/IO/RandomAccess.Windows.cs

src/libraries/System.IO.FileSystem/tests/RandomAccess/NoBuffering.Windows.cs

src/libraries/System.Private.CoreLib/src/System/IO/RandomAccess.Windows.cs

teo-tsirpanis added 3 commits August 15, 2021 00:58

Move checking and pinning Windows vectored I/O buffers to a dedicated…

d32cf8b

… method.

Refactor the scatter/gather APIs to use the common checking method.

2d8e82f

And use pinned GCHandles and IntPtrs instead of MemoryHandles when passing the segment array to the bottom-most method.

Shorten the name of the buffer-checking method.

26e5350

ghost added the community-contribution Indicates that the PR has been added by a community member label Aug 15, 2021

dotnet-issue-labeler bot added the area-System.IO label Aug 15, 2021

teo-tsirpanis commented Aug 15, 2021

View reviewed changes

src/libraries/System.Private.CoreLib/src/System/IO/RandomAccess.Windows.cs Outdated Show resolved Hide resolved

teo-tsirpanis commented Aug 15, 2021

View reviewed changes

src/libraries/System.Private.CoreLib/src/System/IO/RandomAccess.Windows.cs Outdated Show resolved Hide resolved

Directly get the pinned array's address instead of calling GCHandle.A…

d7b82ff

…ddrOfPinnedObject.

jkotas reviewed Aug 15, 2021

View reviewed changes

src/libraries/System.Private.CoreLib/src/System/IO/RandomAccess.Windows.cs Outdated Show resolved Hide resolved

jkotas reviewed Aug 15, 2021

View reviewed changes

src/libraries/System.Private.CoreLib/src/System/IO/RandomAccess.Windows.cs Outdated Show resolved Hide resolved

teo-tsirpanis mentioned this pull request Aug 15, 2021

Investigate which other properties of System.Environment can be cached. #57442

Open

Refactor the error handling logic in TryPrepareScatterGatherBuffers.

a160f1d

adamsitnik added the NO-MERGE The PR is not ready for merge yet (see discussion for detailed reasons) label Aug 15, 2021

teo-tsirpanis added 2 commits August 15, 2021 19:07

Allocate the segment array from native memory and at TryPrepareScatte…

656adf6

…rGatherBuffers.

Cache the page size on a static readonly field and add a couple of TO…

e1b94a9

…DOs.

teo-tsirpanis force-pushed the windows-vectored-io-refactor branch from 330449f to e1b94a9 Compare August 15, 2021 16:29

adamsitnik previously requested changes Aug 15, 2021

View reviewed changes

adamsitnik added this to the Future milestone Aug 15, 2021

jaredpar mentioned this pull request Aug 17, 2021

Test failure: System.Security.Cryptography.X509Certificates.Tests.CertificateCreation.CertificateRequestChainTests/CreateChain_Hybrid #25979

Closed

Make the memory handlers readonly structs.

92924d8

runfoapp bot mentioned this pull request Aug 18, 2021

Feed unreliability affecting CI #55449

Closed

adamsitnik reviewed Aug 19, 2021

View reviewed changes

adamsitnik removed the NO-MERGE The PR is not ready for merge yet (see discussion for detailed reasons) label Aug 19, 2021

adamsitnik mentioned this pull request Aug 19, 2021

extend the NO_BUFFERING tests and cover all scenarios #57717

Merged

Add a test.

b1d60b6

Reorder some methods with PR feedback taken into consideration.

749b6c7

teo-tsirpanis force-pushed the windows-vectored-io-refactor branch from bae5e4e to 749b6c7 Compare August 19, 2021 14:02

teo-tsirpanis and others added 3 commits August 19, 2021 18:13

Merge branch 'main' into windows-vectored-io-refactor

4b940a4

Stop special-casing scatter/gather operations with zero or one buffer.

5ed633d

Factor the cleaning-up of the segment buffers into a separate method.

7e75cc7

adamsitnik approved these changes Aug 31, 2021

View reviewed changes

adamsitnik merged commit 8750e9a into dotnet:main Aug 31, 2021

github-actions bot mentioned this pull request Aug 31, 2021

[release/6.0] Enforce scatter/gather file I/O Windows API requirements et. al. #58423

Merged

teo-tsirpanis deleted the windows-vectored-io-refactor branch August 31, 2021 08:32