Optimize BigInteger.ToString for large decimal string #104676

kzrnm · 2024-07-10T13:53:55Z

This PR is a counterpart to #55121. divide-and-conquer algorithm

Number.FormatBigInteger() can run in $D(n)log(N)$ time using the Divide and Conquer algorithm, where $D(n)$ represents the computational complexity of BigInteger division.

The computational complexity of division will be improved by #96895. Once 96895 is merged, I will add a benchmark and set the PR to "ready for review."

dotnet-policy-service · 2024-07-10T13:54:25Z

Tagging subscribers to this area: @dotnet/area-system-numerics
See info in area-owners.md if you want to be subscribed.

huoyaoyuan · 2024-07-10T14:46:29Z

src/libraries/System.Runtime.Numerics/src/System/Number.BigInteger.cs

+            // The Ratio is calculated as: log_{10^9}(2^32)
+            const double digitRatio = 1.0703288734719332;
+            Debug.Assert(BigInteger.MaxLength * digitRatio + 1 < Array.MaxLength); // won't overflow


If the length doesn't need to be exact, you can use integer estimation instead, similar to what I did in NumberToBigInteger.

Which part are you referring to? NumberToBigInteger also used $\log_{2^{32}}(10^9)$.

runtime/src/libraries/System.Runtime.Numerics/src/System/Number.BigInteger.cs

Lines 562 to 573 in 264e39c

// shrink buffer to the currently used portion.

// First, calculate the rough size of the buffer from the ratio that the number

// of digits follows. Then, shrink the size until there is no more space left.

// The Ratio is calculated as: log_{2^32}(10^9)

const double digitRatio = 0.934292276687070661;

currentBufferSize = Math.Min((int)(bufferSize * digitRatio) + 1, bufferSize);

Debug.Assert(buffer.Length == currentBufferSize || buffer[currentBufferSize] == 0);

while (0 < currentBufferSize && buffer[currentBufferSize - 1] == 0)

{

currentBufferSize--;

}

currentBuffer = buffer.Slice(0, currentBufferSize);

I rewrote it to use $\log_{2^{32}}(10)$ in pull request #97589, but it is essentially the same.

runtime/src/libraries/System.Runtime.Numerics/src/System/Number.BigInteger.cs

Lines 372 to 379 in 0d426df

const double digitRatio = 0.10381025297; // log_{2^32}(10)

int resultLength = checked((int)(digitRatio * number.Scale) + 1 + 2);

uint[]? resultBufferFromPool = null;

Span<uint> resultBuffer = (

resultLength <= BigIntegerCalculator.StackAllocThreshold

? stackalloc uint[BigIntegerCalculator.StackAllocThreshold]

: resultBufferFromPool = ArrayPool<uint>.Shared.Rent(resultLength)).Slice(0, resultLength);

resultBuffer.Clear();

huoyaoyuan · 2024-07-10T14:48:20Z

src/libraries/System.Runtime.Numerics/src/System/Number.BigInteger.cs

@@ -1109,6 +1210,26 @@ public static int OmittedLength(int index)
                return (MaxPartialDigits * (1 << index)) >> 5;
            }

+            public static void FloorBufferSize(int size, out int bufferSize, out int maxIndex)


Is it required to calculate exact buffer length? Can it be relaxed, and let the algorithm to strip unnecessary zeros?

Is there a concise way to calculate the inexact buffer length? It would be most concise to find out from the predefined buffer length.

dotnet-policy-service · 2024-08-10T18:45:05Z

Draft Pull Request was automatically closed for 30 days of inactivity. Please let us know if you'd like to reopen it.

kzrnm added 3 commits July 10, 2024 01:27

Add test

95b3901

ToString by DivideAndConquer

b8823ac

ToStringTestThreshold

a4dc235

dotnet-issue-labeler bot added the area-System.Numerics label Jul 10, 2024

dotnet-policy-service bot added the community-contribution Indicates that the PR has been added by a community member label Jul 10, 2024

huoyaoyuan reviewed Jul 10, 2024

View reviewed changes

This was referenced Jul 10, 2024

The Operation will be canceled. The next steps may not contain expected logs. dotnet/dnceng#3008

Open

The job running on agent NetCore-Public ran longer than the maximum time #104044

Closed

dotnet-policy-service bot closed this Aug 10, 2024

github-actions bot locked and limited conversation to collaborators Sep 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize BigInteger.ToString for large decimal string #104676

Optimize BigInteger.ToString for large decimal string #104676

kzrnm commented Jul 10, 2024 •

edited

Loading

dotnet-policy-service bot commented Jul 10, 2024

huoyaoyuan Jul 10, 2024 •

edited

Loading

kzrnm Jul 11, 2024 •

edited

Loading

huoyaoyuan Jul 10, 2024

kzrnm Jul 11, 2024

dotnet-policy-service bot commented Aug 10, 2024

	// shrink buffer to the currently used portion.
	// First, calculate the rough size of the buffer from the ratio that the number
	// of digits follows. Then, shrink the size until there is no more space left.
	// The Ratio is calculated as: log_{2^32}(10^9)
	const double digitRatio = 0.934292276687070661;
	currentBufferSize = Math.Min((int)(bufferSize * digitRatio) + 1, bufferSize);
	Debug.Assert(buffer.Length == currentBufferSize \|\| buffer[currentBufferSize] == 0);
	while (0 < currentBufferSize && buffer[currentBufferSize - 1] == 0)
	{
	currentBufferSize--;
	}
	currentBuffer = buffer.Slice(0, currentBufferSize);

	const double digitRatio = 0.10381025297; // log_{2^32}(10)
	int resultLength = checked((int)(digitRatio * number.Scale) + 1 + 2);
	uint[]? resultBufferFromPool = null;
	Span<uint> resultBuffer = (
	resultLength <= BigIntegerCalculator.StackAllocThreshold
	? stackalloc uint[BigIntegerCalculator.StackAllocThreshold]
	: resultBufferFromPool = ArrayPool<uint>.Shared.Rent(resultLength)).Slice(0, resultLength);
	resultBuffer.Clear();

Optimize BigInteger.ToString for large decimal string #104676

Optimize BigInteger.ToString for large decimal string #104676

Conversation

kzrnm commented Jul 10, 2024 • edited Loading

dotnet-policy-service bot commented Jul 10, 2024

huoyaoyuan Jul 10, 2024 • edited Loading

Choose a reason for hiding this comment

kzrnm Jul 11, 2024 • edited Loading

Choose a reason for hiding this comment

huoyaoyuan Jul 10, 2024

Choose a reason for hiding this comment

kzrnm Jul 11, 2024

Choose a reason for hiding this comment

dotnet-policy-service bot commented Aug 10, 2024

kzrnm commented Jul 10, 2024 •

edited

Loading

huoyaoyuan Jul 10, 2024 •

edited

Loading

kzrnm Jul 11, 2024 •

edited

Loading