Refactor BigDecimal #10641

straight-shoota · 2021-04-17T12:38:16Z

While looking into current issues around BigDecimal (#10502, #5714, #10599, #9547, #9578) it became apparent that the current implementation is lacking.

Our implementation is based on https://github.com/akubera/bigdecimal-rs but worse. And even that is IMO not the best role model.

A pretty extensive implementation with great documentation is Python's decimal library. Julia's Decimals.jl is another good example. There is actually a technical standard for decimal floating point numbers in IEEE 754.

A few observations about the data format in comparison with Python/Julia implementations:

BigDecimal#scale is an unsigned integer and represents the number of digits the decimal point is shifted to the left, i.e. a negative exponent to the base 10. The number is described by the formula value * 10 ** -scale. In the other implementations, the scale is a signed exponent, which allows shifting the decimal point in both directions. Large numbers could already be expressed by a large enough BigInt value for the coefficient, but a signed exponent also expresses precision. Their formula is value * 10 ** exponent, so scale is the same value as exponent but with opposite sign.
In the other implementations the coefficient is always non-negative and a separate sign flag determines whether the value is positive or negative. The practical effect is that there are signed zero values (like in binary floating point types).
Similarly, the special values NaN, +Infinity and -Infinity are missing in our implementation.
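
To make the comparison concrete, here is a small sketch using Python's decimal module, which exposes the sign flag, coefficient digits and signed exponent through `as_tuple()`. The helper `scale_of` is hypothetical and only illustrates the "same value, opposite sign" relationship described above; it is not part of either library.

```python
from decimal import Decimal

# Python's decimal stores (sign, digits, exponent); a finite value is
# (-1) ** sign * coefficient * 10 ** exponent.
small = Decimal("1.25").as_tuple()    # coefficient 125, exponent -2
large = Decimal("12E+3").as_tuple()   # coefficient 12, exponent +3


def scale_of(d: Decimal) -> int:
    """Hypothetical helper: a scale in the sense of BigDecimal#scale.

    It is the exponent with the opposite sign, so it turns negative
    for values such as 12E+3, which an unsigned scale cannot hold.
    """
    return -d.as_tuple().exponent


# A separate sign flag makes a signed zero representable.
neg_zero = Decimal("-0")
zeros_compare_equal = neg_zero == Decimal("0")
neg_zero_is_signed = neg_zero.is_signed()

# Special values exist alongside the finite ones.
nan_is_nan = Decimal("NaN").is_nan()
neg_inf_is_infinite = Decimal("-Infinity").is_infinite()
```

With an unsigned scale, `Decimal("12E+3")` would have to be stored as coefficient 12000 with scale 0, which loses the information that only two digits were given.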

The specs for BigDecimal are pretty sparse for most methods, so I wouldn't be surprised to discover more bugs like #10502.

The other Big* types are mostly wrappers for libgmp, so there's less chance for error on our side. BigDecimal is the only type implemented in Crystal.


asterite · 2021-04-17T12:48:37Z

It would be really great to have decimal implemented exclusively in Crystal. If that were the case, we could even introduce a literal for them. They are very useful, for example for representing money amounts. C# has a built-in decimal type.

HertzDevil · 2021-04-17T14:12:46Z

Finite precision, or arbitrary precision? (C#'s is the former, but it isn't one of the IEEE decimal formats either)

asterite · 2021-04-17T15:57:02Z

I don't know

straight-shoota · 2021-04-17T16:51:29Z

Finite precision floating-point decimal numbers should be relatively trivial to implement.

For representing monetary values, however, you would be better off using a fixed-point data type than a floating-point one (when precision is finite). There's no point in being able to represent values in larger orders of magnitude when the primary concern is to be exact down to a certain number of decimal digits (usually 4 in monetary applications).
I don't see too much value in such a data type, at least not in stdlib. In the end, it's just the same as using an integer data type and interpreting the value 1 as one ten-thousandth of the respective monetary base unit.
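
The integer interpretation mentioned above can be sketched in a few lines of Python. Everything here is illustrative: the names `SCALE`, `to_units` and `format_units` are made up for this example, and the four decimal digits are just the figure quoted in this comment.

```python
SCALE = 10_000  # assumption: 4 decimal digits, so 1 unit = 1/10000 of the base unit


def to_units(text: str) -> int:
    """Parse a decimal string such as '12.34' into integer ten-thousandths."""
    sign = -1 if text.startswith("-") else 1
    whole, _, frac = text.lstrip("+-").partition(".")
    if len(frac) > 4:
        raise ValueError("more than 4 decimal digits cannot be represented")
    frac = frac.ljust(4, "0")  # pad '34' to '3400'
    return sign * (int(whole or "0") * SCALE + int(frac))


def format_units(units: int) -> str:
    """Render integer ten-thousandths back as a decimal string."""
    sign = "-" if units < 0 else ""
    whole, frac = divmod(abs(units), SCALE)
    return f"{sign}{whole}.{frac:04d}"


# Addition and subtraction are plain integer operations and therefore exact.
total = to_units("0.1") + to_units("0.2")
```

Addition and subtraction stay exact, but multiplication and division need explicit rescaling and a rounding rule, which is the part a dedicated fixed-point type would have to provide.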

Implementing arbitrary precision floating-point decimal numbers essentially means we have to implement arbitrary precision integers (BigInt) in Crystal, too. That might be nice as a long term goal, but I don't think there's much value in that. Reusing an existing library like libgmp is fine and saves us a lot of trouble.

Sija · 2021-04-17T17:50:10Z

> There's no point in being able to represent values in larger orders of magnitude when the primary concern is to be exact down to a certain number of decimal digits (usually 4 in monetary applications).

That's only correct for fiat currencies. Cryptocurrencies can have as many as 18 decimal places (see Ether for example).

HertzDevil · 2021-06-09T18:18:27Z

Apart from making scale signed, we should also make sure that all BigDecimals are normalized, i.e. powers of 10 are always tracked by scale, whereas value (which is really the mantissa or significand) is never divisible by 10, as if every constructor invoked factor_powers_of_ten. Other than that I think the current BigDecimal is actually fine.
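
As a rough illustration of that invariant, the following Python sketch normalizes a (value, exponent) pair with a signed exponent, where the number is value * 10 ** exponent. The function name `normalize` and the choice to represent zero as (0, 0) are assumptions made for this example; this is not the actual factor_powers_of_ten code.

```python
from typing import Tuple


def normalize(value: int, exponent: int) -> Tuple[int, int]:
    """Move every factor of 10 out of `value` and into `exponent`.

    The represented number value * 10 ** exponent is unchanged.
    Zero has no unique normal form; (0, 0) is an assumption here.
    """
    if value == 0:
        return 0, 0
    while value % 10 == 0:
        value //= 10   # drop one trailing zero from the coefficient
        exponent += 1  # and compensate in the exponent
    return value, exponent


# 1500 * 10 ** -3 and 15 * 10 ** -1 are both 1.5.
fraction = normalize(1500, -3)
# 1200 * 10 ** 0 becomes 12 * 10 ** 2, which needs a positive exponent.
integer = normalize(1200, 0)
```

The second case shows why the two changes go together: a whole number with trailing zeros only has a normalized form if the exponent may be positive, i.e. if scale may be negative.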

I don't think BigDecimal needs to port all the features of IEEE 754 floats either.

straight-shoota · 2021-06-10T11:30:21Z

Technically, non-normalized values could be used to express precision. But we don't offer access to that and many implementations already apply normalization, so making it the default sounds good.
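
For comparison, Python's decimal module is one implementation that does keep non-normalized values to carry precision, and offers normalization as an explicit step. A short sketch of that behaviour:

```python
from decimal import Decimal

a = Decimal("1.50")
b = Decimal("1.5")

# Equal in value, but stored with different exponents.
same_value = a == b
exponents = (a.as_tuple().exponent, b.as_tuple().exponent)

# Trailing zeros survive arithmetic: the exponents of the operands add up.
product = Decimal("1.50") * Decimal("2.0")

# normalize() strips trailing zeros and shifts them into the exponent.
stripped = a.normalize()
hundred = Decimal("100").normalize()

print(str(a), str(product), str(stripped), str(hundred))
```

Normalizing by default means a string like "1.50" would not round-trip with its trailing zero, which is the precision information being given up.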

seyerian · 2021-08-08T16:29:00Z

I just created #11076 which is related to how BigRational is converted to BigDecimal using core methods. That's not a huge issue but I thought it was worth a mention here.

Are there any plans for BigDecimal beyond this GitHub issue? I'm interested in contributing here, if this is an appropriate place to start.

straight-shoota added status:discussion kind:refactor topic:stdlib:numeric labels Apr 17, 2021

HertzDevil mentioned this issue Jun 9, 2021

Optimize BigDecimal#div for inexact divisions #10803

Merged

This was referenced Mar 11, 2022

Fix E notation parsing in BigDecimal #9577

Merged

RFC: E notation BigDecimal parser #9581

Draft
