Should BigDecimal always be normalized? #26

littledan · 2019-11-17T17:25:08Z

This proposal starts off with the idea that trailing zeroes should be preserved, for reasons discussed in various comments in the Decimal FAQ. Should we instead normalize BigDecimals, i.e., make "precision" not a meaningful part of the BigDecimal values, just of operations on the data? Some comments from @ljharb in various threads (#11 #12) made me suspect he was leaning in this direction.

domenic · 2019-11-17T17:55:20Z

I think a bit more legwork needs to be done to justify the statements in http://speleotrove.com/decimal/decifaq1.html#tzeros in the context of JavaScript and the context of this proposal. For example,

End users expect trailing zeros to be preserved in addition, subtraction, and multiplication (as in the 1.23 + 1.27 example). This is especially true for financial and commercial applications. If the zeros are not preserved, users are surprised, frustrated, and lose confidence in the application.

seems, as an end-user concern, to be mostly targeted at formatting, and so could be solved in other ways (e.g. .toString(2) or the corresponding Intl.NumberFormat invocation).

Similarly, most of the bullet points following that one really seem to be user-facing formatting concerns, or in one case database-facing serialization concerns. They are not strongly supportive of the notion that decimals should carry along precision information with them internally, throughout the program.

I think it would be helpful to approach this from the perspective of observable consequences for JavaScript programmers. Those seem to be:

Whether you can carry around a "default number of digits" which will show up at formatting time---assuming that there is a formatting method (whether toString() or something else) which will draw upon this information.
Whether various equality operations will compare different-precision values as equal, e.g. whether 1.23m + 1.27m === 2.5m holds, or the same question for SameValue/SameValueZero/==. This answer can be different for each equality relation.
Anything else?

littledan · 2019-11-17T18:15:10Z

There's also round-tripping to external places where decimals are held in other formats.

domenic · 2019-11-17T18:34:28Z

Good point. Although it sounds like from #16 the plan is already to have that format be non-interoperable between machines, so the bar already seems pretty low there in terms of what guarantees programmers can rely on.

(I suppose you could also be referring to other serialization mechanisms besides the BigDecimalXArrays, but those seem to fall into the "formatting time" bucket I already included.)

qzb · 2019-11-17T18:52:20Z

Representation of financial amounts could be also addressed with some F#-like units system, (extended numeric literals proposal seems to be a step in that direction), so storing precision within BigDecimal is not an only option.

ljharb · 2019-11-17T19:10:54Z

BigDecimal wouldn’t be storing currency either - precision seems to me in the same category: things that you need to preserve explicitly alongside the amount (the BigDecimal)

littledan · 2019-11-17T20:47:47Z

@domenic The plan in that issue was to leave two options, and have DataView methods to select one or the other. This is parallel to TypedArrays and endianness. Anyway, that's just one serialization format for interacting with certain systems; you could do other things in JS as well.

LeszekSwirski · 2019-11-18T03:29:04Z

I'm unclear about the precision semantics for operations on mixed precision BigDecimal -- would it be right to assume that we take for C = A + B, precision_C = max(precision_A, precision_B)? Does that mean that high precision is "infectious" and cannot be removed? I'm asking in particular since the FAQ motivating preserving trailing zeros sometimes defines precision by total digits, rather than digits after the decimal point (e.g. the Euro exchange rate rules).

littledan · 2019-11-18T07:52:13Z

@LeszekSwirski Those are the semantics I was picturing for +. The precision could be reduced with a .round() method, as we're discussing in #14 . Probably the README should be more concrete and explicit about all this, with code samples. (PRs welcome!)

littledan · 2019-11-18T12:33:10Z

Due to the feedback so far, I'd suggest that we switch to a normalized data model, as in #29 .

littledan · 2020-01-08T17:48:24Z

Fabrice Bellard said the following about normalization:

I strongly support normalized decimal floating point. Unnormalized
decimal FP adds a non negligible complexity in terms of specification
and implementation. Unfortunately IEEE 754-2008 and Java BigDecimal
allow unnormalized values. I don't know to what extend they are really
used in financial applications.

Unfortunately most of the decimal floating point specifications require unnormalized representations so it will be difficult to convince some people to be only normalized.

peteroupc · 2020-02-07T05:46:53Z

I think that the advantages and disadvantages of normalized and unnormalized arithmetic at this part of the FAQ (first two questions) should be considered. These two questions more directly address the merits of normalized vs. unnormalized decimal arithmetic than the issue of trailing zeros does. These were apparently written by Mike Cowlishaw, but they are quite dated -- does he still think these advantages and disadvantages exist?

Also, in my opinion, normalization of decimal numbers should only affect how those numbers are compared and formatted (in the sense that, say, 0.5 and 0.50 are the same as far as the application is concerned, but that either value is normalized by default to 0.5 at the moment of formatting); it should not preclude implementations from storing and operating on decimal numbers, internally, in either unnormalized or normalized representations.

littledan · 2020-02-07T14:30:23Z

@peteroupc Good points, thanks for writing in.

does he still think these advantages and disadvantages exist?

I've seen no indication that he changed his mind. I don't think we'll get everyone in the world to agree on this tradeoff--that's probably why we see a diversity of answers among other systems with decimals. What's your opinion?

it should not preclude implementations from storing and operating on decimal numbers, internally, in either unnormalized or normalized representations.

I agree. In general, we'll only specify the observable semantics, which will relate to arithmetic, comparison and formatting. Implementations will be free to use unnormalized or normalized logic internally, as normalization "commutes"--you could do it before or afterwards, and there will be the same answer.

In everything I've seen him say and write, @waldemarhorwat has been very careful to phrase this as, there should be no way to distinguish among decimals in a cohort; I've been a little more sloppy and intuitive when I talk about these things.

sffc · 2023-09-27T05:15:32Z

As mentioned in plenary and other conversations, Intl.PluralRules needs non-normalized decimals. "1 email" and "1.0 emails" are different.

Yaffle · 2024-01-24T08:23:24Z

So toString exposes this, and comparision ignores.
Does it affect toExponential/toPrecision/toFixed?
There are coefficient and exponent getters, which will expose non-normalized decimals, unless they are speced to return as it was normalized.
And if there is a plan to have a typed array or conversion to standard binary representations this will also expose normalization or lack of it.

LeszekSwirski mentioned this issue Nov 18, 2019

BigDecimal vs Rational #6

Open

qzb mentioned this issue Nov 18, 2019

Should decimals with different precisions be incompatible? #28

Closed

eemeli mentioned this issue Jun 6, 2024

Intl.PluralRules support #154

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Should BigDecimal always be normalized? #26

Should BigDecimal always be normalized? #26

littledan commented Nov 17, 2019

domenic commented Nov 17, 2019

littledan commented Nov 17, 2019

domenic commented Nov 17, 2019 •

edited

Loading

qzb commented Nov 17, 2019

ljharb commented Nov 17, 2019

littledan commented Nov 17, 2019

LeszekSwirski commented Nov 18, 2019

littledan commented Nov 18, 2019

littledan commented Nov 18, 2019

littledan commented Jan 8, 2020 •

edited

Loading

peteroupc commented Feb 7, 2020 •

edited

Loading

littledan commented Feb 7, 2020

sffc commented Sep 27, 2023

Yaffle commented Jan 24, 2024

Should BigDecimal always be normalized? #26

Should BigDecimal always be normalized? #26

Comments

littledan commented Nov 17, 2019

domenic commented Nov 17, 2019

littledan commented Nov 17, 2019

domenic commented Nov 17, 2019 • edited Loading

qzb commented Nov 17, 2019

ljharb commented Nov 17, 2019

littledan commented Nov 17, 2019

LeszekSwirski commented Nov 18, 2019

littledan commented Nov 18, 2019

littledan commented Nov 18, 2019

littledan commented Jan 8, 2020 • edited Loading

peteroupc commented Feb 7, 2020 • edited Loading

littledan commented Feb 7, 2020

sffc commented Sep 27, 2023

Yaffle commented Jan 24, 2024

domenic commented Nov 17, 2019 •

edited

Loading

littledan commented Jan 8, 2020 •

edited

Loading

peteroupc commented Feb 7, 2020 •

edited

Loading