Study mul256 #574

ckormanyos · 2024-05-18T08:31:45Z

The purpose of this PR is to simply insert parts of wide-integer's mul-256 into decimal. Let's see how CI shakes out.

This phenomenon was originally reported in #559.

ckormanyos · 2024-05-18T09:23:06Z

Hard error but very confusing on Ubuntu, cbrt(128-bit) for argument close to $1.0$ hangs, not fails but hangs indefinitely.

It works just fine on MSVC. Will now investigate why/what's up on LINUX. It also hangs on WSL2 so I'll look there locally first.

ckormanyos · 2024-05-19T11:28:52Z

Hi Matt (@mborland) I took the opportunity to fully refactor sqrt() and cbrt() and their tests. Interesting point: When I had initially started with mul-256 for some strange reason the cube-root test was hanging. I also found that several tens of Newton iterations were being performed in some cases.

So for sqrt() and cbrt() I used a uniform estimation scheme providing slightly more than $2$ decimal digits of accuracy in the initial guess on the interval $1/10 {\leq} x < 1$. This results excellent results for exactly in $2$, $3$ or $4$ iterations for 32/64/128 bits respectively. I added and strengthened the tests and am now hammering the new multiplication and roots.

When implementing $8{\times}8$ multiplication, I opted to unroll the loop. I could add some specialized optimitations for multiplication $4{\times}4$ or $8{\times}2$, but we can do that later as needed.

ckormanyos · 2024-05-19T11:35:56Z

Oh yes and also Matt (@mborland) there was, in fact, a hard error lurking around somewhere in the original $256{\times}256$-bit multiplication. So these changes (or something like them) are seemingly needed.

I can now reduce significantly all tolerances on decimal128-tests. I was tempted to gut out the whole thing and put in a small subset of wide-integer, but we've got no time for that at the moment. So I opted to simply port over a few routines.

Now the test codes in #559 all pass.

ckormanyos · 2024-05-19T17:08:33Z

... can now reduce significantly all tolerances on decimal128-tests

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 98.7%. Comparing base (04f5974) to head (fe4392e).

Additional details and impacted files

@@            Coverage Diff            @@
##           develop    #574     +/-   ##
=========================================
+ Coverage     98.7%   98.7%   +0.1%     
=========================================
  Files          195     195             
  Lines        12014   12472    +458     
  Branches      1419    1455     +36     
=========================================
+ Hits         11846   12307    +461     
+ Misses         168     165      -3

Files	Coverage Δ
include/boost/decimal/detail/cmath/cbrt.hpp	`100.0% <100.0%> (ø)`
include/boost/decimal/detail/cmath/sqrt.hpp	`100.0% <100.0%> (+5.3%)`	⬆️
include/boost/decimal/detail/emulated128.hpp	`96.5% <100.0%> (+0.2%)`	⬆️
include/boost/decimal/detail/emulated256.hpp	`100.0% <100.0%> (+1.0%)`	⬆️
...e/boost/decimal/detail/wide-integer/uintwide_t.hpp	`100.0% <100.0%> (ø)`
test/test_atan.cpp	`100.0% <100.0%> (ø)`
test/test_big_uints.cpp	`100.0% <100.0%> (ø)`
test/test_cbrt.cpp	`100.0% <100.0%> (ø)`
test/test_erf.cpp	`100.0% <ø> (ø)`
test/test_exp.cpp	`100.0% <100.0%> (ø)`
... and 8 more

Continue to review full report in Codecov by Sentry.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 04f5974...fe4392e. Read the comment docs.

ckormanyos · 2024-05-20T08:05:34Z

I was concerned about changing performance. Multiplication stays about the same. I also benchmarked sqrt() which has improved in the mul256 branch compared to the develop branch. The tables are from MSVC 142, with similar trends on Ubuntu in WSL2.

Multiplication	`mul256`	`develop`
Multiplication `<float >`	443	558
Multiplication `<double >`	476	609
Multiplication `<decimal32 >`	37383	37159
Multiplication `<decimal64 >`	125178	123256
Multiplication `<decimal128>`	397336	397468
Multiplication `<dec32_fast>`	32281	32995

`sqrt()`	`mul256`	`develop`
`sqrt<float >`	1053	1353
`sqrt<double >`	2908	2884
`sqrt<decimal32 >`	417289	666491
`sqrt<decimal64 >`	1604746	2181287
`sqrt<decimal128>`	15742232	23100960

ckormanyos · 2024-05-20T11:56:13Z

Hi Matt (@mborland) before leaving this one, I found another issue in uint128 when performing $128{\times}64$-bit multiplication. I'll fix this here in this branch (although not with the highest possible efficiency).

At some time in the future, I'll take a few morning sessions and make a dedicated, C++14-friendly uint-whatever (like $128$, $256$) for this work. But we can probably push that off until the interfacing and most of the known numerical problems have been eliminated.

ckormanyos · 2024-05-20T16:07:11Z

This thing fixes a bunch of pesky problems.

Repair $128$ and $256$-bit multiplication, but retain whatever performance we have (repair temporary permanent fiy planned this year).
Improve sqrt() and cbrt().
Eliminate spurious errors on standard function tests via eliminating troublesome ranges not needed for basic tests.
Reduce decimal128 tolerances on a bunch of tests.

ckormanyos added 2 commits May 18, 2024 10:19

Start using wide-integer in mul-256

598f480

Tighten up tols selectively on tgamma()

53b9388

ckormanyos added 4 commits May 19, 2024 11:24

Unroll 8x8 mul and repair sqrt/cbrt

8498fa6

Refactor cbrt() tests

9894981

Remove unusde variable

d02216c

Simplify edge cases and add cover tests

9160e2e

Handle unused var and add yet more tests

c70b8c3

ckormanyos mentioned this pull request May 19, 2024

uint256_t multiplication is incorrect #559

Closed

ckormanyos added 4 commits May 19, 2024 14:23

Handle yet another warning/error

bec7819

Handle yet another compiler wanr/error

50635cf

Handle a troubled lgamma range and spelling

e6b5436

Handle compiler warning/error and casts

e5e8d24

Increase a tgamma() test-tolerance

bf3b2a7

ckormanyos requested a review from mborland May 19, 2024 17:51

Improve negative-arg cbrt() and syntax

96b950a

mborland reviewed May 20, 2024

View reviewed changes

Add new constants and correct trivial errors

30936d7

ckormanyos added 4 commits May 20, 2024 13:58

Fix a uint128 problem and add tests

d93adb8

Merge branch 'develop' into study_mul256

101407c

Handle conversion error in a test case

fe4305f

Tighten 128-bit tols and cover-system-files

fe4392e

ckormanyos merged commit 10089df into develop May 20, 2024
62 checks passed

ckormanyos deleted the study_mul256 branch May 20, 2024 16:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Study mul256 #574

Study mul256 #574

ckormanyos commented May 18, 2024 •

edited

Loading

ckormanyos commented May 18, 2024 •

edited

Loading

ckormanyos commented May 19, 2024

ckormanyos commented May 19, 2024 •

edited

Loading

ckormanyos commented May 19, 2024

ckormanyos commented May 19, 2024

codecov bot commented May 20, 2024 •

edited

Loading

ckormanyos commented May 20, 2024

ckormanyos commented May 20, 2024 •

edited

Loading

ckormanyos commented May 20, 2024

Study mul256 #574

Study mul256 #574

Conversation

ckormanyos commented May 18, 2024 • edited Loading

ckormanyos commented May 18, 2024 • edited Loading

ckormanyos commented May 19, 2024

ckormanyos commented May 19, 2024 • edited Loading

ckormanyos commented May 19, 2024

ckormanyos commented May 19, 2024

codecov bot commented May 20, 2024 • edited Loading

Codecov Report

ckormanyos commented May 20, 2024

ckormanyos commented May 20, 2024 • edited Loading

ckormanyos commented May 20, 2024

ckormanyos commented May 18, 2024 •

edited

Loading

ckormanyos commented May 18, 2024 •

edited

Loading

ckormanyos commented May 19, 2024 •

edited

Loading

codecov bot commented May 20, 2024 •

edited

Loading

ckormanyos commented May 20, 2024 •

edited

Loading