Fix operations with self #35

pietroppeter · 2020-10-18T16:46:45Z

This was a bit tricky. Fixes #27 and problem with self multiplication.

Essentially the issue was that for a generic operation op(a: var BigInt, b: BigInt, c: BigInt) when calling setXLen on a.limbs if b is the same as a then b.limbs gets reset to empty inside op (and c is only involved for computing resetting length). For a minimal example that shows this (playground):

import sugar

var
  a = @[0]

proc op(a: var seq[int], b: seq[int]) =
  dump a.repr
  dump b.repr
  a.setLen(2)
  dump a.repr
  dump b.repr

op(a, a)

output:

a.repr = 0x7ff35fdfd050@[0]

b.repr = 0x7ff35fdfd050@[0]

a.repr = 0x7ff35fdfd1a0@[0, 0]

b.repr = 0x7ff35fdfd050@[]

Note that the "reset" is not always the case, in particular issue #27 if added to the tester.nim file did not raise an error, while it raises an error when in a separate file (tissue27.nim, added to tests). This is possibly due to the fact that when resetting length, in the heap there is space for lengthening the sequence without copying (probably the sequence was allocated in a space previously occupied by a longer sequence).

The fix I propose is to change behaviour of SetXLen to always create a new sequence and copying the content according to new length.

unrelated change: added .exe to .gitignore.

def- · 2020-10-18T21:53:57Z

src/bigints.nim

-    s = newSeq[T](newlen)
-  else:
-    s.setLen(newlen)
+  let t = s


I guess this change sacrifices performance. I'm not really against it though, just wanted to clarify.

Yes, I guess you are right. I think it makes sense to do a small benchmarking. Also I could make available old behaviour behind a switch (e.g. bigintsUseOldSetXLen). And if the benchmarking show too much of a degradation I could try to think of a new way to make the fix.

But the old behaviour is so easy to get wrong that the better performance is probably not important.

def- · 2020-10-18T21:54:18Z

bigints.nimble

@@ -1,6 +1,6 @@
 # Package

-version     = "0.4.4"
+version     = "0.4.5"


Should I release after this is merged? Maybe a larger number like 0.5 is appropriate?

Yes, if there is performance degradation it would probably call for an increase of minor version. In that case probably the best would be to tag current master as 0.4.4 (otherwise I am not sure nimble is able to pick up the correct version if asked to require bigints <= 0.4.4). And then after the changes we could tag also the new release. Anyway let's see after I do some benchmarking

pietroppeter · 2020-10-19T07:24:37Z

After review, open points:

add switch for old behaviour
benchmark old vs new behaviour
~~other options for the fix?~~
revise the version bump (minor increment?)

regarding other options I was thinking of having a check inside functions where SetXLen might cause issue. The check should check if we are using the same input and probably the best way would be to check if limbs have same addr. Need to think this through anyway.

pietroppeter · 2020-11-24T18:34:38Z

finally added the benchmarking against old setXLen. I am a bit disappointed that chainedAddition task almost doubles its time, so I am now thinking that I should consider other options for the fix as outlined in last comment.

For other comments regarding benchmark results: simple addition and multiplication are not impacted, which is kind of expected. I was surprised that chainedMultiplication is not impacted as much as chainedAddition: I guess this is because multiplication even with old setXLen needs to copy the sequence since the memory allocated is not sufficient to extend. In principle the optimal way to deal with var that grow is to preallocate a seq with enough capacity (if possible). I guess this is doable with current API (there is an init that can get as parameter the seq of limbs and this can be created with a preset capacity).

Below the automatic benchmarking report (available also here). A few notes on benchmark implementation:

the idea is to add full benchmarking to the repo (as a separate PR, I will clean up this one after it is completed)
benchmark is based on https://github.com/disruptek/criterion and is run again (including report generation) running nimble bench
regarding the strange labels for parameters (100d means using 2 100 digits numbers), this is partly due to a limitation of criterion in handling parameters. anyway looking at benchmark.nim should clarify what is being used.
idea is to implement all benchmark from https://peterolson.github.io/BigInteger.js/benchmark/
and also add other benchmarks such as those called chainedAddition and chainedMultiplication found in bigints source code (commented)
ideally the benchmarking in the master there should be between this library and bignum.

Benchmark results

addition

100d

	cpuTimes	CI(lower)	CI(upper)
benchmark.json	135.6	130.6	143.0
benchmark_oldSetXLen.json	139.4	133.6	147.9

chainedAddition

100d,3i

	cpuTimes	CI(lower)	CI(upper)
benchmark.json	907.7	883.0	933.5
benchmark_oldSetXLen.json	574.2	558.4	593.8

100d,33i

	cpuTimes	CI(lower)	CI(upper)
benchmark.json	7614.7	7535.1	7696.5
benchmark_oldSetXLen.json	4147.9	4098.1	4203.3

multiplication

100d

	cpuTimes	CI(lower)	CI(upper)
benchmark.json	460.8	449.5	479.0
benchmark_oldSetXLen.json	456.4	445.0	473.9

chainedMultiplication

10d,3i

	cpuTimes	CI(lower)	CI(upper)
benchmark.json	22098.0	21961.3	22266.7
benchmark_oldSetXLen.json	21685.0	21591.8	21807.4

pietroppeter · 2021-05-18T07:14:41Z

Sorry for leaving this open for such a long time... I think the easiest option would be actually merging the fix (bumping the version) and maybe opening an issue to track the possibility of further improvement. In the next days I plan to clean up this towards this goal.

pietroppeter · 2021-05-19T22:36:35Z

cleaned up (removed also the benchmarks, if this branch is not deleted the code should be still available in this commit) and ready to be merged.

also created the issue for tracking performance regression: https://github.com/def-/nim-bigints/issues/36

After merging remember to tag the release both of 0.5.0 (latest commit after merge) and of 0.4.4 (latest commit before merge), so that nimble can pick up requirements <= 0.4.4 in case someone wants to use it.

def- · 2021-05-20T08:34:38Z

Thanks, will tag them.

pietroppeter added 8 commits October 13, 2020 18:21

add failing tests

a4a2eb9

add *.exe in gitignore

ccb6e78

possible fix changing setXLen

9345f16

bump version and update readme

17aaa8f

add separate test for issue 27 also in CI

35e45ac

added test for multiplicationInt

008e32b

cleanup comments

2809551

cleanup task test27 in nimble

1622f25

def- reviewed Oct 18, 2020

View reviewed changes

pietroppeter marked this pull request as draft October 19, 2020 07:24

pietroppeter added 3 commits October 21, 2020 18:23

add switch for old setXLen behaviour

b5256d6

add benchmark

808947c

fix table generation in report.nim

af4e56c

pietroppeter added 3 commits May 20, 2021 00:17

removed benchmarks

7a9e550

renamed define for using legacy setXLen (and added comment)

ea1969a

bump minor version

d6c1a79

pietroppeter marked this pull request as ready for review May 19, 2021 22:33

pietroppeter mentioned this pull request May 19, 2021

avoid unnecessary copy in setXLen #36

Closed

def- merged commit c5ea3d5 into nim-lang:master May 20, 2021

pietroppeter mentioned this pull request Aug 19, 2022

Benchmarks #113

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix operations with self #35

Fix operations with self #35

pietroppeter commented Oct 18, 2020

def- Oct 18, 2020

pietroppeter Oct 19, 2020

def- Oct 19, 2020

def- Oct 18, 2020

pietroppeter Oct 19, 2020

pietroppeter commented Oct 19, 2020 •

edited

Loading

pietroppeter commented Nov 24, 2020

pietroppeter commented May 18, 2021

pietroppeter commented May 19, 2021 •

edited

Loading

def- commented May 20, 2021

Fix operations with self #35

Fix operations with self #35

Conversation

pietroppeter commented Oct 18, 2020

def- Oct 18, 2020

Choose a reason for hiding this comment

pietroppeter Oct 19, 2020

Choose a reason for hiding this comment

def- Oct 19, 2020

Choose a reason for hiding this comment

def- Oct 18, 2020

Choose a reason for hiding this comment

pietroppeter Oct 19, 2020

Choose a reason for hiding this comment

pietroppeter commented Oct 19, 2020 • edited Loading

pietroppeter commented Nov 24, 2020

Benchmark results

addition

100d

chainedAddition

100d,3i

100d,33i

multiplication

100d

chainedMultiplication

10d,3i

pietroppeter commented May 18, 2021

pietroppeter commented May 19, 2021 • edited Loading

def- commented May 20, 2021

pietroppeter commented Oct 19, 2020 •

edited

Loading

pietroppeter commented May 19, 2021 •

edited

Loading