Optimize ScalarMult with NAF #10

jimmysong · 2014-09-28T02:52:57Z

Use Non-Adjacent Form (NAF) of large numbers to reduce ScalarMult computation times.

Preliminary results indicate around a 8-9% speed improvement according to BenchmarkScalarMult.

The algorithm used is 3.77 from Guide to Elliptical Curve Crytography by Hankerson, et al.

This closes #3

coveralls · 2014-09-28T02:54:48Z

Coverage decreased (-0.02%) when pulling d8b15e8 on jimmysong:3 into d694428 on conformal:master.

coveralls · 2014-09-28T03:01:10Z

Coverage increased (+0.37%) when pulling c4bb9ca on jimmysong:3 into d694428 on conformal:master.

coveralls · 2014-09-28T03:03:38Z

Coverage increased (+0.17%) when pulling c9b455f on jimmysong:3 into d694428 on conformal:master.

coveralls · 2014-09-28T11:54:39Z

Coverage increased (+0.38%) when pulling 867aabe on jimmysong:3 into d694428 on conformal:master.

coveralls · 2014-09-28T12:27:56Z

Coverage increased (+0.37%) when pulling 94680e9 on jimmysong:3 into d694428 on conformal:master.

coveralls · 2015-01-22T16:29:11Z

Coverage decreased (-0.04%) to 97.82% when pulling 99aa45e on jimmysong:3 into f9365fd on btcsuite:master.

davecgh · 2015-01-22T16:39:19Z

@jimmysong Thanks for rebasing. I was planning to get these merged before the btcec repo is merged into btcd.

davecgh · 2015-02-01T18:01:57Z

@jimmysong Can you rebase this again? There is probably going to be a conflict since the PrintBytePoints stuff has been changed. I'll be pushing to get this and the other pr in next week.

coveralls · 2015-02-01T19:54:04Z

Coverage increased (+0.4%) to 97.32% when pulling 26217f9 on jimmysong:3 into 9535058 on btcsuite:master.

jimmysong · 2015-02-01T19:55:58Z

@davecgh rebased and ready to go.

coveralls · 2015-02-03T16:05:06Z

Coverage increased (+0.42%) to 97.13% when pulling e750009 on jimmysong:3 into 46829e8 on btcsuite:master.

davecgh · 2015-02-03T20:02:40Z

$ golint ./...
btcec.go:666:9: if block ends with a return statement, so drop this else and outdent its block

This implements a speedup to ScalarMult using the endomorphism available to secp256k1. Note the constants lambda, beta, a1, b1, a2 and b2 are from here: https://bitcointalk.org/index.php?topic=3238.0 Preliminary tests indicate a speedup of between 17%-20% (BenchScalarMult). More speedup can probably be achieved once splitK uses something more like what fieldVal uses. Unfortunately, the prime for this math is the order of G (N), not P. Note the NAF optimization was specifically not done as that's the purview of another issue. Changed both ScalarMult and ScalarBaseMult to take advantage of curve.N to reduce k. This results in a 80% speedup to large values of k for ScalarBaseMult. Note the new test BenchmarkScalarBaseMultLarge is how that speedup number can be checked. This closes btcsuite#1

davecgh · 2015-02-04T15:38:30Z

I haven't narrowed it down to this PR or the endomorphism one, but I suspect it's this one that is the cause. The memory usage has skyrocketed. I let btcd run with --nocheckpoints to force all of the script validation and ecc to run and it was over 1.6GB after a few hours. Without these PRs it's aroud 200MB. I'm going to let it run with just the endomorphism to verify.

davecgh · 2015-02-04T18:00:04Z

Ok, I've verified this PR is the culprit. I've been running the endomorphism PR for a couple of hours now and memory usage is stable and very similar to master.

jimmysong · 2015-02-04T21:35:25Z

@davecgh I've implemented the speedup to NAF as you've asked. I put this in a separate commit so you don't have to figure out what's changed. But basically, I used your suggestion to use byte arrays instead of a large int array.

davecgh · 2015-02-05T04:38:47Z

btcec.go

+	// P1 below is P in the equation, P2 below is ϕ(P) in the equation
+	p1x, p1y := curve.bigAffineToField(Bx, By)
+	// For NAF, we need the negative point
+	p1yNeg := new(fieldVal).Set(p1y).Negate(1)


Minor optimization here. If you use NegateVal you can negate and set the value in one operation without having to copy it first with Set.

p1yNeg := new(fieldVal).NegateVal(p1y, 1)

davecgh · 2015-02-05T05:17:07Z

btcec.go

+// non-zero.
+// The algorithm here is from Guide to Elliptical Cryptography 3.30 (ref above)
+// Essentially, this makes it possible to minimize the number of operations
+// since the resulting ints returned will be at least 50% 0's.


This comment is no longer accurate.

should be fixed. not sure why it's not outdated yet.

davecgh · 2015-02-05T05:36:10Z

Feel free to squash the two commits. I'm done reviewing the changes. Thanks for splitting them out for review as it made it easier!

I've been running on the new NAF code since this afternoon and memory usage is now stable and similar to the memory usage on master. I also noticed the speed increase. Nicely done!

Use Non-Adjacent Form (NAF) of large numbers to reduce ScalarMult computation times. Preliminary results indicate around a 8-9% speed improvement according to BenchmarkScalarMult. The algorithm used is 3.77 from Guide to Elliptical Curve Crytography by Hankerson, et al. This closes btcsuite#3

jimmysong · 2015-02-05T14:29:12Z

Squashed.

jimmysong force-pushed the 3 branch from d8b15e8 to c4bb9ca Compare September 28, 2014 02:59

jimmysong force-pushed the 3 branch from c4bb9ca to c9b455f Compare September 28, 2014 03:01

jimmysong force-pushed the 3 branch from c9b455f to 867aabe Compare September 28, 2014 11:52

jimmysong force-pushed the 3 branch from 867aabe to 94680e9 Compare September 28, 2014 12:25

jimmysong force-pushed the 3 branch from 94680e9 to 99aa45e Compare January 22, 2015 16:14

jimmysong mentioned this pull request Jan 22, 2015

Optimize ScalarMult using endomorphism #8

Merged

jimmysong force-pushed the 3 branch from 99aa45e to 26217f9 Compare February 1, 2015 19:52

jimmysong force-pushed the 3 branch 2 times, most recently from e71e1b8 to e750009 Compare February 3, 2015 15:59

jimmysong force-pushed the 3 branch from e750009 to 46c1953 Compare February 3, 2015 20:06

jimmysong force-pushed the 3 branch from 46c1953 to 93ff1a5 Compare February 3, 2015 20:14

davecgh reviewed Feb 5, 2015
View reviewed changes

jimmysong force-pushed the 3 branch from d9a2e26 to 1183b60 Compare February 5, 2015 05:16

davecgh reviewed Feb 5, 2015
View reviewed changes

jimmysong force-pushed the 3 branch 2 times, most recently from d42f2eb to 7a364d7 Compare February 5, 2015 05:21

jimmysong force-pushed the 3 branch from 7a364d7 to 6c36218 Compare February 5, 2015 14:28

conformal-deploy merged commit 6c36218 into btcsuite:master Feb 5, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize ScalarMult with NAF #10

Optimize ScalarMult with NAF #10

jimmysong commented Sep 28, 2014

coveralls commented Sep 28, 2014

coveralls commented Sep 28, 2014

coveralls commented Sep 28, 2014

coveralls commented Sep 28, 2014

coveralls commented Sep 28, 2014

coveralls commented Jan 22, 2015

davecgh commented Jan 22, 2015

davecgh commented Feb 1, 2015

coveralls commented Feb 1, 2015

jimmysong commented Feb 1, 2015

coveralls commented Feb 3, 2015

davecgh commented Feb 3, 2015

davecgh commented Feb 4, 2015

davecgh commented Feb 4, 2015

jimmysong commented Feb 4, 2015

davecgh Feb 5, 2015

davecgh Feb 5, 2015

jimmysong Feb 5, 2015

davecgh commented Feb 5, 2015

jimmysong commented Feb 5, 2015

Optimize ScalarMult with NAF #10

Optimize ScalarMult with NAF #10

Conversation

jimmysong commented Sep 28, 2014

coveralls commented Sep 28, 2014

coveralls commented Sep 28, 2014

coveralls commented Sep 28, 2014

coveralls commented Sep 28, 2014

coveralls commented Sep 28, 2014

coveralls commented Jan 22, 2015

davecgh commented Jan 22, 2015

davecgh commented Feb 1, 2015

coveralls commented Feb 1, 2015

jimmysong commented Feb 1, 2015

coveralls commented Feb 3, 2015

davecgh commented Feb 3, 2015

davecgh commented Feb 4, 2015

davecgh commented Feb 4, 2015

jimmysong commented Feb 4, 2015

davecgh Feb 5, 2015

Choose a reason for hiding this comment

davecgh Feb 5, 2015

Choose a reason for hiding this comment

jimmysong Feb 5, 2015

Choose a reason for hiding this comment

davecgh commented Feb 5, 2015

jimmysong commented Feb 5, 2015