Faster proofs for arm/proofs/bignum_k{mul,sqr}_*_neon.ml #121

aqjune-aws · 2024-04-17T06:42:34Z

(Marked as a draft because (1) this is made on top of #118, (2) the speedup needs to be measured on the CI check machines, and (3) the proofs may fail due to my mistake)

This makes the proofs of arm/proofs/bignum_k{mul,sqr}_{16_32,32_64}_neon.ml
faster by unusing the inlined hoare triple proofs for local 8->16 multiplication
and instead using the corresponding lemmas in bignum_{mul,sqr}_8_16_neon.ml.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

This patch adds `bignum_mont{sqr,mul}_p256_neon` functions. These are vectorized and instruction-rescheduled versions of `bignum_mont{sqr,mul}_p256`. They are verified using the equivalence checking tactics. A new bash script `tools/external/slothy.sh` is added to help reproduce the optimized output. The 'intermediate' functions of the two functions are written as comments in the two assembly files. Additionally, - A new instruction `umull2` is formalized add added to the simulator in order to verify the new functions. - Old `*_neon` functions' proofs are refactored a bit.

This makes the proofs of `arm/proofs/bignum_k{mul,sqr}_{16_32,32_64}_neon.ml` faster by unusing the inlined hoare triple proofs for local 8->16 multiplication and instead using the corresponding lemmas in bignum_{mul,sqr}_8_16_neon.ml.

aqjune-aws added 2 commits April 10, 2024 17:43

aqjune-aws closed this Apr 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Faster proofs for arm/proofs/bignum_k{mul,sqr}_*_neon.ml #121

Faster proofs for arm/proofs/bignum_k{mul,sqr}_*_neon.ml #121

aqjune-aws commented Apr 17, 2024

Faster proofs for arm/proofs/bignum_k{mul,sqr}_*_neon.ml #121

Faster proofs for arm/proofs/bignum_k{mul,sqr}_*_neon.ml #121

Conversation

aqjune-aws commented Apr 17, 2024