ML-KEM: Bring encoding functions in line with the spec #160

marsella · 2024-10-21T19:08:42Z

Addresses part of #144, but does not complete it.

This handles all the encode / decode parts of #144. It was getting a little hefty as-is so I'm breaking it into its own PR.

This was mainly a project in mucking around with types. The spec assumes that all arithmetic operations can be done on all integral types, but that's not true in Cryptol. For example, you can't multiply a 1-bit vector by 2, and you can't mod an integer mod q by 2. These things "just happen" in the spec, but in this implementation we have to manually convert types between bit vectors and integers mod, and sometimes we have to use more width-independent operations (like bit shifting instead of multiplying by 2^j).

The original plan for this PR was to combine the ByteEncode and ByteEncode12 functions into one (and similarly for decoding). However in order to do so I would've had to represent d-bit vectors as integers mod 2^d (e.g. using the Z type), and that was ugly, hard to understand, and slow to prove any properties about. So I kept them separate, as they were previously, and just changed the names to be more in line with the spec.

The other overloaded name in the spec is that the encode/decode functions use the same name to apply to either one or many inputs. I added a _Vec suffix to distinguish these cases.

This feels very cluttered to me with so many properties and conditional functions to handle just 2 algorithms in the spec. I welcome suggestions to reduce that mess.

- adds the bytedecode function - adds an inversion property about decode + encode - adds some properties about rewrites that I did for decoding

Also adds the relevant property

Also adds vector versions of encode/decode-12.

marsella · 2024-10-21T20:45:55Z

Primitive/Asymmetric/Cipher/ML_KEM/Specification.cry

+ByteDecode12 : [32 * 12]Byte -> [256](Z q)
+ByteDecode12 B = F' where


note: this function is never actually used alone, but I don't feel like it's reasonable skip it and only make ByteDecode12_Vec.

marsella · 2024-10-21T20:46:27Z

Primitive/Asymmetric/Cipher/ML_KEM/Specification.cry

- */
-EncodeBytes' : {ell, c} (fin ell, ell > 0, fin c) => [c * 8][ell] -> [c * ell]Byte
-EncodeBytes' = regroup
+ * Encode an array of `d`-bit integers into a byte array, for `d < 12`.


todo: I was also thinking of making all these private, since they're not part of the public API of ML-KEM.

deletes two functions that were previously used in encoding, and relocate one to the one place where it's used.

nlschimanski

The EncodeBytes and DecodeBytes implementations look quite a bit different from the spec, but the explanations for why that's the case help build confidence. I especially appreciate proving each of the claims that were made (e.g., multiplying by 2^j is the same as left shift).

nlschimanski · 2024-10-25T22:44:20Z

Primitive/Asymmetric/Cipher/ML_KEM/Specification.cry

- * This is used in some places where the `ByteDecode` function is required in
- * the spec. It's a 3D version of `DecodeBytes'`.
+ * The subtract-and-divide algorithm applied to `a` in `ByteEncode` is the
+ * same as shifting right.


I think it'd be helpful to add the step in the algorithm (5) where this comes up in the explanation.

nlschimanski · 2024-10-25T22:47:43Z

Primitive/Asymmetric/Cipher/ML_KEM/Specification.cry


 /**
- * This is used in some places where the `ByteEncode` function is required in
- * the spec. It's a 3D version of `EncodeBytes'`.
+ * Encode a set of `k` vectors of integers mod `q` into a byte array.


Is [FIPS-203] Section 2.4.8 relevant here like it is in ByteEncode_Vec?

Good catch, thanks.

nlschimanski · 2024-10-25T22:50:51Z

Primitive/Asymmetric/Cipher/ML_KEM/Specification.cry

- * :prove CorrectnessEncodeBytes
+ * :prove mod2IsFinalBit`{d_u}
+ * :prove mod2IsFinalBit`{d_v}
+ * :prove mod2IsFinalBit`{12}
 * ```


It might be helpful to add some text about why we care that mod 2 is the final bit as it relates to the EncodeBytes algorithm.

nlschimanski · 2024-10-25T22:56:01Z

Primitive/Asymmetric/Cipher/ML_KEM/Specification.cry

+    // Step 1.
+    b = BytesToBits B
+    // Steps 2-4. The `mod m` is implicit in the type because the `[d]` type
+    // always operates `mod 2^d`.


I paused for a second here and had to look back at the spec. Maybe be explicit about how m=2^d?

nlschimanski · 2024-10-25T23:05:42Z

Primitive/Asymmetric/Cipher/ML_KEM/Specification.cry


 /**
- * Proof that the efficient decode function is the same as the spec version.
+ * Multiplying a value by `2^^j` is the same as bit-shifting it left by `j`


Should 2^^j be 2^j in this line?

nlschimanski · 2024-10-25T23:12:30Z

Primitive/Asymmetric/Cipher/ML_KEM/Specification.cry

+ * ```repl
+ * :prove ByteEncodeInvertsByteDecode`{1}
+ * :prove ByteEncodeInvertsByteDecode`{d_u}
+ * :prove ByteEncodeInvertsByteDecode`{d_v}


Where does d_u and d_v come from?

Added an explanation here and everywhere I use these without comment in the doctests.

mccleeary-galois

LGTM as well, note that #166 applies here as well.

marsella added 7 commits October 21, 2024 10:26

mlkem: Add ByteEncode function for d < 12 #144

40a62f3

mlkem: Add ByteDecode for small d #144

4effe92

- adds the bytedecode function - adds an inversion property about decode + encode - adds some properties about rewrites that I did for decoding

mlkem: apply new byte encode/decode functions #144

72f9896

mlkem: add encode / decode for remaining case #144

19a2a77

Also adds the relevant property

mlkem: apply 12-bit encode/decode #144

ccee91d

Also adds vector versions of encode/decode-12.

mlkem: remove old encode/decode functions #144

62bef29

mlkem: improve docs on encode/decode #144

820991b

marsella force-pushed the 144-encoding branch from 2d917b4 to 820991b Compare October 21, 2024 20:26

mlkem: test properties w/ relevant parameters #144

f240c39

marsella changed the title ~~144 encoding~~ ML-KEM: Bring encoding functions in line with the spec Oct 21, 2024

marsella marked this pull request as ready for review October 21, 2024 20:44

marsella commented Oct 21, 2024

View reviewed changes

mlkem: (re)move unused conversion functions #144

32668f5

deletes two functions that were previously used in encoding, and relocate one to the one place where it's used.

marsella requested a review from nlschimanski October 23, 2024 16:59

marsella mentioned this pull request Oct 23, 2024

ML-KEM: Improve compression and byte conversion functions #161

Open

marsella linked an issue Oct 23, 2024 that may be closed by this pull request

Bring encoding and compression functions in ML-KEM up to gold standard #144

Open

11 tasks

nlschimanski reviewed Oct 25, 2024

View reviewed changes

mlkem: improve docs on encoding functions #144

875f925

nlschimanski approved these changes Oct 29, 2024

View reviewed changes

mlkem: move encoding functions to be private #144

2d41a3d

mccleeary-galois approved these changes Oct 31, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ML-KEM: Bring encoding functions in line with the spec #160

ML-KEM: Bring encoding functions in line with the spec #160

marsella commented Oct 21, 2024 •

edited

Loading

marsella Oct 21, 2024

marsella Oct 21, 2024

nlschimanski left a comment

nlschimanski Oct 25, 2024

nlschimanski Oct 25, 2024

marsella Oct 29, 2024

nlschimanski Oct 25, 2024

nlschimanski Oct 25, 2024

nlschimanski Oct 25, 2024

nlschimanski Oct 25, 2024

marsella Oct 29, 2024

mccleeary-galois left a comment

		ByteDecode12 : [32 * 12]Byte -> [256](Z q)
		ByteDecode12 B = F' where

ML-KEM: Bring encoding functions in line with the spec #160

Are you sure you want to change the base?

ML-KEM: Bring encoding functions in line with the spec #160

Conversation

marsella commented Oct 21, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

nlschimanski left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mccleeary-galois left a comment

Choose a reason for hiding this comment

marsella commented Oct 21, 2024 •

edited

Loading