EIP4844: Add check to ensure Blobs are canonical #3057

kevaundray · 2022-10-23T19:29:17Z

Problem

When we receive a blob as a sequence of bytes and interpret it as a integer mod p, we do not check that the byte representation is canonical.

Example

Lets say p is 5.

If I have a byte array b1 which encodes the integer 2 and a byte array b2 which encodes the integer 7. When I convert both byte arrays to an integer mod 5, they will both produce the value 2.

This can be a problem because two different blobs will produce the same commitment.

Solution

Check that the integer mod p when converted back to a byte array, does indeed produce the original byte array.

The text was updated successfully, but these errors were encountered:

Inphi · 2022-10-24T14:22:15Z

This is sorta specified in the BLSFieldElement type description used to represent each point in a blob. While it should also be explicitly done in the executable spec, it'll depend on client implementations where to make such checks. Since the blobs are sourced from execution, clients can only trust that blobs are encoded correctly.

kevaundray · 2022-10-24T16:21:42Z

This is sorta specified in the BLSFieldElement type description used to represent each point in a blob. While it should also be explicitly done in the executable spec, it'll depend on client implementations where to make such checks. Since the blobs are sourced from execution, clients can only trust that blobs are encoded correctly.

After 3038, is it possible that these checks are moved solely to the cryptography functions that create and verify proofs?

Inphi · 2022-10-24T22:04:59Z

After 3038, is it possible that these checks are moved solely to the cryptography functions that create and verify proofs?

That won't do since the blob encoding is up to the user. For example, Blobs could be packed tightly where unused bits in the field element could be used to encode the next byte. This is a non-canonical but valid usecase.
Since encoding isn't context-free, only the user/encoder will be able to determine whether the blobs should be canonical.

kevaundray · 2022-10-24T23:38:23Z

After 3038, is it possible that these checks are moved solely to the cryptography functions that create and verify proofs?

That won't do since the blob encoding is up to the user. For example, Blobs could be packed tightly where unused bits in the field element could be used to encode the next byte. This is a non-canonical but valid usecase.

Since encoding isn't context-free, only the user/encoder will be able to determine whether the blobs should be canonical.

Ah I didn't know that, so blobs do not need to be canonical and in fact this should not be checked by the cryptography code since it doesn't have the context to decide this.

Can you explain why it would not be a problem, if two different blobs, A and B, produced the same commitment, where in both cases the encoder is expecting a non-canonical blob?

In particular, what context can one use to determine that blob A is the correct non-canonical blob?

Inphi · 2022-10-25T14:09:05Z

Yup, the cryptography code shouldn't check the encoding.

If two pieces of data, A and B, generate the same blob and thus commitment, then that's a problem with the encoding. I'd attribute that to User Error and not a real problem that can be solved by the specs.

Inphi · 2022-10-25T14:10:43Z

What we're trying to solve with "Data Availability" is really "Blob Availability". (and maybe we should rename it for accuracy :-)

protolambda · 2022-10-25T15:23:03Z

I don't like the idea of quietly applying a modulus or bit truncation on input that's otherwise invalid, +1 on making the the input validation on crypto functions strict. The cryptography code maybe shouldn't have the responsibility to actually perform the checks if we can demand valid inputs from the user before the crypto function is called, but we should make the specs strict on input.

kevaundray · 2022-11-03T15:05:57Z

Closing as #3038 has been merged

kevaundray mentioned this issue Oct 30, 2022

EIP4844: Update cryptography API and Fiat-Shamir logic #3038

Merged

kevaundray closed this as completed Nov 3, 2022

xrchz mentioned this issue Nov 25, 2022

check for non-canonical field element representations ethereum/c-kzg-4844#11

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

EIP4844: Add check to ensure Blobs are canonical #3057

EIP4844: Add check to ensure Blobs are canonical #3057

kevaundray commented Oct 23, 2022

Inphi commented Oct 24, 2022

kevaundray commented Oct 24, 2022

Inphi commented Oct 24, 2022

kevaundray commented Oct 24, 2022

Inphi commented Oct 25, 2022

Inphi commented Oct 25, 2022

protolambda commented Oct 25, 2022

kevaundray commented Nov 3, 2022

EIP4844: Add check to ensure Blobs are canonical #3057

EIP4844: Add check to ensure Blobs are canonical #3057

Comments

kevaundray commented Oct 23, 2022

Problem

Solution

Inphi commented Oct 24, 2022

kevaundray commented Oct 24, 2022

Inphi commented Oct 24, 2022

kevaundray commented Oct 24, 2022

Inphi commented Oct 25, 2022

Inphi commented Oct 25, 2022

protolambda commented Oct 25, 2022

kevaundray commented Nov 3, 2022