feat: blobs. #9302

MirandaWood · 2024-10-21T13:17:14Z

The Blobbening

It's happening and I can only apologise.

Follows #8955.

Intro

More detailed stuff below, but the major changes are:

Publish DA through blobs rather than calldata. This means:
- No more txs effects hashing
- Proving that our effects are included in a blob (see below for details) using base -> root rollup circuits
- Accumulating tx effects in a blob sponge (SpongeBlob)
- Building blocks by first passing a hint of how many tx effects the block will have
- Updating forge to handle blobs
- Tests for all the above

Major Issues

Things that we should resolve before merging:

Run times massively increased:
- This is largely because the nr code for blobs is written with the BigNum lib, which uses a lot of unconstrained code then a small amount of constrained code to verify results. Unfortunately this means we cannot simulate circuits containing blobs (currently block-root) using wasm or set nr tests to unconstrained because that causes a stack overflow in brillig.
- To avoid straight up failures, I've created nr tests which are not unconstrained (meaning rollup-lib tests take 10mins or so to run) and I've forced circuit simulation to run in native ACVM rather than wasm (adding around 1min to any tests that simulate block-root).
- Yes, there's more! All the above is happening while we only create one blob per block. This is definitely not enough space (we aim for 3 per block), but I imagine tripling the blob nr code would only cause more runtime issues.
~~Data retrieval~~ The below will be done in feat: Integrate beacon chain client/web2 blob getter #9101, and for now we use calldata just to keep the archiver working:
- The current (interim) solution is to still publish the same block body calldata as before, just so the archiver actually runs. This calldata is no longer verified with the txs effects hash, but is checked (in ts) against the known blob hash, so a mismatch will still throw.
- The actual blob contents will look different to the body calldata since we will be tightly packing effects and adding length markers before each section (like how log lengths work). I've added to/from methods to aid conversion in data-retrieval to use.
~~Blob verification precompile gas~~ Batching blob KZG proofs is being thought about (see Epic: Blobs #8955 for progression):
- The current approach to verify that the published blob matches the tx effects coming from the rollup is to call the point evaluation precompile for each blob. This costs 50k gas each time, so is not sustainable.
- We think it's possible to accumulate the KZG proofs used to validate blobs into one. Mike is thinking about this and whether it's doable using nr, so we can call the precompile once per epoch rather than 3 times per block.

General TODOs

Things I'm working on:

Moving from 1 to 3 blobs per block
- This will slow everything down massively so I'd prefer to solve the runtime issues before tackling this.
- It's also going to be relatively complex, because the base rollup will need code to fill from one of three SpongeBlob instances and will need to know how to 'jump' from one full blob to the next at any possible index. Hopefully this does not lead to a jump in gates.

Description

The general maths in nr and replicated across foundation/blob is described here.

Old DA Flow

From the base rollup to L1, the previous flow for publishing DA was:

Nr:

In the base rollup, take in all tx effects we wish to publish and sha256 hash them to a single value: tx_effects_hash
This value is propogated up the rollup to the next merge (or block-root) circuit
Each merge or block-root circuit simply sha256 hashes each 'child' tx_effects_hash from its left and right inputs
Eventually, at block-root, we have one value: txs_effects_hash which becomes part of the header's content commitment

Ts:

The txs_effects_hash is checked and propogated through the orchestrator and becomes part of the ts class L2Block in the header
The actual tx effects to publish become the L2Block's .body
The publisher sends the serialised block body and header to the L1 block propose function

Sol:

In propose, we decode the block body and header
The body is deconstructed per tx into its tx effects and then hashed using sha256, until we have N tx_effects_hashes (mimicing the calculation in the base rollup)
Each tx_effects_hash is then input as leaves to a wonky tree and hashed up to the root (mimicing the calculation from base to block-root), forming the final txs_effects_hash
This final value is checked to be equal to the one in the header's content commitment, then stored to be checked against for validating data availability
*Later, when verifying a rollup proof, we use the above header values as public inputs. If they do not match what came from the circuit, the verification fails.

*NB: With batch rollups, I've lost touch with what currently happens at verification and how we ensure the txs_effects_hash matches the one calculated in the rollup, so this might not be accurate.

New DA Flow

The new flow for publishing DA is:

Nr:

In the base rollup, we treat tx effects as we treat PartialStateReferences - injecting a hint to the start and end state we expect from processing this base's transaction
We take all the tx effects to publish and absorb them into the given start SpongeBlob state. We then check the result is the same as the given end state
Like with PartialStateReferences, each merge or block-root checks that the left input's end blob state is equal to the right input's start blob state
Eventually, at block-root, we check the above and that the left's start blob state was empty. Now we have a sponge which has absorbed, as a flat array, all the tx effects in the block we wish to publish
We inject the flat array of effects as a private input, along with the ts calculated blob commitment, and pass them and the sponge to the blob function
The blob function:
- Poseidon hashes the flat array of effects, and checks this matches the accumulated sponge when squeezed (this confirms that the flat array is indeed the same array of tx effects propogated from each base)
- Computes the challenge z by hashing this ^ hash with the blob commitment
- Evaluates the blob polynomial at z using the flat array of effects in the barycentric formula (more details on the engineering design link above), to return y
The block-root adds this triple (z, y, and commitment C) to a new array of BlobPublicInputs
*Like how we handle fees, each block-merge and root merges the left and right input arrays, so we end up with an array of each block's blob info

*NB: this will likely change to accumulating to a single set of values, rather than one per block, and is being worked on by Mike. The above also describes what happens for one blob per block for simplicity (it will actually be 3).

Ts:

The BlobPublicInputs are checked against the ts calculated blob for each block in the orchestrator
They form part of a serialised array of bytes called blobInput (plus the expected L1 blobHash and a ts generated KZG proof) sent to L1 to the propose function
The propose transaction is now a special 'blob transaction' where all the tx effects (the same flat array as dealt with in the rollup) are sent as a sidecar
*We also send the serialised block body, so the archiver can still read the data back until feat: Integrate beacon chain client/web2 blob getter #9101

*NB: this will change once we can read the blobs themselves from the beacon chain/some web2 client.

Sol:

In propose, instead of recalcating the txs_effects_hash, we send the blobInput to a new validateBlob function. *This function:
- Gathers the real blobHash from the EVM and checks it against the one in blobInput
- Calls the point evaluation precompile and checks that our z, y, and C indeed correspond to the blob we claim
We now have a verified link between the published blob and our blobInput, but still need to link this to our rollup circuit:
- Each set of BlobPublicInputs is extracted from the bytes array and stored against its block number
- When the root proof is verified, we reconstruct the array of BlobPublicInputs from the above stored values and use them in proof verification
- If any of the BlobPublicInputs are incorrect (equivalently, if any of the published blobs were incorrect), the proof verification will fail
To aid users/the archiver in checking their blob data matches a certain block, the EVM blobHash is been added to BlockLog once it has been verified by the precompile

*NB: As above, we will eventually call the precompile just once for many blobs with one set of BlobPublicInputs. This will still be used in verifying the root proof to ensure the tx effects match those from each base.

…st working

yarn-project/sequencer-client/src/publisher/l1-publisher.ts

MirandaWood · 2024-11-05T14:49:07Z

l1-contracts/src/core/Rollup.sol

    require(
-      _flags.ignoreDA || _header.contentCommitment.txsEffectsHash == _txsEffectsHash,
-      Errors.Rollup__UnavailableTxs(_header.contentCommitment.txsEffectsHash)
+      _flags.ignoreDA || 1 == 1, // _header.blobHash == _blobHash,


@ TMNT team: here we need to check the data has been published. I could add the EVM's blobHash to the block header (we can calculate it in ts in advance) and uncomment the check?
However it could be doable without adding more fields to the header by having some isAvailable mapping filled once _validateBlob is called. Not sure which is preferable based on the team's priorities!

MirandaWood · 2024-11-05T14:50:16Z

noir-projects/noir-protocol-circuits/crates/rollup-lib/src/components.nr

    offset += 1;

    // TX FEE
-    tx_effects_hash_input[offset] = transaction_fee;
+    // TODO(Miranda): how many bytes do we expect tx fee to be? Using 29 for now


@ TMNT (or any) team: is there a restriction anywhere on the size of the tx fee?

github-actions · 2024-11-11T11:38:32Z

Changes to circuit sizes

Generated at commit: 7f91171905029680872ceb1213903eba65ddf07b, compared to commit: 8f41beef025b7c28fe09bacdf5fe29d1142a5d74

🧾 Summary (100% most significant diffs)

Program	ACIR opcodes (+/-)	%	Circuit size (+/-)	%
rollup_block_root	+184,452 ❌	+4108.98%	+465,637 ❌	+16.26%
rollup_block_root_empty	+192 ❌	+204.26%	+192 ❌	+6.64%
rollup_root	+1,663 ❌	+13.87%	+15,519 ❌	+0.80%
rollup_block_merge	+1,663 ❌	+13.85%	+15,519 ❌	+0.80%
rollup_merge	-1,381 ✅	-40.39%	-7,842 ✅	-0.41%
rollup_base_public	-33,366 ✅	-7.06%	-710,848 ✅	-18.77%
rollup_base_private	-33,171 ✅	-9.90%	-710,265 ✅	-20.58%

Full diff report 👇

Program	ACIR opcodes (+/-)	%	Circuit size (+/-)	%
rollup_block_root	188,941 (+184,452)	+4108.98%	3,328,889 (+465,637)	+16.26%
rollup_block_root_empty	286 (+192)	+204.26%	3,083 (+192)	+6.64%
rollup_root	13,652 (+1,663)	+13.87%	1,956,425 (+15,519)	+0.80%
rollup_block_merge	13,668 (+1,663)	+13.85%	1,956,439 (+15,519)	+0.80%
rollup_merge	2,038 (-1,381)	-40.39%	1,901,604 (-7,842)	-0.41%
rollup_base_public	438,963 (-33,366)	-7.06%	3,076,106 (-710,848)	-18.77%
rollup_base_private	301,891 (-33,171)	-9.90%	2,740,337 (-710,265)	-20.58%

TomAFrench · 2024-11-15T11:31:09Z

noir-projects/noir-protocol-circuits/crates/blob/Nargo.toml

 authors = [""]
 compiler_version = ">=0.30.0"

 [dependencies]
-bigint = {tag = "v0.3.4", git = "https://github.com/noir-lang/noir-bignum" }
+bigint = {tag = "v0.4.0", git = "https://github.com/noir-lang/noir-bignum" }


Suggested change

bigint = {tag = "v0.4.0", git = "https://github.com/noir-lang/noir-bignum" }

bigint = {tag = "v0.4.1", git = "https://github.com/noir-lang/noir-bignum" }

v0.4.1 should have much better brillig output.

MirandaWood added 11 commits September 25, 2024 16:56

feat: first blobs commit - ts + nr only - overflow err

d2544a9

feat: hash blob in chunks, force native sim for block-root

9cefc9a

feat: add ts blob class, use real blob in rollup, more tests

df93cf5

Merge remote-tracking branch 'origin' into mw/blob-circuit

c5539eb

feat: post merge fixes, add sponge to epoch prover, cleanup

5e92023

fix: more merge fixes + cleanup

1d17855

feat: purge txseffectshash from nr and ts

aff5e35

feat: publish blobs to L1, verify, test, a lot of cleanup

dfe722b

chore: fmt, test refactors + fixes

bc0678c

Merge remote-tracking branch 'origin' into mw/blob-circuit

ad090f6

fix: multiple post merge fixes, refactor err logging, prover-coord te…

bf98bbc

…st working

MirandaWood commented Oct 24, 2024

View reviewed changes

yarn-project/sequencer-client/src/publisher/l1-publisher.ts Outdated Show resolved Hide resolved

MirandaWood added 14 commits October 24, 2024 13:05

Merge remote-tracking branch 'origin' into mw/blob-circuit

2cd7570

feat: reinstate bignum code after fix, post merge fixes and fmt

8a8dcab

Merge remote-tracking branch 'origin' into mw/blob-circuit

cc01696

chore: revert e2e prover coord fix in favour of master

6f92b1f

feat: added custom existing sponge absorber to save gates, fmt

233d3fd

feat: tightly pack blobs, encode and decode with prefixes

7c1749d

Merge remote-tracking branch 'origin' into mw/blob-circuit

247a49d

chore: post merge fixes, formatting

8f3d10f

fix: apply fix for public processor state (PR9634)

20674cf

Merge remote-tracking branch 'origin' into mw/blob-circuit

38725f8

chore: update fixtures + fmt

77dcf8e

feat: add blob check override, simulate txs

580b1b2

Merge remote-tracking branch 'origin' into mw/blob-circuit

2166780

feat: update bignum, fmt, comments, small fixes

99ac96e

MirandaWood requested review from LeilaWang and iAmMichaelConnor November 5, 2024 14:38

MirandaWood commented Nov 5, 2024

View reviewed changes

MirandaWood added 2 commits November 8, 2024 10:49

chore: post merge fixes, fmt

5b80e50

feat: add blob hash to header

079aef8

MirandaWood requested a review from LHerskind November 8, 2024 17:01

MirandaWood added 2 commits November 8, 2024 17:05

chore: cleanup, remove unused tat

23092c6

refactor: block builders now add all txs at once

ad7bb09

MirandaWood requested a review from LeilaWang November 8, 2024 19:52

Merge remote-tracking branch 'origin' into mw/blob-circuit

fd6b875

MirandaWood removed request for fcarreiro, IlyasRidhuan and jeanmon November 8, 2024 19:59

MirandaWood added 4 commits November 11, 2024 09:14

Merge remote-tracking branch 'origin' into mw/blob-circuit

a367e6f

chore: post merge fixes, fmt, fixtures

5e506ce

feat: delete neg_roots constants, calc instead, small runtime gains

a418c58

Merge remote-tracking branch 'origin' into mw/blob-circuit

c06c2ef

MirandaWood added the e2e-all CI: Enables this CI job. label Nov 11, 2024

MirandaWood added 5 commits November 12, 2024 10:50

feat: save some gates in sponge absorbing

46d2b3d

Merge remote-tracking branch 'origin' into mw/blob-circuit

3969035

chore: post merge fixes, fmt, fxitures

f4267c5

feat: save more gates

0e9b2c8

Merge remote-tracking branch 'origin' into mw/blob-circuit

96157d0

FilipHarald mentioned this pull request Nov 13, 2024

feat: support aztec blob-structure aztlan-labs/chicmoz#218

Open

4 tasks

MirandaWood added 5 commits November 13, 2024 12:01

Merge remote-tracking branch 'origin' into mw/blob-circuit

fcbc6ce

chore: update constants post merge

bcf452e

chore: rename tofields -> toBlobFields, remove unused code

9a67624

Merge remote-tracking branch 'origin' into mw/blob-circuit

76ccac1

chore: include contract class logs in blob, some renaming

526b4c6

This comment was marked as off-topic.

Sign in to view

MirandaWood mentioned this pull request Nov 15, 2024

Blobs: revert to constant roots of unity rather than unconstrained #9982

Open

TomAFrench reviewed Nov 15, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: blobs. #9302

feat: blobs. #9302

MirandaWood commented Oct 21, 2024 •

edited

Loading

MirandaWood Nov 5, 2024

MirandaWood Nov 5, 2024

github-actions bot commented Nov 11, 2024 •

edited

Loading

This comment was marked as off-topic.

TomAFrench Nov 15, 2024

	bigint = {tag = "v0.4.0", git = "https://github.com/noir-lang/noir-bignum" }
	bigint = {tag = "v0.4.1", git = "https://github.com/noir-lang/noir-bignum" }

feat: blobs. #9302

Are you sure you want to change the base?

feat: blobs. #9302

Conversation

MirandaWood commented Oct 21, 2024 • edited Loading

The Blobbening

Intro

Major Issues

General TODOs

Description

Old DA Flow

New DA Flow

MirandaWood Nov 5, 2024

Choose a reason for hiding this comment

MirandaWood Nov 5, 2024

Choose a reason for hiding this comment

github-actions bot commented Nov 11, 2024 • edited Loading

Changes to circuit sizes

🧾 Summary (100% most significant diffs)

This comment was marked as off-topic.

TomAFrench Nov 15, 2024

Choose a reason for hiding this comment

MirandaWood commented Oct 21, 2024 •

edited

Loading

github-actions bot commented Nov 11, 2024 •

edited

Loading