perf: reduce memory use #57

joonazan · 2023-12-15T15:55:46Z

What ❔

Makes it possible to reduce peak memory use by computing circuits incrementally.

The API between this crate and the witness generator changes significantly. Instead of returning circuits, this crate now takes callbacks, which it calls whenever a new circuit is ready. It could take a channel directly instead, but I feel that might not be generic enough.
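To illustrate the shape of that change, here is a minimal Python sketch (the crate itself is Rust). All names (`build_circuit`, `run_returning`, `run_with_callback`) are hypothetical and not the crate's API. The last lines show that the sending end of a channel can simply be passed as the callback, which is one way to read the "more generic" argument.

```python
import queue
from typing import Callable, Iterable, List


def build_circuit(circuit_input: int) -> str:
    # Hypothetical stand-in for turning one circuit input into a circuit.
    return f"circuit-{circuit_input}"


def run_returning(inputs: Iterable[int]) -> List[str]:
    # Old shape: every circuit is built and held until the function returns.
    return [build_circuit(i) for i in inputs]


def run_with_callback(inputs: Iterable[int], on_circuit: Callable[[str], None]) -> None:
    # New shape: each circuit is handed to the caller as soon as it is ready.
    for i in inputs:
        on_circuit(build_circuit(i))


# A plain function (here: list.append) works as a callback ...
collected: List[str] = []
run_with_callback(range(3), collected.append)

# ... and so does the sending end of a channel.
channel: "queue.Queue[str]" = queue.Queue()
run_with_callback(range(3), channel.put)
```

With a callback, the caller decides whether to collect, forward, or serialize each circuit; the producer does not need to know.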

This PR incrementalizes just one type of circuit to make sure that it is possible.

It also removes unnecessary data structures, clones, etc., which reduces memory use as well.

Why ❔

We have to improve memory use to be able to process larger batches on reasonable computers.

The new API allows emitting circuits as soon as possible instead of keeping around circuit inputs till the end and then building all of them. I measured that the vast majority of memory goes into storing the inputs of RAM circuits and MainVM circuits. Fixing those could solve all our memory trouble.
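The effect on peak memory can be sketched with a toy model. This is an assumption-laden illustration, not a measurement of the crate: it counts how many inputs are held at once rather than bytes, and the function names are hypothetical.

```python
from typing import Callable, List


def peak_inputs_batch(num_circuits: int) -> int:
    # Batch shape: collect every input first, then build all circuits at the end.
    held: List[int] = []
    peak = 0
    for i in range(num_circuits):
        held.append(i)
        peak = max(peak, len(held))
    _circuits = [f"circuit-{x}" for x in held]  # built only after all inputs exist
    return peak


def peak_inputs_incremental(num_circuits: int, on_circuit: Callable[[str], None]) -> int:
    # Incremental shape: build and emit each circuit, then drop its input.
    held: List[int] = []
    peak = 0
    for i in range(num_circuits):
        held.append(i)
        peak = max(peak, len(held))
        on_circuit(f"circuit-{held.pop()}")
    return peak
```

In this model the batch version holds as many inputs as there are circuits, while the incremental version holds one at a time. A real circuit type may need more than one input in flight, so the actual saving depends on the circuit.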

Even with every circuit type incrementalized, an array of all ClosedFormInputCompactFormWitnesses is collected. I don't know how hard it is to get rid of that or if it is big enough to be problematic.

The destination of the `mem::replace` call isn't used later, so setting it to an empty Vec does nothing.

Still needs further cleanup. This version doesn't reduce memory use much; the focus here is on changing the API. I had to hardcode GoldilocksField instead of parameterizing everything by field. The required trait bounds were ridiculous and caused type checking to take 30 minutes.

I had assumed that InstanceWitness -> circuit type is a proper function. It is not.
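In other words, knowing only the witness type does not determine the circuit type. The commit titles in this PR suggest that the events sorter and the messages sorter are two circuits fed by the same kind of witness. A toy Python sketch of that situation follows; the circuit and witness names are hypothetical placeholders, not the crate's real types.

```python
from collections import defaultdict
from typing import Dict, List, Set, Tuple

# Hypothetical (circuit name, witness type name) pairs; not the crate's real types.
CIRCUITS: List[Tuple[str, str]] = [
    ("main_vm", "VmWitness"),
    ("events_sorter", "SorterWitness"),
    ("messages_sorter", "SorterWitness"),
]


def circuits_by_witness(pairs: List[Tuple[str, str]]) -> Dict[str, Set[str]]:
    # Group circuit names by the witness type they consume.
    mapping: Dict[str, Set[str]] = defaultdict(set)
    for circuit, witness in pairs:
        mapping[witness].add(circuit)
    return dict(mapping)


def ambiguous_witnesses(pairs: List[Tuple[str, str]]) -> Dict[str, Set[str]]:
    # Witness types that map to more than one circuit, so "witness -> circuit" is not a function.
    return {w: cs for w, cs in circuits_by_witness(pairs).items() if len(cs) > 1}
```

Whenever `ambiguous_witnesses` is non-empty, the circuit type has to travel alongside the witness instead of being derived from it.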

joonazan · 2024-01-25T16:45:26Z

I get the same failures and timeouts when running the tests before and after this PR, so I guess it passes...

shamatar · 2024-01-25T16:46:50Z

We should roll it out on staging, generate the witness for a full block with both versions, and compare the results (those are serializable structures, so comparing bytes is enough).

joonazan · 2024-01-25T17:15:37Z

@shamatar A note about comparing witnesses: simple diffing will indicate they have changed because the circuits are in a valid but different order. I used the following script to diff circuits.

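import filecmp
import os

def file_order(filename):
    # File names start with three "_"-separated integers; sort by first, third, second.
    a, b, c, *_rest = os.path.basename(filename).split("_")
    return (int(a), int(c), int(b))

def sorted_files(directory):
    # Full paths of the files in `directory`, ordered by file_order.
    return [os.path.join(directory, name) for name in sorted(os.listdir(directory), key=file_order)]

reference = sorted_files("reference_artifacts/prover_jobs_fri")
current = sorted_files("artifacts/prover_jobs_fri")
if len(reference) != len(current):
    print("file count differs", len(reference), len(current))
for a, b in zip(reference, current):
    # shallow=False forces a byte-by-byte comparison.
    if not filecmp.cmp(a, b, shallow=False):
        print("mismatch", a, b)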
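As a complementary check, the two directories can also be compared purely by content, ignoring file names and order. This is a sketch of a different technique from the script above, with hypothetical function names. It is weaker in one respect: it would not notice a circuit stored under the wrong file name. It also reads each file fully into memory.

```python
import hashlib
import os
from collections import Counter


def content_digests(directory: str) -> Counter:
    # Multiset of SHA-256 digests of every regular file in `directory`.
    digests: Counter = Counter()
    for name in os.listdir(directory):
        path = os.path.join(directory, name)
        if os.path.isfile(path):
            with open(path, "rb") as f:
                digests[hashlib.sha256(f.read()).hexdigest()] += 1
    return digests


def same_contents(dir_a: str, dir_b: str) -> bool:
    # True when both directories hold the same file contents, regardless of names.
    return content_digests(dir_a) == content_digests(dir_b)
```

Usage would look like `same_contents("reference_artifacts/prover_jobs_fri", "artifacts/prover_jobs_fri")`.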

We'll do comprehensive testing next week. I'm unavailable tomorrow.

I am very confident that the output of this version is identical to 1.4.1 in all other respects.

## What ❔

Uses an [updated zkevm_test_harness](matter-labs/era-zkevm_test_harness#57) that uses less memory. The updated crate takes callback functions instead of returning a vector of all results. We may have to think about how to not keep the database open the whole time the callbacks are coming in.

## Why ❔

Proving batches currently uses memory proportional to the batch size, which is not ok for large batches.

---------

Co-authored-by: Fedor Sakharov <fedor.sakharov@gmail.com>
Co-authored-by: AntonD3 <74021421+AntonD3@users.noreply.github.com>
Co-authored-by: Stanislav Breadless <stanislavbezkor@gmail.com>
Co-authored-by: perekopskiy <53865202+perekopskiy@users.noreply.github.com>
Co-authored-by: pompon0 <pompon.pompon@gmail.com>
Co-authored-by: Dustin Brickwood <dustinbrickwood204@gmail.com>
Co-authored-by: Igor Borodin <hatemosphere@protonmail.com>

joonazan mentioned this pull request Dec 15, 2023

perf: reduce memory consumption of witness generation matter-labs/zksync-era#696

Merged

joonazan force-pushed the jms-fix-memory-use branch 4 times, most recently from 9b2bcc9 to 997463c January 12, 2024 08:59

joonazan force-pushed the jms-fix-memory-use branch 3 times, most recently from a8a88e9 to 70351ec January 24, 2024 18:02

joonazan added 12 commits January 25, 2024 17:23

remove unused and temporary fields from FullBlockArtifacts

b5e42fa

remove unnecessary mem::replace

e7e5c9b

The destination of replace isn't used later, so setting it to an empty Vec does nothing.

return iterators instead of Vecs

64ab2be

only compile tests when testing

3eb45b7

fix lifetime problems

6675fbd

remove patch, as main boojum is fixed

fdf33c2

return compact form witnesses in recursion queue callback

2dd34f9

return circuit id along with recursion queue

23ba6fd

fix: disambiguate events and messages sorter circuits

98c5183

I had assumed that InstanceWitness -> circuit type is a proper function. It is not.

inline everything into create_artifacts_from_tracer

8eb8d31

make storage application circuits incrementally

f0c8a61

joonazan force-pushed the jms-fix-memory-use branch from 70351ec to a8e3b26 January 25, 2024 10:24

joonazan changed the base branch from v1.4.0 to v1.4.1 January 25, 2024 10:24

joonazan marked this pull request as ready for review January 25, 2024 16:43

joonazan requested review from jules and shamatar January 25, 2024 16:43

joonazan added 2 commits January 26, 2024 00:23

make vm_memory_queue_states local

16bf699

make tests compile

1050071

joonazan force-pushed the jms-fix-memory-use branch from fc3e302 to 1050071 January 25, 2024 17:24

expose compute_setups again

cd639c4

joonazan force-pushed the jms-fix-memory-use branch from 1713a78 to cd639c4 January 25, 2024 17:56

use u64 for circuit type as before

98eee54

joonazan merged commit badf56d into v1.4.1 Jan 31, 2024
4 checks passed

mm-zk deleted the jms-fix-memory-use branch April 5, 2024 12:54
