[BUG] - dumping ledger state takes humungous amounts of memory #3691

angerman · 2022-03-08T04:08:26Z

Internal
Internal if an IOHK staff member.

Area
Other Any other topic (Delegation, Ranking, ...).

Summary
Dumping the ledger state on macOS takes a humungous amount of memory.

Steps to reproduce
On a mac, start a cardano-node instance, then use cardano-cli to dump the ledger state on mainnet.
Observe cardano-node taking ~12G of memory, and cardano-cli another ~30G.

Expected behavior
Hopefully stay within available system memory.

ashisherc · 2022-04-05T14:10:40Z

The community has been asking for this for a long time. It would be best to be able to query specifics from the ledger state dump, e.g. only the go snapshot. I have heard of some technical limitations with it, but we need the specific queries sooner: the memory consumption of this dump is already skyrocketing.

Jimbo4350 · 2022-10-27T08:47:55Z

Closing this. If this is still relevant please reopen.

ashisherc · 2022-10-27T16:03:25Z

It is still the same issue, did I miss any PR that updates this?

AndrewWestberg · 2022-10-27T19:41:37Z

@Jimbo4350 Please re-open this one. It's still a major issue.

Jimbo4350 · 2022-10-29T18:31:06Z

@newhoggy do any of your open PRs address this issue? If they do, please link the PR here.

AndrewWestberg · 2022-11-14T21:32:24Z

Just a nice FYI... the ledger state dump in 1.35.4 on mainnet just went over the size that can be indexed by a normal signed 32-bit integer (2^31-1). I now have to rewrite a bunch of code since I can no longer parse the ledger state cbor in memory. Many programming languages use INT values to index byte arrays.

If you're parsing the ledger state cbor this way, please check your code. I'm not sure how much time they're going to give us before 1.35.4 is pushed out as a hard-fork requirement.

@JaredCorduan @disassembler
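
As an illustration of the failure mode described above, here is a minimal sketch of what happens to a byte offset held in a signed 32-bit integer once it passes 2^31-1. The helper `as_int32` is a hypothetical name used only for this example; it mimics two's-complement wraparound and is not taken from any CBOR library.

```python
INT32_MAX = 2**31 - 1  # 2147483647, the largest signed 32-bit value


def as_int32(value):
    """Interpret an integer as a wrapped two's-complement signed 32-bit value."""
    return (value + 2**31) % 2**32 - 2**31


# The last offset that a signed 32-bit index can address.
last_valid = as_int32(INT32_MAX)

# One byte further, the offset wraps around and becomes negative.
wrapped = as_int32(INT32_MAX + 1)

print(last_valid, wrapped)
```

A negative offset like this is what typically surfaces as an out-of-bounds error in parsers that track their position with a 32-bit INT.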

AndrewWestberg · 2022-11-14T23:49:38Z

Not sure how to fix this without rewriting the guts of the Google cbor parser. The internal byte array is over 2 GB, so it overflows the integer array index.

kevinhammond · 2022-11-15T00:01:02Z

I suspect this isn't specific to 1.35.4 (the state might be a little smaller in 1.35.3). If you use unsigned integers for indexing, you can presumably get to 4 GB.

AndrewWestberg · 2022-11-15T03:07:14Z

@kevinhammond The state is slightly smaller in 1.35.3, as it hasn't broken there (yet). Given that we're actively upgrading to 1.35.4 on mainnet, I'll have to implement a workaround soon. Right now, the solution is to implement arrays of arrays in the Google cbor library I'm using. It's painful, but it's the only option I have for now. Unsigned indexes aren't allowed in JVM languages.
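
The "arrays of arrays" idea can be sketched roughly as follows. This is only an illustration of the index split, not code from the Google cbor library: the class name `ChunkedBytes` and the `chunk_size` parameter are hypothetical. Python integers do not overflow, so the sketch shows the addressing scheme only; on the JVM the outer offset would be a `long` and each chunk a regular `byte[]`.

```python
class ChunkedBytes:
    """Byte store addressed by one large offset, backed by fixed-size chunks."""

    def __init__(self, chunk_size=1 << 30):
        if chunk_size <= 0:
            raise ValueError("chunk_size must be positive")
        self.chunk_size = chunk_size
        self.chunks = []   # list of bytearray, each at most chunk_size long
        self.length = 0    # total number of bytes stored

    def append(self, data):
        """Append bytes, spilling into a new chunk whenever the last one is full."""
        view = memoryview(data)
        while len(view):
            if not self.chunks or len(self.chunks[-1]) == self.chunk_size:
                self.chunks.append(bytearray())
            room = self.chunk_size - len(self.chunks[-1])
            self.chunks[-1] += view[:room]
            self.length += min(room, len(view))
            view = view[room:]

    def __len__(self):
        return self.length

    def __getitem__(self, offset):
        """Split the large offset into (chunk number, offset inside the chunk)."""
        if not 0 <= offset < self.length:
            raise IndexError("offset out of range")
        outer, inner = divmod(offset, self.chunk_size)
        return self.chunks[outer][inner]


# Tiny chunks so the split is visible; real chunks would be around 1 GiB.
buf = ChunkedBytes(chunk_size=4)
buf.append(bytes(range(10)))
print(len(buf), len(buf.chunks), buf[4], buf[9])
```

Each inner index stays below `chunk_size`, so it always fits in a signed 32-bit integer even when the total length does not.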

We really do need piecemeal queries for all this stuff. I believe db-sync is still using this monolithic ledger state dump as well.

papacarp · 2022-11-16T16:12:16Z

1.35.3 was already over the 2G limit when I tested yesterday.

-rw-------  1 ubuntu ubuntu 1952594762 Nov 11 21:58 ls375.cbor
-rw-------  1 ubuntu ubuntu 2167014930 Nov 15 18:15 lst375.cbor

The Python cbor2 library still parses it just fine.
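
For reference, the two byte counts in the listing above can be checked against the 32-bit limits discussed in this thread. This is plain arithmetic on the numbers as posted; the function name `index_report` is hypothetical.

```python
INT32_MAX = 2**31 - 1    # largest length addressable with a signed 32-bit index
UINT32_MAX = 2**32 - 1   # largest length addressable with an unsigned 32-bit index

# Byte counts copied from the file listing above.
sizes = {"ls375.cbor": 1952594762, "lst375.cbor": 2167014930}


def index_report(size):
    """Compare a file size in bytes with the signed and unsigned 32-bit limits."""
    return {
        "fits_signed_32": size <= INT32_MAX,
        "bytes_over_signed_32": max(0, size - INT32_MAX),
        "headroom_unsigned_32": UINT32_MAX - size,
    }


reports = {name: index_report(size) for name, size in sizes.items()}
for name, report in reports.items():
    print(name, report)
```

The smaller file still fits a signed 32-bit length, the larger one does not, and both are well below the unsigned 32-bit limit mentioned earlier.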

newhoggy · 2023-02-16T23:07:09Z

Is querying the entire ledger-state still a thing that is needed?

I understand that the ledger-state query was originally meant as a way to quickly get some functionality working. If it's possible to provide that functionality by querying for a subset of the ledger-state, that's preferred.

In which case, please track this issue: #4140

newhoggy · 2023-02-17T00:12:24Z

I ran this on mainnet and observed the CLI taking up to just over 6G when using --out-file parameter. Using --out-file dumps the binary and skips the decode.
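
Once the binary has been written to a file, a consumer does not necessarily have to load it whole just to inspect it. The sketch below reads only the first CBOR item head from a stream, following the head encoding in RFC 8949. It makes no assumption about how the ledger state is structured, and `read_cbor_head` is a hypothetical helper, not part of cardano-cli or any CBOR library.

```python
import io

MAJOR_TYPES = ("unsigned int", "negative int", "byte string", "text string",
               "array", "map", "tag", "simple/float")


def read_cbor_head(stream):
    """Read one CBOR item head and return (major type name, argument).

    The argument is a length, count, or value depending on the major type,
    or None for an indefinite-length item. For "simple/float" items with a
    multi-byte argument, the integer returned is the raw bit pattern.
    """
    first = stream.read(1)
    if len(first) != 1:
        raise EOFError("stream is empty")
    major, info = first[0] >> 5, first[0] & 0x1F
    if info < 24:
        # Small arguments are stored directly in the initial byte.
        return MAJOR_TYPES[major], info
    if info <= 27:
        # 24..27 mean a 1-, 2-, 4- or 8-byte big-endian argument follows.
        width = 1 << (info - 24)
        raw = stream.read(width)
        if len(raw) != width:
            raise EOFError("truncated CBOR head")
        return MAJOR_TYPES[major], int.from_bytes(raw, "big")
    if info == 31 and major in (2, 3, 4, 5):
        # Indefinite-length string, array, or map.
        return MAJOR_TYPES[major], None
    raise ValueError("not a valid start of a CBOR item")


# In-memory demonstrations; a real dump would be opened with open(path, "rb").
small_array = read_cbor_head(io.BytesIO(bytes([0x82])))
open_map = read_cbor_head(io.BytesIO(bytes([0xBF])))
big_bytes = read_cbor_head(io.BytesIO(b"\x5b" + (2167014930).to_bytes(8, "big")))
print(small_array, open_map, big_bytes)
```

The last example shows that CBOR itself carries lengths of up to 64 bits, so any 2 GB ceiling comes from the parser's indexing rather than from the format.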

ashisherc · 2023-02-17T04:36:44Z

We also require querying stakeGo and stakeMark from the ledger state. Note that we need the full stakeMark and stakeGo snapshots, not only the stake amount per pool ID.

rdlrt · 2023-02-17T05:16:29Z

> Is querying the entire ledger-state still a thing that is needed?

I think the major use case for this was indeed the stake snapshot.
However, as I understand it, ledger-state still contains off-chain information (rewards, treasury) that might not be available elsewhere from the node itself. This is also a blocker (limiting) for solutions downstream: cardano-db-sync is the only project that tries to work with ledger-state, and disabling the ledger-state query lets go of this information.
Similarly, other solutions like scrolls/ouros/ogmios/carp/cncli face similar restrictions.

So until fully equivalent ways to fetch this data from the node are available, the downstream solutions that require those features will unfortunately have to depend on ledger-state (even if it's supposed to be used only for debugging) 🙂

newhoggy · 2023-03-12T02:32:13Z

@rdlrt Can you create new tickets, one for each of the queries that are needed to not rely on ledger-state anymore?

newhoggy · 2023-03-12T02:34:04Z

@ashisherc does this meet your needs? #4279

ashisherc · 2023-03-12T03:58:30Z

@newhoggy thanks for the review, but that's not what I meant. As I mentioned in my previous comment, we rely on the full stakeGo/stakeMark snapshot, which means not just the pools info: we also need delegationMap and stakeMap, which are part of the stakeGo/stakeMark snapshots.

CarlosLopezDeLara · 2023-03-13T19:42:33Z

@ashisherc @rdlrt @AndrewWestberg Can I get your input as users here, please --> #4982

newhoggy · 2023-03-14T00:14:39Z

Yes please. Dumping the ledger state is not a thing that can easily be optimised, so it's best if we create feature requests for the queries that return the parts of the ledger state that people need.

newhoggy · 2023-03-14T00:16:02Z

I propose for this issue to be closed and new FRs be created to track each new query.

rdlrt · 2023-03-14T04:43:39Z

Created new issue #4984 as requested. I did not split it further, but feel free to split it if desired.

AndrewWestberg · 2024-01-30T21:49:55Z

It's epoch 464, and the cbor binary version of the ledger state I need to parse each epoch has reached 2.26 GiB:

Mux - LocalStateQueryProtocol processing: 580 buffers, size: 2.26 GiB

angerman added the bug Something isn't working label Mar 8, 2022

angerman changed the title ~~[BUG] -~~ [BUG] - dumping ledger state takes humungous amounts of memory Mar 8, 2022

Jimbo4350 assigned newhoggy Mar 8, 2022

Jimbo4350 added performance An Issue Related to the Performance of the Node and removed bug Something isn't working labels Mar 8, 2022

Jimbo4350 closed this as completed Oct 27, 2022

Jimbo4350 reopened this Oct 29, 2022

AndrewWestberg mentioned this issue Nov 15, 2022

CIP-0078? | Extended Local Chain Sync Protocol cardano-foundation/CIPs#375

Closed

AndrewWestberg mentioned this issue Dec 20, 2022

cncli leader log calculations using significant memory cardano-community/cncli#23

Closed

adavault mentioned this issue Dec 20, 2022

cncli leader log calculations using significant memory cardano-community/guild-operators#1587

Closed

rdlrt mentioned this issue Mar 14, 2023

[FR] - Pending ledger-state dependencies #4984

Open

6 tasks
