feat: cacheless amt iteration #2189

LesnyRumcajs · 2025-07-31T10:35:19Z

This exposes an alternative iteration method without caching. Without it, the memory usage for iterating over deal proposals is too high (my 64 GiB machine gets knocked out under a minute). For reference, this works fine in Go's AMT implementation.

With for_each_cacheless, I can iterate the proposals in ~90s (no-op iteration).

The logic is mostly taken from the for_each_while_mut.

codecov-commenter · 2025-07-31T10:38:24Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 77.56%. Comparing base (9273717) to head (602ac18).
⚠️ Report is 2 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #2189      +/-   ##
==========================================
+ Coverage   77.50%   77.56%   +0.06%     
==========================================
  Files         147      147              
  Lines       15743    15789      +46     
==========================================
+ Hits        12201    12247      +46     
  Misses       3542     3542

Files with missing lines	Coverage Δ
ipld/amt/src/amt.rs	`83.63% <100.00%> (+0.74%)`	⬆️
ipld/amt/src/node.rs	`87.56% <100.00%> (+1.22%)`	⬆️

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

LesnyRumcajs · 2025-07-31T10:40:33Z

CI blocked by #2187

rvagg · 2025-08-01T07:23:34Z

The only concern I have here is that this reportedly reads twice as many blocks as the cached version which seems off to me. I'd like to figure out what's going on there first.

LesnyRumcajs · 2025-08-01T07:52:18Z

Three iterations in the tests lead to this block read count.

Dirty cache - no block reads at all

ref-fvm/ipld/amt/tests/amt_tests.rs

Lines 418 to 424 in bf6fc44

    
           // Iterate over amt with dirty cache 
        
           let mut x = 0; 
        
           a.for_each_cacheless(|_, _: &BytesDe| { 
        
               x += 1; 
        
               Ok(()) 
        
           }) 
        
           .unwrap();

1st pass

ref-fvm/ipld/amt/tests/amt_tests.rs

Lines 434 to 445 in bf6fc44

    
           new_amt 
        
               .for_each_cacheless(|i, _: &BytesDe| { 
        
                   if i != indexes[x] { 
        
                       panic!( 
        
                           "for each cacheless found wrong index: expected {} got {}", 
        
                           indexes[x], i 
        
                       ); 
        
                   } 
        
                   x += 1; 
        
                   Ok(()) 
        
               }) 
        
               .unwrap();

2nd pass

ref-fvm/ipld/amt/tests/amt_tests.rs

Line 448 in bf6fc44

new_amt.for_each_cacheless(|_, _: &BytesDe| Ok(())).unwrap();

Now, given that the only block read operation (and cache read) occurs here, it makes sense.

ref-fvm/ipld/amt/src/node.rs

Lines 451 to 457 in bf6fc44

    
           Link::Cid { cid, cache: _ } => { 
        
               let node = bs 
        
                   .get_cbor::<CollapsedNode<V>>(cid)? 
        
                   .ok_or_else(|| Error::CidNotFound(cid.to_string()))? 
        
                   .expand(bit_width)?; 
        
               node.for_each_cacheless(bs, height - 1, bit_width, offs, f)?;

All entries are "normally" cached on the 1st pass, but in the for_each_cacheless there is none, so the 2nd pass has to re-read all blocks (and not grab it from the cache) as it is here:

ref-fvm/ipld/amt/src/iter.rs

Lines 230 to 236 in bf6fc44

    
           match cache.get_or_try_init(|| { 
        
               self.blockstore 
        
                   .get_cbor::<CollapsedNode<V>>(cid)? 
        
                   .ok_or_else(|| Error::CidNotFound(cid.to_string()))? 
        
                   .expand(self.bit_width) 
        
                   .map(Box::new) 
        
           }) {

rvagg

needs docs, but lgtm
can be rebased on master because the other PR is merged now

LesnyRumcajs · 2025-08-01T10:22:29Z

ipld/amt/src/lib.rs

 //!
 //! Data structure reference:
-//! https://github.com/ipld/specs/blob/51fab05b4fe4930d3d851d50cc1e5f1a02092deb/data-structures/vector.md
+//! <https://github.com/ipld/specs/blob/51fab05b4fe4930d3d851d50cc1e5f1a02092deb/data-structures/vector.md>


warning: this URL is not a hyperlink --> ipld/amt/src/lib.rs:8:5 | 8 | //! https://github.com/ipld/specs/blob/51fab05b4fe4930d3d851d50cc1e5f1a02092deb/data-structures/vector.md | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | = note: bare URLs are not automatically turned into clickable links = note: `#[warn(rustdoc::bare_urls)]` on by default help: use an automatic link instead | 8 | //! <https://github.com/ipld/specs/blob/51fab05b4fe4930d3d851d50cc1e5f1a02092deb/data-structures/vector.md> | + +

There are other doc links issues in the repo; I'll have a look later and see if there can be a friendly CI check to ensure this doesn't happen in the future.

ipld/amt/src/amt.rs

rvagg

nice
I bet the HAMT could get exactlty the same treatment

github-project-automation bot moved this to 📌 Triage in FilOz Jul 31, 2025

github-project-automation bot added this to FilOz Jul 31, 2025

LesnyRumcajs mentioned this pull request Jul 31, 2025

Improve market state retrieval ChainSafe/forest#5880

Open

2 tasks

rvagg reviewed Aug 1, 2025

View reviewed changes

feat: cacheless amt iteration

d782519

LesnyRumcajs force-pushed the amt-for-each-cacheless branch from bf6fc44 to d782519 Compare August 1, 2025 09:16

docs, comments

9b21dbd

LesnyRumcajs force-pushed the amt-for-each-cacheless branch from 1ced61e to 9b21dbd Compare August 1, 2025 10:14

LesnyRumcajs commented Aug 1, 2025

View reviewed changes

LesnyRumcajs marked this pull request as ready for review August 1, 2025 10:25

LesnyRumcajs changed the title ~~[draft] feat: cacheless amt iteration~~ feat: cacheless amt iteration Aug 1, 2025

rvagg reviewed Aug 1, 2025

View reviewed changes

ipld/amt/src/amt.rs Show resolved Hide resolved

LesnyRumcajs added 2 commits August 1, 2025 13:11

doc: remove flush hint

a88162d

revert doc fix

602ac18

rvagg approved these changes Aug 4, 2025

View reviewed changes

github-project-automation bot moved this from 📌 Triage to ✔️ Approved by reviewer in FilOz Aug 4, 2025

LesnyRumcajs merged commit a633547 into master Aug 4, 2025
16 checks passed

LesnyRumcajs deleted the amt-for-each-cacheless branch August 4, 2025 17:25

github-project-automation bot moved this from ✔️ Approved by reviewer to 🎉 Done in FilOz Aug 4, 2025

LesnyRumcajs mentioned this pull request Aug 4, 2025

consider for_each_cacheless for HAMT and KAMT #2193

Open

rvagg mentioned this pull request Sep 2, 2025

[Testing Only] cacheless hamt iteration #2215

Draft

hanabi1224 mentioned this pull request Sep 4, 2025

feat: cacheless hamt iteration #2216

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: cacheless amt iteration #2189

feat: cacheless amt iteration #2189

Uh oh!

LesnyRumcajs commented Jul 31, 2025

Uh oh!

codecov-commenter commented Jul 31, 2025 •

edited

Loading

Uh oh!

LesnyRumcajs commented Jul 31, 2025

Uh oh!

rvagg commented Aug 1, 2025

Uh oh!

LesnyRumcajs commented Aug 1, 2025

Uh oh!

rvagg left a comment

Uh oh!

LesnyRumcajs Aug 1, 2025

Uh oh!

LesnyRumcajs Aug 1, 2025

Uh oh!

LesnyRumcajs Aug 1, 2025

Uh oh!

Uh oh!

rvagg left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

feat: cacheless amt iteration #2189

feat: cacheless amt iteration #2189

Uh oh!

Conversation

LesnyRumcajs commented Jul 31, 2025

Uh oh!

codecov-commenter commented Jul 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

LesnyRumcajs commented Jul 31, 2025

Uh oh!

rvagg commented Aug 1, 2025

Uh oh!

LesnyRumcajs commented Aug 1, 2025

Uh oh!

rvagg left a comment

Choose a reason for hiding this comment

Uh oh!

LesnyRumcajs Aug 1, 2025

Choose a reason for hiding this comment

Uh oh!

LesnyRumcajs Aug 1, 2025

Choose a reason for hiding this comment

Uh oh!

LesnyRumcajs Aug 1, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

rvagg left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codecov-commenter commented Jul 31, 2025 •

edited

Loading