The remoting client currently buffers all fetched blobs in memory before storing to LMDB. #17065

Closed · jsirois opened this issue Sep 30, 2022 · 4 comments · Fixed by #18054

jsirois (Contributor) commented Sep 30, 2022

That's here:

```rust
let read_result_closure = async {
  let mut buf = BytesMut::with_capacity(digest.size_bytes);
  while let Some(response) = stream.next().await {
    // Record the observed time to receive the first response for this read.
    if let Some(start) = start_opt.take() {
      if let Some(workunit_store_handle) = workunit_store::get_workunit_store_handle() {
        let timing: Result<u64, _> =
          Instant::now().duration_since(start).as_micros().try_into();
        if let Ok(obs) = timing {
          workunit_store_handle
            .store
            .record_observation(ObservationMetric::RemoteStoreTimeToFirstByteMicros, obs);
        }
      }
    }
    buf.extend_from_slice(&(response?).data);
  }
  Ok(buf.freeze())
};
```

For things like ~GB PyTorch wheel zips, buffering the whole blob in memory could be problematic.

jsirois added the bug label Sep 30, 2022
stuhood (Member) commented Sep 30, 2022

One option would be to store into a temporary file, and then copy into LMDB with:

```rust
///
/// Store data in two passes, without buffering it entirely into memory. Prefer
/// `Self::store_bytes` for small values which fit comfortably in memory.
///
pub async fn store<F, R>(
  &self,
  entry_type: EntryType,
  initial_lease: bool,
  data_is_immutable: bool,
  data_provider: F,
) -> Result<Digest, String>
where
  R: Read + Debug,
  F: Fn() -> Result<R, io::Error> + Send + 'static,
{
```

But that would use more passes over the data than we strictly need. We do need to re-compute and validate the Digest of the data (because we don't immediately trust the data that we get over the wire), but the Digest could be computed while writing the data to the temporary file, rather than by reading it in two passes as `fn store` does.
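A minimal sketch of that one-pass idea (not the actual pantsbuild code): the chunk stream feeds both a hasher and a temporary file, so the Digest is known by the time the download finishes. The `sha2` and `tempfile` crates, the `String` error type, and the chunked `Bytes` stream are assumptions here.

```rust
use std::io::Write;

use bytes::Bytes;
use futures::{Stream, StreamExt};
use sha2::{Digest as _, Sha256};
use tempfile::NamedTempFile;

// Stream chunks into a temp file while hashing them, returning the file, the
// SHA-256 of its contents, and the total length, all in a single pass.
async fn stream_to_temp_file(
  mut chunks: impl Stream<Item = Result<Bytes, String>> + Unpin,
) -> Result<(NamedTempFile, [u8; 32], usize), String> {
  let mut file = NamedTempFile::new().map_err(|e| e.to_string())?;
  let mut hasher = Sha256::new();
  let mut len = 0;
  while let Some(chunk) = chunks.next().await {
    let chunk = chunk?;
    // Both consumers see the same in-flight chunk, so the blob is never held
    // fully in memory. (A real implementation would push the blocking file
    // write onto a blocking pool rather than doing it on the async executor.)
    hasher.update(&chunk);
    file.write_all(&chunk).map_err(|e| e.to_string())?;
    len += chunk.len();
  }
  Ok((file, hasher.finalize().into(), len))
}
```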

stuhood (Member) commented Nov 9, 2022

One other much more fundamental idea would be to actually begin to "size split" our local::Store, and to store files larger than a threshold as files on disk. LMDB is great for small files and directory entries, but its advantages diminish for files which are too large to buffer in RAM.

That would have advantages for this codepath, but it would also have advantages for #17282, because if we chose our heuristics well, we could symlink directly from a large-file store, rather than first copying a large file out of the store and into a real file. cc @thejcannon
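A hypothetical sketch of what that size split might look like; the threshold, the `BlobLocation` enum, and the path layout are illustrative guesses, not the actual `local::Store` design.

```rust
use std::path::{Path, PathBuf};

// Assumption: blobs at or above this size skip LMDB entirely.
const LARGE_FILE_THRESHOLD: usize = 512 * 1024;

enum BlobLocation {
  // Small blobs and directory entries keep the current LMDB behaviour.
  Lmdb,
  // Large blobs live as content-addressed files that could later be linked
  // directly into sandboxes instead of being copied out of the store.
  LargeFilePool(PathBuf),
}

fn classify_blob(digest_hex: &str, size_bytes: usize, pool_root: &Path) -> BlobLocation {
  if size_bytes < LARGE_FILE_THRESHOLD {
    BlobLocation::Lmdb
  } else {
    BlobLocation::LargeFilePool(pool_root.join(digest_hex))
  }
}
```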

stuhood added the remote label Nov 14, 2022
stuhood pushed a commit that referenced this issue Jan 18, 2023
This is a no-functionality-change refactoring of `store::remote::ByteStore::load_bytes_with` that's arguably cleaner and also a step towards #11149. In particular:

1. that method doesn't need to take a closure any more, and thus is refactored to just be the "simplest": `load_bytes(...) -> Result<Option<Bytes>, String>`
2. that method previously didn't retry, and thus users had to do the retries themselves: this moves the retries fully inside the `load_bytes` method itself, which is easier to use and keeps implementation details like gRPC (previously exposed as the `ByteStoreError::Grpc`/`tonic::Status` error variant) entirely contained within `store::remote::ByteStore`
3. to emphasise that last point, the `ByteStoreError` enum can thus become private, because it's an implementation detail of `store::remote::ByteStore`, no longer exposed in the public API

Step 1 resolves (and removes) a TODO comment. That TODO references #17065, but this patch _doesn't_ fix that issue.
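A rough sketch of the shape described in points 1–3; the `Digest` stand-in, the attempt count, and the backoff are assumptions, not the real pants implementation. The key property is that retries live inside `load_bytes` and gRPC errors are converted to `String` before crossing the public API.

```rust
use std::time::Duration;

use bytes::Bytes;

// Stand-in for hashing::Digest; the real type carries a fingerprint and length.
#[derive(Clone, Copy)]
struct Digest;

struct ByteStore;

impl ByteStore {
  // A single attempt: in the real store this is a gRPC ByteStream read, and any
  // tonic::Status error would be mapped to String right here, keeping it private.
  async fn load_bytes_once(&self, _digest: Digest) -> Result<Option<Bytes>, String> {
    Ok(Some(Bytes::from_static(b"blob")))
  }

  // The public entry point: callers get the bytes (or None for a miss) without
  // passing a closure and without ever seeing transport-level error types.
  pub async fn load_bytes(&self, digest: Digest) -> Result<Option<Bytes>, String> {
    let mut last_err = String::new();
    for attempt in 0u64..3 {
      match self.load_bytes_once(digest).await {
        Ok(result) => return Ok(result),
        Err(e) => {
          last_err = e;
          // Simple linear backoff; the real retry policy is an assumption here.
          tokio::time::sleep(Duration::from_millis(100 * (attempt + 1))).await;
        }
      }
    }
    Err(last_err)
  }
}
```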
stuhood (Member) commented Jan 20, 2023

> One other much more fundamental idea would be to actually begin to "size split" our local::Store, and to store files larger than a threshold as files on disk. LMDB is great for small files and directory entries, but its advantages diminish for files which are too large to buffer in RAM.
>
> That would have advantages for this codepath, but it would also have advantages for #17282, because if we chose our heuristics well, we could symlink directly from a large-file store, rather than first copying a large file out of the store and into a real file. cc @thejcannon

Opened #18048 for this idea.

huonw (Contributor) commented Feb 12, 2023

> But that would use more passes over the data than we strictly need. We do need to re-compute and validate the Digest of the data (because we don't immediately trust the data that we get over the wire), but the Digest could be computed while writing the data to the temporary file, rather than by reading it in two passes as `fn store` does.

I've opened #18231 for this, since #18054 solves the basic "avoid buffering into memory" issue, without optimising the hashing.

huonw added a commit that referenced this issue Feb 12, 2023
This fixes #17065 by allowing remote cache loads to be streamed to disk. In
essence, the remote store now has a `load_file` method in addition to
`load_bytes`, and thus the caller can decide to download to a file instead.

This doesn't make progress towards #18048 (this PR doesn't touch the local
store at all), but I think it will help with integrating the remote store with
that code: in theory the `File` could be provided in a way that can be part of
the "large file pool" directly (and indeed, the decision about whether to
download to a file or into memory ties into that).

This also does a theoretically unnecessary extra pass over the data (as
discussed in #18231) to verify the digest, but I think it makes sense to remove
that pass as a future optimisation, since doing so will require refactoring more
deeply (down into `sharded_lmdb` and `hashing`, I think) and is best built on
#18153 once that lands.
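A sketch of the caller-facing shape this adds; the signatures, the stand-in types, and the size cutoff are assumptions, not the exact API from #18054. The point is that large blobs can be streamed into a destination file rather than materialised as `Bytes`.

```rust
use bytes::Bytes;
use tokio::fs::File;

// Stand-in digest carrying just the size, which is what the caller needs to
// decide between the in-memory and on-disk paths.
#[derive(Clone, Copy)]
struct Digest {
  size_bytes: usize,
}

struct RemoteByteStore;

impl RemoteByteStore {
  // Existing path: the whole blob ends up in memory as Bytes.
  async fn load_bytes(&self, _digest: Digest) -> Result<Option<Bytes>, String> {
    Ok(Some(Bytes::new()))
  }

  // New path: chunks are written to `dest` as they arrive, and the digest is
  // re-verified afterwards (currently via an extra read pass, per #18231).
  async fn load_file(&self, _digest: Digest, dest: File) -> Result<Option<File>, String> {
    Ok(Some(dest))
  }
}

// Assumption: a 1 MiB cutoff for choosing between the two paths.
async fn load(store: &RemoteByteStore, digest: Digest, dest: File) -> Result<(), String> {
  if digest.size_bytes < 1024 * 1024 {
    let _bytes = store.load_bytes(digest).await?;
  } else {
    let _file = store.load_file(digest, dest).await?;
  }
  Ok(())
}
```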