Is there a way to load only apartial portion of a given nwb file from cloud storage? #56
-
For my lab I'm hoping for a cloud storage solution that allows the user to query only certain parts of a large NWB file (say, the spike times only, and not the raw voltage traces). Does anybody know if there's already an established way to load only parts of a file from cloud storage? My current setup requires downloading an entire large file from cloud storage, which often takes a long time.. Is this sort of functionality already an established practice in managing online repositories of data? |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 4 replies
-
Hi there! This is indeed very possible with NWB files See our streaming tutorial using Though
We plan to add some convenience functions in the future to make this sort of thing a bit easier to specify and learn (together with instructions) Let me know if you run into any difficulties or have any questions about how to harness the power of remote data streaming |
Beta Was this translation helpful? Give feedback.
-
Is streaming supported for zarr NWB files? I've tried modifying the method in the tutorial, replacing I thought faster streaming from the cloud would be one of the primary benefits of using zarr for NWB files - am I doing something wrong? |
Beta Was this translation helpful? Give feedback.
Hi there!
This is indeed very possible with NWB files
See our streaming tutorial using
fsspec
on ans3_url
here: https://pynwb.readthedocs.io/en/stable/tutorials/advanced_io/streaming.html#streaming-method-1-fsspecThough
fsspec
in principle ought to allow you to use any other type of cloud storage, not just S3 (which we choose since that is how the DANDI archive stores NWB files)fsspec
is also especially good with caching, which saves those 'small parts' of the 'big remote file' file to your local system so that you don't need to re-download them every time you want to use them again (read more about all the various options here: https://filesystem-spec.readthedocs.io/en/latest/features…