-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Check memory usage of yokogawa_to_zarr
#72
Comments
This looks very much like the issues we had with ROIs early on, where all the computation happens first and it only writes to the zarr file in the end. I wonder if a similar trick of writing to the file in defined |
I thought the issue with processing 2D images was that we would generate tens of thousands of dask operations (one per image), although it's true that each graph should be very light (one image gets read from disk and written to zarr). I don't know whether dask builds 10000 small graphs or a global one. Given your information on the dask-overhead issue, do you think it matters here? At the moment we started preparing the 3D version, but we can switch back to 2D if it's preferred. |
Hmm, I mostly had this intuition from accessing ROIs sequentially, thus having indexing to load parts of the data. While we also have a large zarr array to fill lazily, I'm not sure we'll have the same issue here when every chunk is loaded individually from disk. In the end, both ways should work and it shouldn't be that different to refactor from one to the other. Let's test whichever way you implement first and see if either there is a large memory overhead for 3D or a large dask overhead (=> also in memory) from 2D. I can see pros & cons for either (e.g. maybe writing 3D arrays is faster? But maybe 2D is more memory efficient? Or the difference is actually negligible? 🤷♂️) |
EDIT: I need to check that I was using the right version of the tasks! I think it was right, but let's re-discuss this on Monday. |
Makes sense. We could then run such tasks in a cpu-intermediate setting (or even cpu-low, though probably not once FOVs have more Z stacks). |
See fractal-analytics-platform/fractal-server#51
The text was updated successfully, but these errors were encountered: