-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Refactor rawdem #224
Refactor rawdem #224
Conversation
It looks like if `*args` are passes to a logging function, a placeholder `%s` is needed in the logging message, otherwise it issues an error.
As RawLidarDemGenerator is taking care of the temporary folder, and to avoid hiding the I/O operation inside RawDem.add_lidar, I am moving it to one level up, inside RawLidarDemGenerator.run. Overall, the idea is to separate the I/O methods from the preparation methods in RawDem.
This is a continuation of splitting I/O from preparing the data preparation.
These 2 parameters are always the same and used in enough method that adding them to the instance reduce the signature length of multiple methods.
`no_values_mask` is now a property, it is easy to check before adding a new coarse DEM if it is needed, making the coarse DEM loop not too big to be included in RawLidarDemGenerator
Hi, here are my notes about what I have tried so far, in order to improve the behavior of the dask part on the example tile (i.e. getting good cpu utilisation, little transfer overhead reported in the dashboard and no worker timeout errors). Trying without work stealing (https://distributed.dask.org/en/latest/work-stealing.html)
Trying to change priority (https://distributed.dask.org/en/stable/priority.html)
Trying to pre-scatter the mask to clip dem in
Trying to get more reliable comms
Trying to remove all clipping/masking before
Testing with different masking in
But adding clip_mask back everywhere, transfer is bad (significant amount vs. compute) and timeout errors for dask workers :(. TODO export DASK__DISTRIBUTED__COMM__TIMEOUTS__CONNECT="60s" |
…m-refactor_rawdem 227 ensure passing tests form refactor rawdem
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I approve on behalf of Rose :-).
@jennan refactor to try address Dask failures
Specifically:
Create mask arrays instead of repeatably clipping against geometries.
Revisit the save/load
Add a no_values_mask property