Skip to content

Conversation

@CKehl
Copy link
Contributor

@CKehl CKehl commented Mar 26, 2020

This PR is a clone or mirror of #774 , which suffers from a broken CI.

The lazy evaluation from Dask is still lacking robustness and some users still experience errors in some setups, which this PR attempts to resolve.

Test are developed for NEMO, GlobCurrent and POP.

CKehl and others added 24 commits March 19, 2020 17:36
…on paleo-plankton to verify fixes and develop a proper testing procedure
…o-bewritten, but existing tests already pass
…anParcels/parcels into stabilize_lazy_evaluation_NEMOfields
…and a NetCDF dimension (should be self.dataset.dimensions[name].size)
…h individual entry in files. That means that many files are opened multiple times simultaneously, leading to memory errors.
…ld_chunksize and the acutallly used init_chunk_size
…er-defined field_chunksize for dicts of dicts
…ints ...' by also test the dask environment default (which is usually 128MiB)
…s arising from an error in the NetCDFFileBuffer. Also cleared commented code where required and adapted formatting to the PEP8 linter
@CKehl CKehl self-assigned this Mar 26, 2020
@CKehl CKehl changed the title Stbilize dask loading of Ocean Model data Stabilize dask loading of Ocean Model data Mar 26, 2020
CKehl and others added 19 commits March 31, 2020 12:16
…d to actually stop the search only if the filenames also match
…le but not requested py Parcels dimensions nor chunk sizes
Passes in scipy but breaks in jit
The rectilinear field works in both  jit and scipy, but the curvilinear field only works in scipy
…tually also relevant to the requested data, in any 'field_chunksize' formulation
…nable test after #782 fix is thrown. Further adapted and fixed the chunking.
…he requested netcdf structure and the requested parcels dimensionality for each case of field_chunksize being False or None, auto, a tuple or a (dict of) dict(s). Now following with a clean-up.
erikvansebille and others added 4 commits April 2, 2020 14:04
…field() function. It's not ideal as NetCDF fields now need to run 3 times through essentially the same test, but in the current structure and with the required versatility, this is the way it works now. Tests are expanded and all run successfully locally.
@CKehl
Copy link
Contributor Author

CKehl commented Apr 2, 2020

The code runs all tests fine, but the test_palaeo_dataset.py still fails.

CKehl added a commit that referenced this pull request Apr 3, 2020
CKehl added a commit to CKehl/parcels that referenced this pull request Apr 3, 2020
@CKehl CKehl merged commit 272ff59 into master Apr 3, 2020
@CKehl CKehl deleted the stabilize_dask_field_loading branch September 27, 2022 15:33
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants