Feature/impact write from hdf5 #606

ThomasRoosli · 2022-12-14T18:33:23Z

Changes proposed in this PR:

introduce write_hdf5 and read_hdf5 methods in Impact
so far Impact.event_name does not have to be of type list(str) but just of type list. This is a requirement in Hazard, where this attribute generally originates from. I propose to make this the required type also in Impact. This would mean we have to change the docstring and some tests.

This PR fixes #

PR Author Checklist

Read the Contribution Guide
Correct target branch selected (if unsure, select develop)
Source branch up-to-date with target branch
Documentation updated
Tests updated
Tests passing
No new linter issues

PR Reviewer Checklist

…e list(str) for Impact.event_name

peanutfun

It gets the job done, but I have to say that I find the overall structure of the functions a bit hard to understand. It currently iterates over all __dict__ entries, and then uses a large if/elif/else tree to do specific things with specific items. I would much prefer to instead define local "writer functions" and a mapping from attribute name or type to writer function. Then we can iterate over all attributes, check for a writer function or fall back to a default one. This would also help with some code duplications, specifically for writing all the tags. I'm thinking of something like this:

def write_str(name, value):
    data.attrs[name] = value
def write_csr(name, value):
    data.create_dataset(name, data=value.toarray())
def write_default(name, value):
    data.create_dataset(name, data=value)

attr_writers = {str: write_str, sparse.csr_matrix: write_csr}
for (var_name, var_val) in self.__dict__.items():
    # Fetch the write function if it exists for the type. If not, use the default one.
    write_func = attr_writers.get(type(var_val), write_default)
    # Call it with the attr name and value
    write_func(var_name, var_val)

(The same goes for the reader)

climada/engine/impact.py

peanutfun · 2023-01-27T13:24:34Z

@ThomasRoosli I think I am finished. Would you have a look?

ThomasRoosli · 2023-01-30T16:14:12Z

Thanks @peanutfun for restructuring especially the write_hdf5 function using type specific writers. And thanks for extending the tests. I am now very happy with the status of the code and the functionality it adds to the impact class. I am happy if we can merge this.

peanutfun · 2023-01-31T09:22:59Z

@emanuel-schmid This new Impact.write_hdf5 uses generic writer functions. @ThomasRoosli and I talked about making them part of a utility module so that we can use these functions in any other <Class>.write_hdf5 method. We opted against it to simplify this PR. However, defining all the writer functions inside the write_hdf5 method makes the linter unhappy. What would you recommend: Split the generic writer functions into a utility module or keep them inside write_hdf5 at the expense of some added linter issues?

climada/engine/impact.py

emanuel-schmid · 2023-02-01T10:44:19Z

Very nice! 😁

Split the generic writer functions into a utility module or keep them inside write_hdf5 at the expense of some added linter issues?

My preference would be to make it a TypeWriter class that takes the type_writers dict as __init__ argument and has one method, write. Could be part of the hdf5_handlers module or have a module of its own. I'm positive we'll be using it elsewhere, eventually.
However this can also be a PR of its own. Up to you. 😎

peanutfun · 2023-02-02T11:25:53Z

@emanuel-schmid I don't want to over-complicate this PR. I like your proposal a lot, but I would want to discuss it first and then have a follow-up PR.

climada/engine/test/test_impact.py

Thomas Roosli added 4 commits December 14, 2022 18:53

Introduce write_hdf5 and from_hdf5 methods for the Impact class

8471fb3

Restrict write_hdf5 and from_hdf5 methods for the Impact class to typ…

920f069

…e list(str) for Impact.event_name

Add docstring of parameter todense to Hazard.write_hdf5()

1282b31

correct pylint warnings

bf2b248

ThomasRoosli requested a review from peanutfun December 14, 2022 18:33

peanutfun requested changes Dec 19, 2022

View reviewed changes

peanutfun self-assigned this Jan 24, 2023

peanutfun marked this pull request as draft January 24, 2023 16:21

peanutfun added 3 commits January 26, 2023 18:33

Rework HDF5 I/O for Impact

e5ecdf1

Update docstrings and try fixing linter issues

d94b197

Fix more linter issues and update docstrings

8b1e2db

peanutfun marked this pull request as ready for review January 27, 2023 13:20

Use toarray in favor of todense in tests

cbe50c5

ThomasRoosli changed the title ~~WIP: Feature/impact write from hdf5~~ Feature/impact write from hdf5 Jan 30, 2023

ThomasRoosli requested a review from peanutfun January 30, 2023 16:42

peanutfun approved these changes Jan 31, 2023

View reviewed changes

peanutfun requested a review from emanuel-schmid February 1, 2023 10:25

emanuel-schmid reviewed Feb 1, 2023

View reviewed changes

climada/engine/impact.py Outdated Show resolved Hide resolved

emanuel-schmid reviewed Feb 1, 2023

View reviewed changes

climada/engine/impact.py Show resolved Hide resolved

peanutfun mentioned this pull request Feb 2, 2023

Follow-up from #606: Create generic H5 type writer from Impact.write_hdf5 #638

Open

peanutfun reviewed Feb 2, 2023

View reviewed changes

climada/engine/test/test_impact.py Outdated Show resolved Hide resolved

peanutfun added 4 commits February 2, 2023 13:31

Remove stray unit test case

1435f84

Move dummy_impact to top of test_impact.py

7db5e25

Add default writer to type_writers dict

b4abd2b

Merge branch 'develop' into feature/impact_write_from_hdf5

2b150a1

Update CHANGELOG.md

e654fb4

peanutfun merged commit f990035 into develop Feb 2, 2023

emanuel-schmid deleted the feature/impact_write_from_hdf5 branch February 3, 2023 14:52

peanutfun mentioned this pull request Jun 5, 2023

Store Impact objects into NetCDF and load them again using xarray #514

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/impact write from hdf5 #606

Feature/impact write from hdf5 #606

ThomasRoosli commented Dec 14, 2022 •

edited

Loading

peanutfun left a comment •

edited

Loading

peanutfun commented Jan 27, 2023

ThomasRoosli commented Jan 30, 2023

peanutfun commented Jan 31, 2023

emanuel-schmid commented Feb 1, 2023 •

edited

Loading

peanutfun commented Feb 2, 2023

Feature/impact write from hdf5 #606

Feature/impact write from hdf5 #606

Conversation

ThomasRoosli commented Dec 14, 2022 • edited Loading

PR Author Checklist

PR Reviewer Checklist

peanutfun left a comment • edited Loading

Choose a reason for hiding this comment

peanutfun commented Jan 27, 2023

ThomasRoosli commented Jan 30, 2023

peanutfun commented Jan 31, 2023

emanuel-schmid commented Feb 1, 2023 • edited Loading

peanutfun commented Feb 2, 2023

ThomasRoosli commented Dec 14, 2022 •

edited

Loading

peanutfun left a comment •

edited

Loading

emanuel-schmid commented Feb 1, 2023 •

edited

Loading