add at_reg_id_event #642

aleeciu · 2023-02-05T19:25:29Z

This PR addresses #595

climada/engine/impact.py

climada/engine/test/test_impact.py

Co-authored-by: Lukas Riedel <34276446+peanutfun@users.noreply.github.com>

climada/engine/impact.py

chahank · 2023-02-28T10:50:56Z

climada/engine/impact.py

+        elif not isinstance(agg_regions, pd.Series):
+            agg_regions = pd.Series(agg_regions)


Why make it a pandas series instead of a numpy array?

For two reasons:

I find it cleaner as you know what impact belongs to what region simply by looking at the dataframe

I like the pd.Series.unique() method because it retains the order. Doing the same with numpy is more cumbersome, to my knowledge.
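For reference, the "more cumbersome" numpy route mentioned above can be sketched as follows. The helper name `unique_in_order` and the sample region codes are illustrative assumptions, not code from this PR.

```python
import numpy as np


def unique_in_order(values):
    """Return the unique entries of `values` in order of first appearance.

    Hypothetical helper for illustration only.
    """
    arr = np.asarray(values)
    # np.unique sorts its output; return_index additionally gives the
    # position of the first occurrence of each unique value in `arr`.
    _, first_idx = np.unique(arr, return_index=True)
    # Sorting those positions restores the order of first appearance.
    return arr[np.sort(first_idx)]


regions = [756, 40, 756, 276, 40]  # made-up numeric region codes
print(unique_in_order(regions).tolist())  # [756, 40, 276]
```

This needs one extra indexing step compared to `pd.Series.unique()`, which is the trade-off the comment refers to.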

I would argue that this is a misuse of a pandas series.

> I find it cleaner as you know what impact belongs to what region simply by looking at the dataframe

That is independent of whether you use numpy arrays or a pandas series. But using a pandas series when it is not needed is imho actually less clean.

> I like the pd.Series.unique() method because it retains the order. Doing the same with numpy is more cumbersome, to my knowledge.

I would argue that this is suboptimal, as an important point is now hidden in the subtle difference between the internal workings of pandas.Series.unique() and np.unique().
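The difference between the two methods can be seen in a minimal sketch. The region codes are made-up sample data, and integers are used so the result does not depend on how pandas stores strings.

```python
import numpy as np
import pandas as pd

# Made-up numeric region codes, one per exposure point.
agg_regions = np.array([756, 40, 756, 276, 40])

# pandas keeps the order of first appearance and does not sort.
by_appearance = pd.Series(agg_regions).unique()
# numpy returns the unique values sorted.
sorted_unique = np.unique(agg_regions)

print(by_appearance.tolist())  # [756, 40, 276]
print(sorted_unique.tolist())  # [40, 276, 756]
```

Code that relies on one ordering or the other should say so explicitly, for example in the docstring.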

@peanutfun : what do you think?

On the first point you are right: having a pd.Series is not strictly needed to generate a pd.DataFrame.

On the second: I am transforming agg_regions into a pd.Series only when the user supplies it as an np.array or list. But the user can supply a pd.Series in the first place. So I don't think it's a misuse; I just wrote the code in a way that works with a pd.Series.

So I think your doubt is rather: should we use pandas for things that can be done with numpy? I'm not sure I have a clear answer to that, as this would probably apply to any code that makes use of pandas.

The point is that pd.Series objects are made to be used in pandas dataframes, not as standalone objects. They are essentially numpy arrays with some extras. Thus, instead of making an unclear use of pd.Series, I suggest using numpy arrays and making the ordering requirement explicit (currently it is a hidden feature).

I don't really see how the order is important because the order in the agg_regions parameter is determined by the "order" of the exposure points. Also, the order of columns in the final data frame does not appear relevant to me 😅

I would actually argue that the unsorted result of pandas.Series.unique (values in order of appearance) does not need further explanation, whereas I feel we should add a note to the docstring that the unique items will be sorted when using np.unique. So, I think the easiest solution is to just stick to the current implementation.

@chahank: Following our discussion on the "clean" indexing, I just realized that pandas.Series.unique returns a numpy array, so everything is fine from that perspective.
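A quick check of that return type, as a sketch with made-up region codes. Note that this holds for plain numpy-backed dtypes such as integers; for extension dtypes (e.g. categorical) pandas returns an extension array instead.

```python
import numpy as np
import pandas as pd

region_ids = pd.Series([756, 40, 756, 276])  # made-up sample data
unique_ids = region_ids.unique()

# For a plain integer Series the result is a numpy array, not a Series.
print(isinstance(unique_ids, np.ndarray))  # True
print(isinstance(unique_ids, pd.Series))   # False
```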

You are right, the order does not count. I am now using only numpy arrays.

> I would argue that this is a misuse of a pandas series.

Not necessarily. According to the docs, a Series is a "one-dimensional ndarray with axis labels (including time series)". Sounds useful outside of a DataFrame as well. 🤷

I very much agree with @emanuel-schmid here 😃

I'm fine with the current implementation. However, please make sure to only call np.unique once, and store the result for later use. It is currently executed twice and will be costly for large arrays. I can also take care of that when merging. Are we ready to go ahead?
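The "call np.unique once" request can be sketched as below. This is a simplified illustration, not the CLIMADA implementation: the function name `impact_by_region`, the dense `imp_mat` array (the real impact matrix may be sparse) and the sample values are all assumptions.

```python
import numpy as np
import pandas as pd


def impact_by_region(imp_mat, agg_regions, event_ids):
    """Sum per-event impacts over the exposure points of each region.

    Hypothetical helper: `imp_mat` is a dense (n_events, n_points) array
    and `agg_regions` holds one region label per exposure point.
    """
    imp_mat = np.asarray(imp_mat)
    # Accept list, numpy array or pandas Series alike.
    agg_regions = np.asarray(agg_regions)
    if agg_regions.shape != (imp_mat.shape[1],):
        raise ValueError("agg_regions needs one label per exposure point")

    # Compute the (sorted) unique regions once and reuse the result both
    # for the aggregation loop and for the column labels.
    regions = np.unique(agg_regions)
    columns = [imp_mat[:, agg_regions == reg].sum(axis=1) for reg in regions]
    return pd.DataFrame(
        np.column_stack(columns), columns=regions, index=event_ids
    )


imp_mat = np.array([[1.0, 2.0, 3.0], [0.0, 5.0, 1.0]])  # 2 events, 3 points
df = impact_by_region(imp_mat, [756, 40, 756], event_ids=[1, 2])
print(df.columns.tolist())  # [40, 756]
print(df.values.tolist())   # [[2.0, 4.0], [5.0, 1.0]]
```

Storing `regions` in a variable avoids sorting a large label array twice, which is the cost concern raised in the comment.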

Co-authored-by: Chahan M. Kropf <chahan.kropf@usys.ethz.ch>

climada/engine/impact.py

aleeciu · 2023-02-28T16:28:06Z

I guess we are ready to merge

climada/engine/impact.py

emanuel-schmid · 2023-03-01T08:48:01Z

Yes, looks good. 😄 I did some cosmetics in the docstring. If you're happy with them @aleeciu, let me know or just commit and merge.

Co-authored-by: Emanuel Schmid <51439563+emanuel-schmid@users.noreply.github.com>

emanuel-schmid · 2023-03-01T08:54:09Z

Oh, and one more thing: we need to update the CHANGELOG.md.

peanutfun · 2023-03-01T08:55:06Z

Will do and handle the merge! 👌

add at_reg_id_event

7b01fed

aleeciu mentioned this pull request Feb 5, 2023

Impact Aggregation by Region #595

Closed

emanuel-schmid marked this pull request as draft February 7, 2023 12:02

peanutfun requested changes Feb 15, 2023

climada/engine/impact.py (outdated, resolved)

climada/engine/impact.py (outdated, resolved)

climada/engine/impact.py (outdated, resolved)

peanutfun marked this pull request as ready for review February 15, 2023 16:58

remove as attr, add admin_0 option, add unit test

f0096d7

emanuel-schmid reviewed Feb 17, 2023

climada/engine/impact.py (outdated, resolved)

aleeciu added 2 commits February 17, 2023 14:10

work with agg regs and not exposure

a93474f

grammar

09254d1

peanutfun requested changes Feb 23, 2023

aleeciu and others added 8 commits February 27, 2023 16:21

Update climada/engine/impact.py

4236025

Co-authored-by: Lukas Riedel <34276446+peanutfun@users.noreply.github.com>

Update climada/engine/impact.py

af28fef

Co-authored-by: Lukas Riedel <34276446+peanutfun@users.noreply.github.com>

Update climada/engine/impact.py

cff3318

Co-authored-by: Lukas Riedel <34276446+peanutfun@users.noreply.github.com>

add haz indexes as df indexes

f96ae47

raise error when no imp_mat

bdcf0d4

Update climada/engine/impact.py

c99ce83

Co-authored-by: Lukas Riedel <34276446+peanutfun@users.noreply.github.com>

update docstring

8cc4769

update test

17d6d5e

chahank reviewed Feb 28, 2023

climada/engine/impact.py (outdated, resolved)

chahank reviewed Feb 28, 2023

aleeciu and others added 2 commits February 28, 2023 11:53

Update climada/engine/impact.py

f4ea128

Co-authored-by: Chahan M. Kropf <chahan.kropf@usys.ethz.ch>

delete commented code

afb5f3b

peanutfun reviewed Feb 28, 2023

climada/engine/impact.py (outdated, resolved)

aleeciu and others added 5 commits February 28, 2023 13:43

use np.array and cntries iso_codes

f262bb1

remove commented code

5fcf72b

Only compute unique aggregation regions once

6cce7ce

Split TestImpactReg into three test cases

cf9c79c

Format new code with black

c84deb0

emanuel-schmid reviewed Mar 1, 2023

climada/engine/impact.py (outdated, resolved)

emanuel-schmid reviewed Mar 1, 2023

climada/engine/impact.py (outdated, resolved)

Apply suggestions for docstring

1188c45

Co-authored-by: Emanuel Schmid <51439563+emanuel-schmid@users.noreply.github.com>

peanutfun approved these changes Mar 1, 2023

peanutfun added 2 commits March 1, 2023 09:55

Merge branch 'develop' into feature/impact_at_reg_id

4a9b991

Update CHANGELOG.md

57053be

peanutfun merged commit 187a7d5 into develop Mar 1, 2023

emanuel-schmid deleted the feature/impact_at_reg_id branch March 6, 2023 09:16

peanutfun mentioned this pull request Feb 5, 2024

Avoid redundant calls to np.unique in Impact.impact_at_reg #848

Merged

13 tasks
