Performance fix #410

gjm174 · 2024-07-24T10:11:42Z

After profiling some changes which improve the overall performance, especially the loading of NWP files.

codecov · 2024-07-24T10:24:15Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 83.88%. Comparing base (953f799) to head (ff03ea1).
Report is 8 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #410      +/-   ##
==========================================
+ Coverage   83.52%   83.88%   +0.35%     
==========================================
  Files         159      160       +1     
  Lines       12575    12780     +205     
==========================================
+ Hits        10503    10720     +217     
+ Misses       2072     2060      -12

Flag	Coverage Δ
unit_tests	`83.88% <100.00%> (+0.35%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

dnerini

Thanks @gjm174 for this contribution! I was curious to get an idea of the performance improvement, do you have some numbers?

Also, I'm a bit skeptical on some of these changes: you removed copies of the data, which surely ensures a lower memory footprint. Are we sure these copies were not needed to avoid side effects later on in the code? All tests are green, which is good, but I wouldn't rely too much on our test suite to detect such unintended effects.

perhaps @RubenImhoff could also have a look since you worked more closely on the steps blending code?

dnerini · 2024-07-24T19:26:05Z

pysteps/extrapolation/semilagrangian.py

@@ -173,7 +173,7 @@ def extrapolate(

    if xy_coords is None:
        x_values, y_values = np.meshgrid(
-            np.arange(velocity.shape[2]), np.arange(velocity.shape[1])
+            np.arange(velocity.shape[2]), np.arange(velocity.shape[1]), copy=False


I can see how this provides better performance, but are we 100% sure that there are no side effects?

perhaps @pulkkins as the original author of this code can jump in here and give us a feedback on whether those copies were intentional or not?

Is it likely that the velocity shape changes? If not, I expect that the copy is not needed.

I think not, since the np.arange(...) creates a new array, and it's this new array that is no longer copied. When copy=True, the reference to the brand-new array at some point goes out of scope and the memory will be garbage collected anyway. I think this just avoids unnecessarily creating and throwing away an intermediate array.

dnerini · 2024-07-24T19:31:20Z

pysteps/blending/steps.py

-        ]
-        precip_cascade = np.stack(precip_cascade)
+
+        precip_cascade = np.stack(


In this new version, you use a repeated reference to the same sliced data, which is more memory efficient but can lead to unintended side effects if the data is modified later. The previous version used deep copies to ensure that modifications to one ensemble member do not affect the others. Did you consider such aspects?

I expect that this is still fine here, as this takes places prior to the dask-based parellization of the ensemble members.

gjm174 · 2024-07-25T13:20:13Z

Thanks @gjm174 for this contribution! I was curious to get an idea of the performance improvement, do you have some numbers?

Also, I'm a bit skeptical on some of these changes: you removed copies of the data, which surely ensures a lower memory footprint. Are we sure these copies were not needed to avoid side effects later on in the code? All tests are green, which is good, but I wouldn't rely too much on our test suite to detect such unintended effects.

perhaps @RubenImhoff could also have a look since you worked more closely on the steps blending code?

The main performance improvement occurs in the reading of NWP files. The other minor changes save approximately 6-7 seconds. The loading of the NWP files now takes only one-fifth of the time it did previously. I have ensured that no unintended effects will occur by not making copies.

dnerini · 2024-07-26T09:53:26Z

Very good thanks! I'll wait for a second opinion from @RubenImhoff and then I'm happy to merge this PR

RubenImhoff · 2024-07-26T15:54:27Z

I will have a look at it after this weekend. Great work so far, @gjm174!

RubenImhoff

@gjm174, nice work! I left a few comments, but other than, it seems good to go. Nice work!

RubenImhoff · 2024-07-29T06:59:44Z

pysteps/blending/steps.py

-        ]
-        precip_cascade = np.stack(precip_cascade)
+
+        precip_cascade = np.stack(


I expect that this is still fine here, as this takes places prior to the dask-based parellization of the ensemble members.

RubenImhoff · 2024-07-29T07:03:23Z

pysteps/blending/steps.py

Talking about deep copies, should:

forecast_prev = precip_cascade noise_prev = noise_cascade

get a deep copy?

RubenImhoff · 2024-07-29T07:04:48Z

pysteps/extrapolation/semilagrangian.py

@@ -173,7 +173,7 @@ def extrapolate(

    if xy_coords is None:
        x_values, y_values = np.meshgrid(
-            np.arange(velocity.shape[2]), np.arange(velocity.shape[1])
+            np.arange(velocity.shape[2]), np.arange(velocity.shape[1]), copy=False


Is it likely that the velocity shape changes? If not, I expect that the copy is not needed.

Improved NWP loading

d58c144

gjm174 force-pushed the Performance-fix branch from 5262db0 to b72f9cf Compare July 24, 2024 12:08

gjm174 requested a review from dnerini July 24, 2024 12:33

dnerini reviewed Jul 24, 2024

View reviewed changes

gjm174 force-pushed the Performance-fix branch from b72f9cf to 13da711 Compare July 25, 2024 09:40

dnerini requested review from pulkkins and RubenImhoff July 26, 2024 09:55

RubenImhoff reviewed Jul 29, 2024

View reviewed changes

performance fix and black complaince

ff03ea1

gjm174 force-pushed the Performance-fix branch from 13da711 to ff03ea1 Compare July 29, 2024 10:10

RubenImhoff approved these changes Jul 29, 2024

View reviewed changes

RubenImhoff merged commit 917c83b into master Aug 1, 2024
10 checks passed

ladc deleted the Performance-fix branch August 20, 2024 14:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Performance fix #410

Performance fix #410

gjm174 commented Jul 24, 2024

codecov bot commented Jul 24, 2024 •

edited

Loading

dnerini left a comment

dnerini Jul 24, 2024

dnerini Jul 26, 2024

RubenImhoff Jul 29, 2024

ladc Aug 20, 2024

dnerini Jul 24, 2024

RubenImhoff Jul 29, 2024

gjm174 commented Jul 25, 2024

dnerini commented Jul 26, 2024

RubenImhoff commented Jul 26, 2024

RubenImhoff left a comment

RubenImhoff Jul 29, 2024

RubenImhoff Jul 29, 2024

RubenImhoff Jul 29, 2024

Performance fix #410

Performance fix #410

Conversation

gjm174 commented Jul 24, 2024

codecov bot commented Jul 24, 2024 • edited Loading

Codecov Report

dnerini left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

gjm174 commented Jul 25, 2024

dnerini commented Jul 26, 2024

RubenImhoff commented Jul 26, 2024

RubenImhoff left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

codecov bot commented Jul 24, 2024 •

edited

Loading