Implementation of Zanna-Bolton-2020 equation discovery model of mesoscale momentum fluxes. Combined commits. #356
Conversation
Codecov Report
@@ Coverage Diff @@
## dev/gfdl #356 +/- ##
============================================
- Coverage 38.38% 38.22% -0.16%
============================================
Files 268 269 +1
Lines 76021 76359 +338
Branches 13987 14025 +38
============================================
+ Hits 29183 29191 +8
- Misses 41601 41929 +328
- Partials 5237 5239 +2
... and 1 file with indirect coverage changes
Thank you very much for this revised commit. After this passes our regression testing (which I am confident it will because this code is not yet being exercised by those tests), we will be able to merge this into dev/gfdl.
Everything here seems correct after my visual inspection of the code. However, it looks to me like this new contribution is using a very large number of halo updates and global reductions, which may make this code inefficient on some large PE-count applications. Specifically, for every model layer, there are 44 calls to get global minimum or maximum values and a total of (22 + 4 x ssd_iter + 6 x HPF_iter x HPF_order + 6 x LPF_iter x LPF_order) halo updates. If, for example, one of these is a Laplacian filter and the other biharmonic, and we use 3 iterations on each step, in a 75 layer model, this will be a total of (88 x 75) = 6600 blocking halo updates and (44 x 75) = 3300 global reductions per call to Zanna_Bolton_2020. For comparison, the barotropic solver is usually the part of the model that has the worst scaling in MOM6, and it typically has about 50 blocking halo updates and might have 1 global reduction (to determine the maximum stable timestep) per call. These blocking halo update calls can be relatively expensive, depending on your configuration and on your computer.
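(For concreteness, reading "3 iterations on each step" as ssd_iter = HPF_iter = LPF_iter = 3, with a first-order Laplacian filter and a second-order biharmonic filter, the per-layer count in that example works out to 22 + 4x3 + 6x3x1 + 6x3x2 = 22 + 12 + 18 + 36 = 88 halo updates per layer, which is where the factor of 88 above comes from.)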
The good news is that the answers would not change if we were to refactor this code (e.g., with some combination of 3-d arrays, wide halos or grouped calculations) to reduce the number of blocking halo updates. The bad news is that such refactoring could take a lot of work. My recommendation is that we should take in this code now, and defer this restructuring and refactoring until someone finds a case where their model becomes unacceptably slow when ZB2020 is turned on. The one thing, though, that I think would be a good idea right now is to add cpu-clock timers (calls to cpu_clock_begin() and cpu_clock_end()) around the call to Zanna_Bolton_2020 with a CLOCK_ROUTINE granularity, to help users detect if this routine starts taking up too large a fraction of the model's overall run-time.
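A minimal sketch of what such timers could look like, using the cpu-clock interfaces from MOM6's MOM_cpu_clock module (the clock name, the id variable, and the elided argument list below are placeholders, not the actual diff):

```fortran
use MOM_cpu_clock, only : cpu_clock_id, cpu_clock_begin, cpu_clock_end, CLOCK_ROUTINE

integer :: id_clock_ZB2020  ! placeholder module-level clock id

! During initialization of the module:
id_clock_ZB2020 = cpu_clock_id('(Ocean Zanna-Bolton-2020)', grain=CLOCK_ROUTINE)

! Around the call to the parameterization:
call cpu_clock_begin(id_clock_ZB2020)
call Zanna_Bolton_2020(...)  ! actual argument list omitted here
call cpu_clock_end(id_clock_ZB2020)
```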
Hi @Hallberg-NOAA! Some refactoring of the code to improve computational efficiency will indeed be needed. Currently it runs about as fast as the MEKE parameterization in the Double Gyre configuration. A few tests in NeverWorld2 show that using this parameterization can roughly double the runtime. I implemented the filters for a very general case, but a typical configuration of the filter parameters will require around 4 filter passes per stress tensor component. A similar amount of filtering is used in the backscatter parameterization in FESOM (Juricke 2019, Juricke 2020). The reduction operations (min, max) are retained for debugging purposes and will likely be needed only for one test run in each new configuration. I will discuss the computational efficiency and timing of the code with Alistair...
Given that the reduction operations are only used for debugging, they should be wrapped in a logical test of the runtime parameter DEBUG stored in the control structure of this module, as is done for other debugging code elsewhere in the MOM6 code (look for CS%debug). This should be added, along with the CPU timers around the call to Zanna_Bolton_2020, before this is merged into the MOM6 code, as it can be done with a modest number of added lines.
The refactoring to work with 3-d arrays to reduce the number of blocking pass_var() calls could be done in a future refactoring step, as it will take more work and it should not change answers.
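A minimal sketch of that guard, following the CS%debug convention used elsewhere in MOM6 (the array name, message string, and the choice of a checksum call to stand in for the existing min/max reductions are all placeholders):

```fortran
use MOM_debugging, only : hchksum

if (CS%debug) then
  ! Debug-only reductions and checksums live inside this block, so no global
  ! communication is triggered unless DEBUG = True for this module.
  call hchksum(Txx, "ZB2020: Txx before filtering", G%HI, haloshift=0)
endif
```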
Despite some potential performance issues with excessive halo updates and global reductions, this can be taken in now; a separate issue will be opened to address these problems in the future.
These issues will be resolved in a separate PR.
Gaea regression: https://gitlab.gfdl.noaa.gov/ogrp/MOM6/-/pipelines/19630 ✔️ 🟡
Hi @marshallward!
As you suggested, I created a new PR instead of #344.
One possible combination of the namelist parameters: