
Model verification tests. #81

Closed · ali-ramadhan opened this issue Feb 24, 2019 · 14 comments

@ali-ramadhan (Member)

This has been discussed quite a bit, but beyond unit tests we also need model verification tests that ensure the model output is mathematically and scientifically correct, by comparison with known analytic solutions or statistics.

The only one we've done so far is confirming the existence of steady-state convection rolls in Rayleigh–Bénard convection at Ra=5000 with an aspect ratio of 6. We can start with basic tests like isotropic diffusion and build up to more complicated stuff like slantwise convection.

@edoddridge has a lot of ideas regarding this, he might be a good person to talk to here.

@ali-ramadhan (Member, Author)

Some ideas from @christophernhill and @edoddridge:

  • Barotropic gyre (requires β effect).
  • Frontal collapse.
  • Chimney collapse.

@edoddridge

To my mind there are three main ways to approach this:

  1. design simulations such that the output can be directly compared with analytical solutions;
  2. design simulations such that the statistics of the output can be compared with theory; or
  3. design simulations to mimic published results.

Option 1 restricts us to systems with tractable analytical solutions, but still contains a wealth of feasible problems, such as:

  1. Munk gyre
  2. Spin-down of a flow field under the influence of friction
  3. Thermal wind balance: specify a density structure and compare the model velocity field with the analytical solution (a minimal check is sketched after this list)
  4. Rayleigh–Bénard convection (as mentioned previously)
  5. Onset of baroclinic instability: compare growth rates with analytical predictions
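
As a concrete illustration of item 3, a minimal thermal wind check could look like the sketch below. It is plain Julia with made-up parameter values and a fabricated stand-in for model output, not the Oceananigans.jl API.

```julia
using Test

f = 1e-4      # Coriolis parameter [s⁻¹]
g = 9.81      # gravitational acceleration [m s⁻²]
α = 1e-8      # prescribed density structure ρ(y) = ρ₀ (1 - α y) [m⁻¹]

# Thermal wind balance, f ∂u/∂z = -(g/ρ₀) ∂ρ/∂y, predicts a constant shear:
shear_analytic = g * α / f

# `model_u` stands in for the equilibrated model velocity profile; a real
# test would read this from the model's output fields.
zs = range(-1000, 0, length = 65)
model_u = shear_analytic .* (zs .- zs[1])

# The diagnosed model shear should match the analytic prediction.
model_shear = diff(model_u) ./ step(zs)
@test all(isapprox.(model_shear, shear_analytic; rtol = 1e-6))
```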

Option 2 lets us explore dynamics in the presence of turbulence. Potential test cases here include:

  1. 2D QG turbulence: explore energy and enstrophy cascades
  2. lee wave generation and breaking (will require large-scale flow field and bathymetry)

Option 3 lets you do whatever you want - you just need to find a published result and try to reproduce it. It's unlikely that you'll get the exact same answer, so this option is more difficult to implement in a testing framework that doesn't require eyeballs to validate.

@glwagner
Copy link
Member

What is the intent of these tests? Are they intended to be included in CI, or are they more in the style of “benchmarks” that are run relatively infrequently?

@edoddridge A twist on option 1 is to design a forcing that exactly cancels the nonlinear and linear terms in a given equation for some simple initial condition consisting of sines and cosines. For example, pick an initial condition, calculate all the terms in a given equation, and then write a forcing that exactly cancels those terms. Then check that the initial condition doesn't change after a few time steps. This method allows a test at low resolution with low computational burden, and allows each nonlinear and linear term in each equation to be assessed separately.
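
To make that concrete, here is a minimal sketch of the idea for a 1D advection–diffusion equation, ∂c/∂t = -U ∂c/∂x + κ ∂²c/∂x² + F. It is plain Julia with illustrative numbers, not the Oceananigans.jl interface; the key point is that the forcing cancels the *discrete* tendency of the initial condition, so the solution should not move at all.

```julia
using Test

U, κ = 0.5, 1e-2          # advection speed and diffusivity (illustrative)
N, L = 128, 1.0
Δx = L / N
x  = Δx .* (0:N-1)
Δt = 1e-4

c  = sin.(2π .* x ./ L)   # simple sinusoidal initial condition
c₀ = copy(c)

# Periodic centered differences for the advective and diffusive terms.
∂x(f)  = (circshift(f, -1) .- circshift(f, 1)) ./ (2Δx)
∂²x(f) = (circshift(f, -1) .- 2f .+ circshift(f, 1)) ./ Δx^2

tendency(f) = -U .* ∂x(f) .+ κ .* ∂²x(f)

# Forcing designed to exactly cancel the discrete tendency of c₀.
F = -tendency(c₀)

# A few forward-Euler steps: the solution should not move.
for _ in 1:10
    c .+= Δt .* (tendency(c) .+ F)
end

@test all(isapprox.(c, c₀; atol = 1e-12))
```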

It would also be good to run “benchmarks” that are designed to be run less frequently, a category I think some of the suggested tests fall into(?). Is the algorithm in Oceananigans.jl identical to some configuration of MITgcm? If so, that opens the possibility of comparing a solution grid point for grid point.

@edoddridge

That's a good point @glwagner - I think it's a mix of CI tests and more involved experiments. Some of these can be easily integrated into CI since they are very cheap to run. A one-level wind-driven gyre should take only a few minutes of run time to equilibrate sufficiently.

Full simulations from option 3 are almost certainly too heavy for CI (unless there are plans afoot to use external resources for CI). These sorts of simulations are more likely to be run occasionally and interrogated by real eyeballs. Having said that, you could set up CI to run a few time steps and compare the output with blessed output (sketched after the list below) - this is what MITgcm does for its CI tests. This comes with a couple of advantages:

  • the tests are useful setups for people to start using; and
  • because they run regularly (for at least a few time steps), you know when the examples break
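
For illustration, a blessed-output check might look like the sketch below. `run_test_case` and the stored fields are hypothetical placeholders; only `Test` and `Serialization` are standard-library Julia.

```julia
using Test, Serialization

# Placeholder for a few time steps of a model run; a real test would invoke
# the model here (this is not an Oceananigans.jl API).
run_test_case() = Dict("u" => [0.1, 0.2, 0.3], "T" => [20.0, 19.5, 19.0])

# One-time "blessing" step: a human inspects the output and stores it.
blessed_file = joinpath(mktempdir(), "barotropic_gyre_blessed.jls")
serialize(blessed_file, run_test_case())

# The recurring CI test: rerun and compare against the blessed output with a
# relative tolerance, since machine, compiler, and optimisation level all
# perturb the bits (as noted below for MITgcm).
blessed = deserialize(blessed_file)
fields  = run_test_case()

for name in keys(blessed)
    @test isapprox(fields[name], blessed[name]; rtol = 1e-10)
end
```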

I like your idea of designing a forcing that exactly cancels the expected tendencies. It is a more rigorous test than "is the output the same as it was when I decided it was correct?".

> Is the algorithm in Oceananigans.jl identical to some configuration of MITgcm? If so, that opens the possibility of comparing a solution grid point for grid point.

This might work, but you'll need to decide how closely it should match. You definitely won't get machine precision matches - we can't even do that with different MITgcm runs. The output from MITgcm depends on the machine, the compiler, and the optimisation level.

@glwagner (Member)

> Having said that, you could set up CI to run a few time steps and compare the output with blessed output - this is what MITgcm does for its CI tests.

Indeed --- the test of the Golden Master! That sounds like an excellent idea for Oceananigans.jl. No master is more golden than MITgcm.

> This might work, but you'll need to decide how closely it should match. You definitely won't get machine precision matches - we can't even do that with different MITgcm runs. The output from MITgcm depends on the machine, the compiler, and the optimisation level.

Touché. I was naive.

@glwagner (Member)

I'm going to start with some simple analytical solution tests (perhaps heat/salinity diffusion) until #73 is resolved. Then we can begin on the 'designed forcing' CI tests.
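
For the record, a diffusion test of that flavour might look like this sketch: the exact solution for a single cosine mode is c(x, t) = exp(-κ k² t) cos(k x), and the time-stepped field is compared against it. Plain Julia with a forward-Euler stand-in for the model; parameter values are illustrative.

```julia
using Test

κ, L, N = 1.0, 1.0, 128
Δx = L / N
x  = Δx .* (0:N-1)
k  = 2π / L
Δt = 0.2 * Δx^2 / κ      # comfortably inside the diffusive stability limit

c = cos.(k .* x)
∂²x(f) = (circshift(f, -1) .- 2f .+ circshift(f, 1)) ./ Δx^2

nsteps = 100
for _ in 1:nsteps
    c .+= Δt .* κ .* ∂²x(c)
end

# Compare against the exact decaying solution; the tolerance allows for the
# first-order time stepping and second-order spatial discretization.
c_exact = exp(-κ * k^2 * nsteps * Δt) .* cos.(k .* x)
@test all(isapprox.(c, c_exact; atol = 1e-3))
```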

@ali-ramadhan (Member, Author)

We now have some simple boundary condition physics tests thanks to PR #118 and a couple of golden master tests in PR #140. #136 has some more specific physics tests that will be implemented.

We should either come up with some concrete goals for this issue to become resolvable, or we should close it and rely on it for inspiration in designing future tests.

@ali-ramadhan (Member, Author) commented Mar 29, 2019

Closing as this issue is not resolvable but will serve as inspiration for future tests.

@edoddridge

Are you planning to open targeted issues for (some of) the tests discussed here?

@ali-ramadhan (Member, Author)

Yes. We've implemented some simple tests recently (#126, #118, #140) and we're working on #136, but we should definitely do more.

Some tests will require a channel model, which we're working towards (#100), but others you've suggested should work in doubly periodic domains, I think, e.g. spin-down of a flow field under the influence of friction, thermal wind balance, and Rayleigh–Bénard convection. Might be good to open issues for those.
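
For instance, the spin-down case reduces to du/dt = -r u, with exact solution u(t) = u₀ exp(-r t). A sketch of the check, with forward Euler standing in for the model and made-up parameter values:

```julia
using Test

# Forward-Euler stand-in for a model velocity decaying under linear
# (Rayleigh) friction, du/dt = -r u. A real test would step the model.
function spin_down(u₀, r, Δt, nsteps)
    u = u₀
    for _ in 1:nsteps
        u -= Δt * r * u
    end
    return u
end

r, u₀, Δt, nsteps = 1e-5, 0.1, 60.0, 1_000

u_model = spin_down(u₀, r, Δt, nsteps)
u_exact = u₀ * exp(-r * nsteps * Δt)

# First-order time stepping limits how tightly the exact solution is matched.
@test isapprox(u_model, u_exact; rtol = 1e-3)
```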

@edoddridge

> Might be good to open issues for those.

Unless your brain is much better at remembering things than mine, I'd generally recommend doing that before closing the vague issue.

@ali-ramadhan (Member, Author)

Good point. I just have anxiety about the ever-increasing number of open issues :(

@glwagner (Member)

My 2 cents: I think it’s fine to plan to refer back to this closed issue if, at some point in the future, we’d like to create many more verification tests.

Personally I think we should keep the number of verification tests from ballooning — they will become a burden to maintain as we evolve the API. This code is young! There’s a lot of progress to make.

@ali-ramadhan (Member, Author) commented Mar 29, 2019

Woah, never thought I'd see the day that @glwagner advocates for fewer tests!

I think it's good to keep these things well-documented so I split these tests into three issues.

I agree we shouldn't implement all of these right now, so I'm going to create a far future milestone to categorize issues that we'd like to work on as the code matures.
