Convergence monitors #5590

jakobtorben · 2024-09-10T09:02:06Z

Implement Convergence Monitoring

This PR introduces a new convergence monitoring feature to improve the robustness and efficiency of the simulator. It is based on the following publication:

Lie, K., Moyner, O., Klemetsdal, Ø., Skaflestad, B., Moncorgé, A., & Kippe, V. (2024). Enhancing Performance of Complex Reservoir Models via Convergence Monitors. ECMOR, 2024(1), 1-9. (https://doi.org/10.3997/2214-4609.202437057)

The convergence monitoring system tracks the convergence behaviour across iterations, applying penalties for non-convergence. If the total penalty count exceeds the specified cut-off limit, the simulator will cut the timestep.

This feature allows an early exit from timesteps that are not converging, which avoids wasted iterations and assembly.

This is a first version; it will be iterated on before it is considered for merging.

jakobtorben · 2024-09-10T09:03:51Z

This first version implements the following convergence monitors:

Distance decay: Define the distance from convergence as a vector with entries d_i = max(log(r_i), 0), one for each convergence metric r_i of the reservoir. Compute the L1 norm of the distance vector and add a penalty card if the current norm is greater than the previous norm multiplied by a decay factor σ (default 0.75): d^k > σ d^(k−1), where d^k is the distance norm at iteration k.
Degradation of reservoir metrics: Add a penalty card for each metric (here CNV and MB) that has increased since the previous iteration.
Unconverged wells: Add a penalty card if there are unconverged wells.

If the total number of penalty cards is above a given cut-off limit (default 30), cut the timestep.
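The three monitors and the cut-off check can be sketched roughly as below. This is a minimal illustration, not the code in this PR: `IterationMetrics`, `distanceNorm`, `penaltyCards` and `shouldCutTimestep` are hypothetical names, the logarithm is assumed to be base 10, and the cards are assumed to accumulate over the iterations of a single timestep.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical record of one nonlinear iteration (not an OPM type).
struct IterationMetrics {
    std::vector<double> reservoir; // positive metric values r_i, e.g. CNV and MB
    int unconvergedWells;          // number of wells that are not converged
};

// L1 norm of the distance vector d_i = max(log(r_i), 0).
// Assumptions: base-10 logarithm and r_i > 0.
inline double distanceNorm(const std::vector<double>& r)
{
    double sum = 0.0;
    for (const double ri : r) {
        sum += std::max(std::log10(ri), 0.0);
    }
    return sum;
}

// Number of penalty cards earned by iteration `curr` relative to `prev`.
inline int penaltyCards(const IterationMetrics& prev,
                        const IterationMetrics& curr,
                        const double decayFactor = 0.75)
{
    assert(prev.reservoir.size() == curr.reservoir.size());
    int cards = 0;

    // Distance decay: the distance norm did not shrink by the decay factor.
    if (distanceNorm(curr.reservoir) > decayFactor * distanceNorm(prev.reservoir)) {
        ++cards;
    }

    // Degradation: one card per metric that increased since the last iteration.
    for (std::size_t i = 0; i < curr.reservoir.size(); ++i) {
        if (curr.reservoir[i] > prev.reservoir[i]) {
            ++cards;
        }
    }

    // Unconverged wells: one card if any well is unconverged.
    if (curr.unconvergedWells > 0) {
        ++cards;
    }
    return cards;
}

// True if the cards accumulated over the iteration history exceed the cut-off.
inline bool shouldCutTimestep(const std::vector<IterationMetrics>& history,
                              const int cutoff = 30,
                              const double decayFactor = 0.75)
{
    int total = 0;
    for (std::size_t k = 1; k < history.size(); ++k) {
        total += penaltyCards(history[k - 1], history[k], decayFactor);
        if (total > cutoff) {
            return true;
        }
    }
    return false;
}
```

With the defaults above, a timestep that stagnates would collect a few cards per iteration and be cut once the running total passes 30, while a steadily converging step collects none.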

I tested this on Norne, where we observe a slight decrease in nonlinear and linear iterations. Other cases are probably better suited, since Norne does not fail much to begin with. (Ignore the zero wasted iterations; I am not yet exiting gracefully on the timestep cut.)

akva2 · 2024-09-10T10:43:56Z

opm/simulators/timestepping/ConvergenceReport.hpp

+            }
+
+            template <typename Serializer>
+            void serializeOp(Serializer& serializer)


Maybe this will change, but I don't see any situation where this needs to be serialized. It is obviously not part of the initial Eclipse state broadcast, and since restart serialization happens at report steps there is also no reason to serialize it there, as AFAICT this is reset at each nonlinear iteration, right?

jakobtorben · 2024-09-10

You are perhaps correct. I just followed the structure of the other classes, but was unsure whether I actually needed this part. Subject to change, my plan is that each report carries a penalty card, so that at the end of the simulation I can add the counts to the INFOITER file (or similar) for analysis. Do I need the serializer for that?

bska · 2024-09-20

Just picking up this point now, sorry for the delayed response.

> I don't see any situation where this needs to be serialized,

I switched to using the serializeOp() protocol for the ConvergenceReport object in commit 0b40277 (PR #5338). As long as this information is intended for output to the .INFOITER file, it needs to have a serializeOp() member function.
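For readers unfamiliar with the pattern, a serializeOp() member simply hands each data member to a serializer object, so the same function serves both packing and unpacking. The sketch below assumes a hypothetical `PenaltyCard` struct with made-up field names, and uses toy `ToyPacker`/`ToyUnpacker` classes as stand-ins; none of these are the actual OPM types.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical penalty-card record; the field names are illustrative only.
struct PenaltyCard {
    int distanceDecay = 0;
    int metricIncrease = 0;
    int unconvergedWells = 0;

    int total() const
    {
        return distanceDecay + metricIncrease + unconvergedWells;
    }

    // Visit every member with the serializer, in a fixed order.
    template <typename Serializer>
    void serializeOp(Serializer& serializer)
    {
        serializer(distanceDecay);
        serializer(metricIncrease);
        serializer(unconvergedWells);
    }
};

// Toy stand-in for a packing serializer: appends each value to a buffer.
class ToyPacker {
public:
    void operator()(int& value) { buffer_.push_back(value); }
    const std::vector<int>& buffer() const { return buffer_; }

private:
    std::vector<int> buffer_;
};

// Toy stand-in for an unpacking serializer: reads values back in order.
class ToyUnpacker {
public:
    explicit ToyUnpacker(const std::vector<int>& buffer) : buffer_(buffer) {}

    void operator()(int& value)
    {
        assert(pos_ < buffer_.size());
        value = buffer_[pos_++];
    }

private:
    std::vector<int> buffer_;
    std::size_t pos_ = 0;
};
```

Calling `card.serializeOp(packer)` on one object and `other.serializeOp(unpacker)` on another copies the counts across, which is the round trip that output of this information would rely on.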

jakobtorben · 2024-09-12T08:59:20Z

In the second version, the implementation should match the paper, provided that the tolerance for adding a card for a too-large well residual is the same as the OPM default for considering a well unconverged. The reporting has also been fixed, so that the wasted iterations caused by convergence monitoring are counted. (This fix also involved making sure that failed iterations from NaN and too-large-residual errors are counted.)

When tested on Norne, the results are currently similar to those without convergence monitoring.

jakobtorben · 2024-09-17T09:07:33Z

Fixed some bugs and added the penalty counts to the INFOITER file for analysis.

The INFOITER file was used to analyse the convergence behaviour and cut-off values, using a tool similar to the one in the paper.

With this tool we can also estimate the number of iterations saved by convergence monitoring, which can be used to find the optimal parameters for a specific case:

[Figure: Norne, fraction of iterations remaining after early exit]

And the number of incorrectly aborted timesteps:

[Figure: Norne, number of incorrectly aborted timesteps]

The optimal parameters were found at a cut-off of 14 and a distance decay factor of 0.60, which should give an estimated number of Newton iterations of 0.989 times the original count. The small reduction is likely because Norne does not fail much to begin with in OPM.
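One way such an estimate could be computed from recorded penalty counts is sketched below. This is not the analysis tool itself: `StepAttempt`, `EarlyExitEstimate` and `estimateEarlyExit` are hypothetical names, and the sketch makes the simplifying assumption that an incorrectly aborted timestep keeps its original iteration count, since the cost of redoing it with a shorter step is not modelled.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical record of one timestep attempt, as read from a log.
struct StepAttempt {
    std::vector<int> penaltiesPerIteration; // cards added in each iteration
    bool converged;                         // did the attempt converge?
};

struct EarlyExitEstimate {
    double fractionRemaining; // iterations with early exit / original iterations
    int incorrectlyAborted;   // converged attempts that would have been cut
};

inline EarlyExitEstimate estimateEarlyExit(const std::vector<StepAttempt>& attempts,
                                           const int cutoff)
{
    assert(cutoff >= 0);
    std::size_t original = 0;
    std::size_t remaining = 0;
    int incorrect = 0;

    for (const StepAttempt& attempt : attempts) {
        const std::size_t n = attempt.penaltiesPerIteration.size();
        original += n;

        // Number of iterations performed before the running total
        // of penalty cards exceeds the cut-off (n if it never does).
        std::size_t exitAfter = n;
        int total = 0;
        for (std::size_t k = 0; k < n; ++k) {
            total += attempt.penaltiesPerIteration[k];
            if (total > cutoff) {
                exitAfter = k + 1;
                break;
            }
        }

        if (attempt.converged) {
            // Cutting before the final iteration would abort a good step.
            if (exitAfter < n) {
                ++incorrect;
            }
            remaining += n; // simplification: cost of the redo is not modelled
        } else {
            remaining += exitAfter; // failed attempt exits early
        }
    }

    const double fraction = original > 0
        ? static_cast<double>(remaining) / static_cast<double>(original)
        : 1.0;
    return EarlyExitEstimate{fraction, incorrect};
}
```

Sweeping `cutoff` (and, with per-monitor logs, the decay factor) over a grid and comparing the two outputs is the kind of trade-off the figures above illustrate: fewer remaining iterations against more incorrectly aborted timesteps.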

Using these optimal parameters, we can run OPM with convergence monitoring to see whether we get any improvement. Here the run uses relaxed CNV and MB tolerances equal to the original tolerances, to match the format used in the analysis tool.

From the results, we see a small reduction in Newton iterations and runtime, as expected from the analysis. Better results are likely on cases with more failed timesteps.

jakobtorben · 2024-10-03T06:59:47Z

Depends on OPM/opm-common#4244.

jakobtorben · 2024-10-03T07:05:30Z

Note that the well convergence metric is not used at the moment. The plan is to include well convergence metrics in the convergence monitoring as well, but this requires two things:

WellConvergenceMetric must be extended to multi-segment wells.
Logic must be added to handle the number of wells increasing during the simulation, which matters when comparing the number of unconverged residuals with the previous iteration.

jakobtorben · 2024-10-03T07:07:52Z

jenkins build this opm-common=4244 please

atgeirr

Nice, just some small fry to fix.

opm/simulators/timestepping/ConvergenceReport.hpp

opm/simulators/timestepping/AdaptiveTimeStepping.hpp

opm/simulators/flow/BlackoilModel.hpp

jakobtorben · 2024-10-03T16:05:26Z

jenkins build this please

atgeirr · 2024-10-04T06:20:58Z

All good, all green! Merging.

akva2 reviewed Sep 10, 2024

jakobtorben force-pushed the convergence_monitors branch from e134c32 to fa8564a September 13, 2024 13:56

jakobtorben mentioned this pull request Oct 3, 2024

Add exception for convergence monitoring OPM/opm-common#4244

Merged

jakobtorben added 13 commits October 3, 2024 08:56

Add object to keep track of penalty cards for convergence monitoring

988ca3a

Add penalty card for increase in convergence metrics

861cfee

Add well convergence metric

56da845

Add penalty for unconverged well

e1e577f

Reset the total penalty card after each nonlinear iteration

3e6f9c5

Add distance decay penalty

d476a32

Cut timestep if penlaty exceeds limit

5f17c9d

Register convergence monitoring parameters

cda47a6

Change non-convergence monitor to checking if number of non-converged…

eef0ba5

… has increased

Store report of failed step before cutting from convergence monitoring

6d53daa

Ensure correct propagation of failed report

71a64fb

Write penalty count to infoiter file

6ff83e1

Fix calculation of distance

ff20c1f

jakobtorben force-pushed the convergence_monitors branch from fa8564a to ff20c1f October 3, 2024 07:07

jakobtorben marked this pull request as ready for review October 3, 2024 07:07

atgeirr self-assigned this Oct 3, 2024

atgeirr reviewed Oct 3, 2024

PR review changes

b830208

atgeirr merged commit 9654215 into OPM:master Oct 4, 2024
1 check passed

Review comments: akva2 Sep 10, 2024; jakobtorben Sep 10, 2024; bska Sep 20, 2024