-
Notifications
You must be signed in to change notification settings - Fork 150
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Test SMS_Ld2.ne30pg3_t232.BMT1850.derecho_gnu.allactive-defaultio Fails #1116
Comments
Some diagnostics: The surface pressure (Sa_pslv) passed to the coupler from the atmosphere has nan's in it. The pressure values are reasonable at initialization (after reading in the initial condition file) but the model fails before completing a time-step (i.e. 1st call to coupler). |
I think that maybe the psl field is not being initialized before the first call to atm_import_export.F90 I wonder if the solution might be to just skip the nan check on the first step? |
sea-level pressure is computed in cam_diagnostics
This code is executed from physpkg in tphys_bc
before the call that populates cam%out variables for the coupler
Long story short; PSL should not have nans in it. Could it be that PSL(ncol+1:pcols) is not initialized to zero? (I doubt that is an issue!) @brian-eaton From the ChangeLog it looks like PSL was moved to the physics_buffer back in 2018
Do you have an idea of what is going on? |
@jedwards4b, @PeterHjortLauritzen. During CAM's initialization Do we know whether the nans are trapped during initialization rather than during the run phase? Is is possible that regridding to t232 is causing the problem? |
It's working fine for the intel compiler. There is a newer version of the gnu compiler available, I'm testing that now. This was found with gcc/12.2.0 I am now testing gcc/13.2.0 |
Same error with gcc/13.2.0 /glade/derecho/scratch/jedwards/SMS_Ld2.ne30pg3_t232.BMT1850.derecho_gnu.allactive-defaultio.20240807_154234_oazzge |
It turns out that @PeterHjortLauritzen and I made the same mistake in concluding that In the Peter, @adamrher , what do you think? I'm guessing it's best to leave it where it is since it would appear to mainly be a cam diagnostic field. I'm not sure why other components want to know the value of the surface pressure extrapolated to sea level... |
The MOM ocn model wants it - but it does not use the first timestep. |
@brian-eaton I think we can initilaize My preference is to preserve the relative ordering of all the routines in the original (not cam7) physpkg.F90. That requires However, looking at the code, I guess it never occurred to me that |
OK I've made that way too complicated. I'm advocating we move the call to |
If we move the call to |
Good point, the I don't know if this will create greater than round off differences, so I should probably do a validation run with this mod (the ensemble consistency test would be helpful here!). |
Jim mentioned that MOM uses |
The regression tests confirm that F compsets using cam7 are BFB with this change. The impact on the B compsets using cam7 will be due to removing the one timestep lag that was present in the |
Great, I guess the land model isn't using |
What happened?
20240725 160423.008 ERROR PET5344 /glade/u/home/fischer/code/cesm3_0_alpha02b/components/cmeps/cime_config/../mediator/med_methods_mod.F90:2628 ERROR: 4 nans found in Sa_pslv
20240725 160423.008 ERROR PET5344 /glade/u/home/fischer/code/cesm3_0_alpha02b/components/cmeps/cime_config/../mediator/med_methods_mod.F90:2633 ABORTING JOB
What are the steps to reproduce the bug?
Run the test
What CAM tag were you using?
cam6_4_016
What machine were you running CAM on?
CISL machine (e.g. cheyenne)
What compiler were you using?
GNU
Path to a case directory, if applicable
No response
Will you be addressing this bug yourself?
Yes
Extra info
I'll try in debug mode and see if I get any more info.
Note that this test works with intel.
The text was updated successfully, but these errors were encountered: