finalize 0-layer thermo and cesm ponds deprecation in CICE #787
I ran a full test suite on cheyenne with 3 compilers including CICE-Consortium/Icepack#411 and everything passes and is bit-for-bit. This was done with #4935c98 in this PR and #926c338449 in the Icepack PR. Some additional changes were made after that, but I think only to documentation and hs0=0. Do we need to retest? See https://github.com/CICE-Consortium/Test-Results/wiki/cice_by_hash_forks#4935c98e1ff45eb1b7850cea5639028ef636c60d
I recommend we merge the Icepack PR, CICE-Consortium/Icepack#411, when it is ready, then update Icepack in this PR before merging it.
Setting hs0=0 as the default might have changed the answers. If so, I can fix it in the set_nml files, but I'm not sure which ones might have changed -- most likely those with topo ponds. That might just be alt03 -- recompile and test that one? There's alt03 and also a 'pondtopo' option in Icepack. I'm guessing I'll need to add hs0=0.03 to all of those files.
I retested with hs0=0. Ran a nearly full test suite on 3 compilers on cheyenne. The following runs seem to hang and then time out,
All other tests pass and are bit-for-bit vs main. It's also odd that it's just the intel compiler that struggles with a handful of tests that seem to be relatively random. I believe the results are mostly reproducible, but I'm not 100% sure. I don't know what hs0 does, but it's extremely unusual that the code hangs and times out. I'm not sure why that might be happening, again, just for intel for a few cases. How should we proceed? I think we either need to debug this or reset the default hs0 namelist value. Maybe there are other options.
This is very odd, definitely not what I'd expect. Did you run a new baseline at the same time, and it didn't have these problems? The cases with debugging on don't throw errors? This seems like a cheyenne/intel problem to me. I had convinced myself that changing the initial values of hs0 in CICE and Icepack from 0.03 to 0 would be BFB for all tests, since hs0=0 in ice_in and icepack_in. So I don't understand what's going on. My inclination is to reset the default values in the code to 0.03 and slightly modify these new documentation changes to be consistent in this PR, and then revise hs0 behavior as needed after discussion of #635 in a separate PR.
I agree it's odd. It could be a compiler issue. Let me do a bit more testing to confirm results and see if I can understand what's happening better.
I reran the suite with intel and it looks like everything is OK. I had some odd failures the last few days on cheyenne and even saw some repeated failures, but I think everything is OK again. I think we should merge the Icepack PR, update this PR, then merge this if others agree.
That's a relief, thanks for checking. How to best address the hs0 issue #635 isn't resolved yet, so I think it would be better to back out the hs0 initialization changes here (return to original values) but keep the documentation changes, modified to be consistent with the current state of the code. Then fix the hs0 behavior separately.
I agree, back out hs0 changes and fix separately after further discussion. Let me know when the Icepack and CICE PRs are ready again and I can review and/or test as needed. Thanks.
Removes all ifdef options allowing 'undeprecation' of 0-layer thermo and cesm meltponds, and updates the documentation.
@eclare108213 @dabail10