-
Notifications
You must be signed in to change notification settings - Fork 249
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG]: frac_grid input in non-frac mode is not restart reproducible in coupled model at c96,c192 or c384 #227
Comments
To clarify: The existing restart test in ufs-s2s uses c96mx025 (the current default resolution). Originally when we committed CMEPS we wanted to add the restart test for c384mx025 but found an issue on lake_frac points (see Issues ufs-community/ufs-s2s-model#34 and ufs-community/ufs-s2s-model#108). So we switched the restart test to c96mx025. I have not tested the c96mx025 resolution in this branch since we are not carrying it forward as a tested resolution. Testing this branch with C384mx025 resolution using the original input data in FV3_input_data384 does reproduce. It does not reproduce in my testing using the c384mx025 input in FV3_input_frac. |
Denise, thanks for confirming that the C384mx025 restart test using the
original input data in FV3_input_data384 reproduces. So this might be
purely IC issues then
…On Wed, Oct 7, 2020 at 3:23 PM Denise Worthen ***@***.***> wrote:
To clarify:
The existing restart test in ufs-s2s uses c96mx025 (the current default
resolution). Originally when we committed CMEPS we wanted to add the
restart test for c384mx025 but found an issue on lake_frac points (see
Issues ufs-community/ufs-s2s-model#34 <ufs-community/ufs-s2s-model#34> and
ufs-community/ufs-s2s-model#108 <ufs-community/ufs-s2s-model#108>). So we
switched the restart test to c96mx025.
I have not tested the c96mx025 resolution in this branch since we are not
carrying it forward as a tested resolution.
Testing this branch with C384mx025 resolution using the original input
data in FV3_input_data384 does reproduce. It does not reproduce in my
testing using the c384mx025 input in FV3_input_frac.
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<https://github.com/ufs-community/ufs-s2s-model/issues/214#issuecomment-705143972>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AI7D6TNAWXAW3VN7TUWU4BLSJS52JANCNFSM4SHNVHIA>
.
|
I've made this assignment to myself for tracking but I will require assistance from @shansun6 |
Hi Denise,
Yes!
I noticed one difference between frac oro data and the default one at
/scratch1/NCEPDEV/nems/emc.nemspara/RT/FV3-MOM6-CICE5/develop-20200928/FV3_input_data/INPUT/
is that the latter has no lakes. I am running some tests without lakes
(lakefrac=0 but slmsk=0, so it would be a water point for FV3, but CMEPS
and MOM6 are unaware) in the frac oro, and will get back to you this
afternoon.
Thanks,
Shan
…On Thu, Oct 8, 2020 at 6:36 AM Denise Worthen ***@***.***> wrote:
I've made this assignment to myself for tracking but I will require
assistance from @shansun6 <https://github.com/shansun6>
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<https://github.com/ufs-community/ufs-s2s-model/issues/214#issuecomment-705538384>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ALORMVTSMEGUQVNGEG737ADSJWW3LANCNFSM4SHNVHIA>
.
|
Hi Denise, I am able to reproduce at C384mx025 resolution using the original input data, but fail this test when running with frac input. Since it takes a while to run C384, I am starting a test using C96 and with FV3 only without other submodules, as I think the same problem would occur in the FV3 standalone model. How do you like this approach? Thanks, |
I found one bug that prevented the coupled model from restart reproducible: lake ice needs to be saved in the restart file. After fixing this by modifying GFS_surface_composites.F90 & GFS_surface_composites.meta, C96mx025 and C384mx025 can reproduce after restart. The updated code is at https://github.com/shansun6/ufs-s2s-model/tree/bugfix/denise_restart_lakeice, which is based on 64eeba7 of https://github.com/DeniseWorthen/ufs-s2s-model/tree/feature/restart/. However, with this revised code, it still fails at C96mx100 and C192mx050 when restarting at 12h (43200s). Results at 44100s are identical but differences appeared on every tiles at 45000s for both C96mx100 and X192mx050, suggesting it has something to do with coupling with ocean, as atm and ice use 900s time step, and the ocean's time step is 1800s. Attached is a plot of difference in atmlmp_Sa_z at 45000s on the 6 tiles. The fact that differences occurred along lines may indicate something in atm and ocean interface layout? Also coupling with the ocean resolution at mx025 are well tested, while both mx100 and mx050 configurations are new. For more info, see results of all 12 regression tests at /scratch2/BMC/gsd-fv3-dev/Shan.Sun/S2S_RT/rt_157734/. |
Thanks Shan. We've seen this pattern before in the ocean restart. Basically we're seeing the pattern of the ocean grid decomposition with differences arising from the halos. Let me look into it more since we have the right parameter options set for mx025. It is possible that the lower resolution ocean models need different or additional settings. |
Hi Denise,
Thanks for your prompt reply. Good to know that this is not something
totally new. Let me know if you want me to try anything.
Shan
…On Sun, Oct 18, 2020 at 2:29 PM Denise Worthen ***@***.***> wrote:
Thanks Shan. We've seen this pattern before in the ocean restart.
Basically we're seeing the pattern of the ocean grid decomposition with
differences arising from the halos. Let me look into it more since we have
the right parameter options set for mx025. It is possible that the lower
resolution ocean models need different or additional settings.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<https://github.com/ufs-community/ufs-s2s-model/issues/214#issuecomment-711417751>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ALORMVUXQEEQWL5R4ROE2HLSLNFYZANCNFSM4SHNVHIA>
.
|
I've been able to get restart repro for all the test cases by cherry picking an update to MOM6 that is upcoming. It is the commit for Add halo updates needed with VERTEX_SHEAR=True. Substituting a MOM6 branch with that update added to our current emc/develop worked. I did also try setting some of the MOM_input parameter settings for various bugs to their 'non-bug' setting since GFDL has them set for their own reproducibility testing but that had no impact. |
Further testing has shown that the setting of |
@shansun6 Just as a heads-up, I will be moving the restart testing branch to my fork on ufs-weather. |
Hi Denise,
Thanks for the heads-up. Since Moorthi is working on the restart/debug, I
will focus on the benchmark test for frac_grid=T.
Shan
…On Fri, Oct 23, 2020 at 11:04 AM Denise Worthen ***@***.***> wrote:
@shansun6 <https://github.com/shansun6> Just as a heads-up, I will be
moving the restart testing branch to my fork on ufs-weather.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#227 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ALORMVWTAVN7ID2DZAG254DSMGZR7ANCNFSM4SYP55KA>
.
|
The branch I am using for testing is now on my ufs-weather fork feature/cpld_restart. This branch has FV3 updated to the current ufs-weather b955f81. When using a MOM6 branch with the vertex halo fix and When testing the same branch in frac_grid mode ( When testing using PR #238 in non-frac mode, the model fails with saturation vapor pressure errors. |
Thanks for the update. Moorthi has a comprehensive fix for restart to
target these 8 fields on tile3, which should work in both nonfrac and
frac_grid mode.
Moorthi, is it available in your own branch?
Shan
…On Sun, Oct 25, 2020 at 12:22 PM Denise Worthen ***@***.***> wrote:
The branch I am using for testing is now on my ufs-weather fork
feature/cpld_restart
<https://github.com/DeniseWorthen/ufs-weather-model/tree/feature/cpld_restart>
.
This branch has FV3 updated to the current ufs-weather b955f81
<https://github.com/NOAA-EMC/fv3atm/tree/b955f81015b2de589d886e7052feb68485f79466>
.
When using a MOM6 branch with the vertex halo fix and USE_LA_LI2016=False,
I am now getting restart repro with all tested resolutions using frac_grid
input in non-frac mode.
When testing the same branch in frac_grid mode (FRAC_GRID=.T. and
CPLMODE=nems_frac) I find that FV3 does not reproduce on restart with the
same small number of points (a dozen or so) for the same 8 fields on tile3
as previously noted.
When testing using PR #238
<https://github.com/ufs-community/ufs-weather-model/pull/238/files> in
non-frac mode, the model fails with saturation vapor pressure errors.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#227 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ALORMVQVI6VP6PCH4RYTYO3SMRUHVANCNFSM4SYP55KA>
.
|
Denise, can you try my branch:
https://github.com/junwang-noaa/fv3atm/tree/ccppipd with revision 4276ef61,
it has Moorthi's CCPP changes in it. Please do not use the top of the
ccppipd branch ( revision 2f320ece ), I have some failed case withit.
…On Sun, Oct 25, 2020 at 4:27 PM shansun6 ***@***.***> wrote:
Thanks for the update. Moorthi has a comprehensive fix for restart to
target these 8 fields on tile3, which should work in both nonfrac and
frac_grid mode.
Moorthi, is it available in your own branch?
Shan
On Sun, Oct 25, 2020 at 12:22 PM Denise Worthen ***@***.***>
wrote:
> The branch I am using for testing is now on my ufs-weather fork
> feature/cpld_restart
> <
https://github.com/DeniseWorthen/ufs-weather-model/tree/feature/cpld_restart
>
> .
>
> This branch has FV3 updated to the current ufs-weather b955f81
> <
https://github.com/NOAA-EMC/fv3atm/tree/b955f81015b2de589d886e7052feb68485f79466
>
> .
>
> When using a MOM6 branch with the vertex halo fix and
USE_LA_LI2016=False,
> I am now getting restart repro with all tested resolutions using
frac_grid
> input in non-frac mode.
>
> When testing the same branch in frac_grid mode (FRAC_GRID=.T. and
> CPLMODE=nems_frac) I find that FV3 does not reproduce on restart with the
> same small number of points (a dozen or so) for the same 8 fields on
tile3
> as previously noted.
>
> When testing using PR #238
> <https://github.com/ufs-community/ufs-weather-model/pull/238/files> in
> non-frac mode, the model fails with saturation vapor pressure errors.
>
> —
> You are receiving this because you were mentioned.
> Reply to this email directly, view it on GitHub
> <
#227 (comment)
>,
> or unsubscribe
> <
https://github.com/notifications/unsubscribe-auth/ALORMVQVI6VP6PCH4RYTYO3SMRUHVANCNFSM4SYP55KA
>
> .
>
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#227 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AI7D6TN5OBVXLFF36KLCCL3SMSC3LANCNFSM4SYP55KA>
.
|
FYI, restart with my version with "frac_grid=.true." is not reproducing
either.
Moorthi
…On Sun, Oct 25, 2020 at 4:57 PM Jun Wang ***@***.***> wrote:
Denise, can you try my branch:
https://github.com/junwang-noaa/fv3atm/tree/ccppipd with revision
4276ef61,
it has Moorthi's CCPP changes in it. Please do not use the top of the
ccppipd branch ( revision 2f320ece ), I have some failed case withit.
On Sun, Oct 25, 2020 at 4:27 PM shansun6 ***@***.***> wrote:
> Thanks for the update. Moorthi has a comprehensive fix for restart to
> target these 8 fields on tile3, which should work in both nonfrac and
> frac_grid mode.
>
> Moorthi, is it available in your own branch?
>
> Shan
>
> On Sun, Oct 25, 2020 at 12:22 PM Denise Worthen <
***@***.***>
> wrote:
>
> > The branch I am using for testing is now on my ufs-weather fork
> > feature/cpld_restart
> > <
>
https://github.com/DeniseWorthen/ufs-weather-model/tree/feature/cpld_restart
> >
> > .
> >
> > This branch has FV3 updated to the current ufs-weather b955f81
> > <
>
https://github.com/NOAA-EMC/fv3atm/tree/b955f81015b2de589d886e7052feb68485f79466
> >
> > .
> >
> > When using a MOM6 branch with the vertex halo fix and
> USE_LA_LI2016=False,
> > I am now getting restart repro with all tested resolutions using
> frac_grid
> > input in non-frac mode.
> >
> > When testing the same branch in frac_grid mode (FRAC_GRID=.T. and
> > CPLMODE=nems_frac) I find that FV3 does not reproduce on restart with
the
> > same small number of points (a dozen or so) for the same 8 fields on
> tile3
> > as previously noted.
> >
> > When testing using PR #238
> > <https://github.com/ufs-community/ufs-weather-model/pull/238/files> in
> > non-frac mode, the model fails with saturation vapor pressure errors.
> >
> > —
> > You are receiving this because you were mentioned.
> > Reply to this email directly, view it on GitHub
> > <
>
#227 (comment)
> >,
> > or unsubscribe
> > <
>
https://github.com/notifications/unsubscribe-auth/ALORMVQVI6VP6PCH4RYTYO3SMRUHVANCNFSM4SYP55KA
> >
> > .
> >
>
> —
> You are receiving this because you commented.
> Reply to this email directly, view it on GitHub
> <
#227 (comment)
>,
> or unsubscribe
> <
https://github.com/notifications/unsubscribe-auth/AI7D6TN5OBVXLFF36KLCCL3SMSC3LANCNFSM4SYP55KA
>
> .
>
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#227 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ALLVRYSYJSNFPKTX67FKTS3SMSGMTANCNFSM4SYP55KA>
.
--
Dr. Shrinivas Moorthi
Research Meteorologist
Modeling and Data Assimilation Branch
Environmental Modeling Center / National Centers for Environmental
Prediction
5830 University Research Court - (W/NP23), College Park MD 20740 USA
Tel: (301)683-3718
e-mail: Shrinivas.Moorthi@noaa.gov
Phone: (301) 683-3718 Fax: (301) 683-3718
|
Using 64276ef6 from https://github.com/junwang-noaa/fv3atm/tree/ccppipd the model successfully runs. I only checked c96mx100 but it reproduces. In frac grid mode, the atm doesn't reproduce on any tile. |
Denise, are you doing 24h/12h/24h for control/ restart start/restart end or
you are doing 3d/2d/3d restart test?
…On Mon, Oct 26, 2020 at 12:33 PM Denise Worthen ***@***.***> wrote:
Using 64276ef6 from https://github.com/junwang-noaa/fv3atm/tree/ccppipd
the model successfully runs. I only checked c96mx100 but it reproduces.
In frac grid mode, the atm doesn't reproduce on any tile.
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#227 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AI7D6TNS4MY7JFYHQ4H4LNDSMWQEBANCNFSM4SYP55KA>
.
|
I am using 24h/12h/12h->24h in the restart test. |
Can you try a 3d/2d/2d->3d test too?
…On Mon, Oct 26, 2020 at 12:47 PM Denise Worthen ***@***.***> wrote:
I am using 24h/12h/12h->24h in the restart test.
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#227 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AI7D6TJIC72YAYSUF2W4PG3SMWR2VANCNFSM4SYP55KA>
.
|
I can set something like that up, but which exact case would you want me to test? The original c96mx025? That would test this FV3 branch in non-frac mode using the old (non-lake) input c96 input. |
I was testing global restart, it reproduces control for 24/12/12->24. but
not 48/24/24->48. I'd like to check if s2s restart has the same issue.
c96mx025 is OK, I am running with non-frac mode using old oro data(no lake
vars)
…On Mon, Oct 26, 2020 at 1:07 PM Denise Worthen ***@***.***> wrote:
I can set something like that up, but which exact case would you want me
to test? The original c96mx025? That would test this FV3 branch in non-frac
mode using the old (non-lake) input c96 input.
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#227 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AI7D6TPTAJTEJSY7QT63GS3SMWUEJANCNFSM4SYP55KA>
.
|
OK, let me set that up. |
If it reproduces control for 24/12/12->24, but not 48/24/24->48, I'd look
at gcycle first as it is set to kick in every 24hrs.
Shan
…On Mon, Oct 26, 2020 at 11:33 AM Denise Worthen ***@***.***> wrote:
OK, let me set that up.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#227 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ALORMVRCXYY74ORQWEAPCA3SMWXEXANCNFSM4SYP55KA>
.
|
@junwang-noaa That must be why utest passes since it tests 24/12/12->24 |
@shansun6, @MinsukJi-NOAA Yes, that's right.
…On Mon, Oct 26, 2020 at 1:37 PM Minsuk Ji ***@***.***> wrote:
I was testing global restart, it reproduces control for 24/12/12->24. but
not 48/24/24->48. I'd like to check if s2s restart has the same issue.
c96mx025 is OK, I am running with non-frac mode using old oro data(no lake
vars)
@junwang-noaa <https://github.com/junwang-noaa> That must be why utest
passes since it tests 24/12/12->24
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#227 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AI7D6TNK5RFOBIDUZHTPYTLSMWXXHANCNFSM4SYP55KA>
.
|
This is the directory w/ the 2d/3d/1d mx025 restart test using FV3 @b955f810 : /scratch1/NCEPDEV/stmp2/Denise.Worthen/FV3_RT/restart_2d3d The mediator restart for the restart test at the end of the run is identical to the continuous 3d run restart. |
I am currently testing the cpld_restart branch (41586b7). This has FV3 @ develop + the MOM6 halo fix for vertex shear (PR 1221). Using frac_grid input in non-frac mode, I get restart repro at c96mx100, c192mx050, c384mx025 using the 1d/12h/12h test. Testing the same configuration code in frac_grid mode, none of the cases give restart repro. The fields exported by FV3 at the first timestep after restarting are different than the continuous run. Digging further, for the c96mx100 case, only tile3 and tile2 don't reproduce in the frac grid mode. Tile2 doesn't reproduce on two grid points. Both of these tile 2 points have land_frac=1.0. They are (i=59,j=80) and (i=46,j=93). The fields which don't reproduce are: atmImp_Faxa_lwdn |
Hi Denise,
I am looking into it right now. How do output
ufs.cpld.cpl.r.2016-10-03-43200.nc every time step?
Thanks,
Shan
…On Thu, Oct 29, 2020 at 11:55 AM Denise Worthen ***@***.***> wrote:
I am currently testing the cpld_restart branch (41586b7
<41586b7>).
This has FV3 @ develop + the MOM6 fix for the vertex shear.
Using frac_grid input in non-frac mode, I get restart repro at c96mx100,
c192mx050, c384mx025 using the 1d/12h/12h test.
Testing the same configuration code in frac_grid mode, none of the cases
give restart repro. The fields exported by FV3 at the first timestep after
restarting are different than the continuous run.
Digging further, for the c96mx100 case, only tile3 and tile2 don't
reproduce in the frac grid mode. Tile2 doesn't reproduce on a two grid
points. Both of these tile 2 points have land_frac=1.0. They are
(i=59,j=80) and (i=46,j=93). The fields which don't reproduce are:
atmImp_Faxa_lwdn
atmImp_Faxa_lwnet
atmImp_Sa_pbot
atmImp_Sa_shum
atmImp_Sa_tbot
atmImp_Sa_z
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#227 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ALORMVXI5WGT54MGFU6JH7LSNGUB3ANCNFSM4SYP55KA>
.
|
The feature/cpld_restart branch in my fork is set up to output the cpl history files at every timestep. The control is by |
I am using 41586b7 of feature/cpld_restart and the only changes I made are
rt.sh (new RTPWD) and default_vars.sh (cplmode="nems_frac" & frac_grid=T).
When I ran the 3 tests of C96mx100, I see in the output dir that
nems.configure: history_n = 1
nems.configure: history_option = nsteps
but the only ufs file is ufs.cpld.cpl.r.2016-10-03-43200.nc
My source code is at /scratch1/BMC/gsd-fv3-dev/sun/denise_rst_1028/tests/
and the output dir is
/scratch2/BMC/gsd-fv3-dev/Shan.Sun/FV3_RT/rt_108037/.
Another strange thing is the output shows frac_grid= T, but all 3 tests
passed the "current" baselines, which should be made for nonfrac. Any ideas?
Thanks,
Shan
…On Thu, Oct 29, 2020 at 2:46 PM Denise Worthen ***@***.***> wrote:
The feature/cpld_restart
<https://github.com/DeniseWorthen/ufs-weather-model/tree/feature/cpld_restart>
branch in my fork is set up to output the cpl history files at every
timestep. The control is by history_n=1 and history_option=nsteps.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#227 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ALORMVUS7HP3DRUA3AUXXC3SNHICRANCNFSM4SYP55KA>
.
|
The baselines pass because I had to comment out all the comparison files to get the ufs-weather branch to run the dep_run test using ecflow. I think the rt.sh for ufs-weather has an error control that if the control test fails, it does not run the dep_run. Since I'm not comparing against baselines for the restart test I don't care if the baseline fails. The only way I could get ufs-weather to run the dep_run test is to comment out the LIST_FILES. The history files you're looking for are in the RESTART directory. The history files have the name |
Thank you, Denise, for answering my questions. I found "ufs.cpld.cpl.hi"
under /RESTART/ - sorry about that.
…-Shan
On Fri, Oct 30, 2020 at 3:40 AM Denise Worthen ***@***.***> wrote:
The baselines pass because I had to comment out all the comparison files
to get the ufs-weather branch to run the dep_run test using ecflow. I think
the rt.sh for ufs-weather has an error control that if the test control
test fails, it does not run the dep_run. Since I'm not comparing against
baselines for the restart test I don't care if the baseline fails. The only
way I could get ufs-weather to run the dep_run test is to comment out the
LIST_FILES.
The history files you're looking for are in the RESTART directory. The
history files have the name ufs.cpld.cpl.hi. The restart files are the
ufs.cpld.cpl.r.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#227 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ALORMVWL2X5X5WSSSUKJIELSNKCYRANCNFSM4SYP55KA>
.
|
Does 41586b7 point to the right FV3 & ccpp/physics submodule, or should I switch to the right submodules? I noticed ccpp/physics uses Oct. 9 version and doesn't have the fix I put in on Oct. 17? Thanks, |
Yes, b955f81 indeed can reproduce with frac_grid=F and FRAC_GRID_INPUT=T,
despite the fact that it points to ccpp/physics dated on Oct. 9.
This hash failed to reproduce after restart with frac_grid=T on tile2 and
tile3 at 1st time step, all are lake points. The failed points have ice on
tile3 (i=52:53, j=74, total 8 points), but no ice on tile2, only one point
i=46,j=93. I saved "slmsk" in the rsf, but it still cannot reproduce. Will
keep trying.
Thanks,
Shan
…On Fri, Oct 30, 2020 at 2:08 PM Denise Worthen ***@***.***> wrote:
It was my impression that the fix in I think PR #238
<#238>, which had
the changes you are referring to (?), was having trouble reproducing in
Moorthi's longer tests. Since I knew that the non-frac tested as repro-ing
using FV3 b955f81 I went back to that commit as a first test for the frac
grid.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#227 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ALORMVWEDOBNNFBQNZKTFMLSNMMKFANCNFSM4SYP55KA>
.
|
Hi Denise, I am able to get restart reproducible for all tests in rt.cpldrestart.conf, after I switched FV3 submodule to https://github.com/shansun6/fv3atm/ -b fix_frac_rst_20201114 where I made 2 changes, including one from Moorthi. See if you get the same. Thanks, Moorthi! Shan |
Hi Shan, I tried to run with frac_grid =T in the cpld_restart branch. I got compute_qs: saturation vapor pressure table overflow, nbad= 1 My run is here : /scratch1/NCEPDEV/stmp2/Denise.Worthen/FV3_RT/rt_36433/cpld_control_prod I added the changes from your fv3atm branch. Are there other changes I need? |
Hi Denise,
May I look at your source dir? Maybe I miss something.
Thanks,
Shan
…On Mon, Nov 16, 2020 at 4:13 PM Denise Worthen ***@***.***> wrote:
Hi Shan,
I tried to run with frac_grid =T in the cpld_restart branch. I got
compute_qs: saturation vapor pressure table overflow, nbad= 1
My run is here :
/scratch1/NCEPDEV/stmp2/Denise.Worthen/FV3_RT/rt_36433/cpld_control_prod
I added the changes from your fv3atm branch. Are there other changes I
need?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#227 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ALORMVXEA5L7PTMC3SKOJ5TSQGWYHANCNFSM4SYP55KA>
.
|
It is here: /scratch2/NCEPDEV/climate/Denise.Worthen/WORK/ufs_restart It has an updated CICE and some changes I'm working on in CMEPS for normalizing the fluxes coming back from the atm. There is an extra field exported by ATM but nothing is being done w/ it. |
Got it - thanks! -Shan
…On Mon, Nov 16, 2020 at 7:09 PM Denise Worthen ***@***.***> wrote:
It is here: /scratch2/NCEPDEV/climate/Denise.Worthen/WORK/ufs_restart
It has an updated CICE and some changes I'm working on in CMEPS for
normalizing the fluxes coming back from the atm. There is an extra field
exported by ATM but nothing is being done w/ it.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#227 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ALORMVRQL3WVNHWVDOJV4T3SQHLPFANCNFSM4SYP55KA>
.
|
I cannot compile on hera at the moment, due to this error:
ifort: error #10052: could not checkout FLEXlm license
make[2]: *** [CMakeFiles/fms.dir/FMS/mpp/mpp_parameter.F90.o] Error 1
I have written to hera help.
Shan
On Mon, Nov 16, 2020 at 7:40 PM Shan Sun - NOAA Federal <shan.sun@noaa.gov>
wrote:
… Got it - thanks! -Shan
On Mon, Nov 16, 2020 at 7:09 PM Denise Worthen ***@***.***>
wrote:
> It is here: /scratch2/NCEPDEV/climate/Denise.Worthen/WORK/ufs_restart
>
> It has an updated CICE and some changes I'm working on in CMEPS for
> normalizing the fluxes coming back from the atm. There is an extra field
> exported by ATM but nothing is being done w/ it.
>
> —
> You are receiving this because you were mentioned.
> Reply to this email directly, view it on GitHub
> <#227 (comment)>,
> or unsubscribe
> <https://github.com/notifications/unsubscribe-auth/ALORMVRQL3WVNHWVDOJV4T3SQHLPFANCNFSM4SYP55KA>
> .
>
|
Hi Denise,
I can reproduce your crash now. Sorry about that. I have a temporary fix
but don't understand why the earlier try would fail.
Could you please give my fv3 branch of "fix_frac_rst_20201114" another try?
With this fix, I am able to reproduce restart with frac_grid=T. My code is
at /scratch2/BMC/gsd-fv3-dev/sun/denise_rst_ceil/
and my output is at /scratch2/BMC/gsd-fv3-dev/Shan.Sun/FV3_RT/rt_126914/.
Let me know if this makes sense. Thanks,
Shan
On Mon, Nov 16, 2020 at 8:11 PM Shan Sun - NOAA Federal <shan.sun@noaa.gov>
wrote:
… I cannot compile on hera at the moment, due to this error:
ifort: error #10052: could not checkout FLEXlm license
make[2]: *** [CMakeFiles/fms.dir/FMS/mpp/mpp_parameter.F90.o] Error 1
I have written to hera help.
Shan
On Mon, Nov 16, 2020 at 7:40 PM Shan Sun - NOAA Federal ***@***.***>
wrote:
> Got it - thanks! -Shan
>
> On Mon, Nov 16, 2020 at 7:09 PM Denise Worthen ***@***.***>
> wrote:
>
>> It is here: /scratch2/NCEPDEV/climate/Denise.Worthen/WORK/ufs_restart
>>
>> It has an updated CICE and some changes I'm working on in CMEPS for
>> normalizing the fluxes coming back from the atm. There is an extra field
>> exported by ATM but nothing is being done w/ it.
>>
>> —
>> You are receiving this because you were mentioned.
>> Reply to this email directly, view it on GitHub
>> <#227 (comment)>,
>> or unsubscribe
>> <https://github.com/notifications/unsubscribe-auth/ALORMVRQL3WVNHWVDOJV4T3SQHLPFANCNFSM4SYP55KA>
>> .
>>
>
|
Hi Shan, I put in your revision to fv3 and I am now getting restart repro for c96,c192,c384 in frac_grid mode. |
Hi Denise,
Thanks for the good news! I suggest using that for now, unless I find the
root of the problem for "the saturation pressure crash".
Shan
…On Tue, Nov 17, 2020 at 7:23 AM Denise Worthen ***@***.***> wrote:
Hi Shan,
I put in your revision to fv3 and I am now getting restart repro for
c96,c192,c384 in frac_grid mode.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#227 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ALORMVW5OANZZEFSGHTXUETSQKBODANCNFSM4SYP55KA>
.
|
* Add support on NSSL/Odin * Add wlfow_odin.env and modify detect_machine.sh * update comment * Detect explicitly odin1/odin2
Describe the bug
When using the new frac_grid input in non-frac mode, FV3 does not restart repro for up to 8 fields.
When using the original FV3_input_data384, FV3 does restart repro.
To Reproduce
Checkout this branch: https://github.com/DeniseWorthen/ufs-s2s-model/tree/feature/restart
This branch has restart tests at c96mx100, c192mx050 and c384mx025 added, each using a 1d run, a 12h run and then a restart from the 12h run.
The nems.configure has an added mediator history write phase added at each timestep.
The rt.conf has only the 3 restart for each resolution active.
When any one resolution is run, first the 1d and 12h runs complete. The restart test starts after the 12h run completes, using the restart files produced in that run. Comparisons should then be made between the just completed 1d run and the restart run (not the baseline!).
For any given resolution, compare the mediator history file at the first timestep of the restart run against the same timestep of the continuous run, eg:
produces:
These are the 8 fields which are not the same the first time FV3 sends fields to the mediator after restart.
Since the mediator history files contain the atm fields on the mesh, an additional step is required to put them onto a tile grid to view. This ncl script can be used to transfer the mesh fields to the tile for any resolution. The user must point to their own run directory, called
RT
in this script. The tool will produce files such asufs.s2s.cpl.hi.43650.tile3.nc
which can be compared between the 1d and restart runs.To show that the original FV3_input_data384 does reproduce, uncomment the line
in the s2s c384 tests (in tests/tests).
Expected behavior
A clear and concise description of what you expected to happen.
Additional context
Add any other context about the problem here. Directly reference any issues or PRs in this or other repositories that this is related to, nd describe how they are related.
The text was updated successfully, but these errors were encountered: