Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

problem with st_archive and short runs (< 1 month) #594

Closed
bertinia opened this issue Sep 27, 2016 · 10 comments · Fixed by #912, #1837 or #1838
Closed

problem with st_archive and short runs (< 1 month) #594

bertinia opened this issue Sep 27, 2016 · 10 comments · Fixed by #912, #1837 or #1838
Assignees
Milestone

Comments

@bertinia
Copy link
Contributor

bertinia commented Sep 27, 2016

If the short term archiver is run prior to the end of month boundary then it errors out because it can't find the component monthly history file corresponding to the component restart history file.

The st_archive log errors from a F-WACCM case are:

==> f.e15.FWscAMIP.f19_f19.misc08_cam5_4_81.002.st_archive.570708 <==
out=
err=ncdump: nhfil: No such variable
histfiles_savein_rundir ['f.e15.FWscAMIP.f19_f19.misc08_cam5_4_81.002.cam.h0.1979-01.nc', 'f.e15.FWscAMIP.f19_f19.misc08_cam5_4_81.002.cam.h1.1979-01-01-00000.nc', 'f.e15.FWscAMIP.f19_f19.misc08_cam5_4_81.002.cam.h2.1979-01-01-00000.nc']
doing short term archiving for clm (lnd)
removing interim restart file /glade/scratch/mmills/f.e15.FWscAMIP.f19_f19.misc08_cam5_4_81.002/run/f.e15.FWscAMIP.f19_f19.misc08_cam5_4_81.002.clm2.r.1979-01-06-00000.nc
WARNING: ncdump -v locfnh /glade/scratch/mmills/f.e15.FWscAMIP.f19_f19.misc08_cam5_4_81.002/run/f.e15.FWscAMIP.f19_f19.misc08_cam5_4_81.002.clm2.rh0.1979-01-06-00000.nc failed rc=1
out=
err=ncdump: locfnh: No such variable
removing interim restart file /glade/scratch/mmills/f.e15.FWscAMIP.f19_f19.misc08_cam5_4_81.002/run/f.e15.FWscAMIP.f19_f19.misc08_cam5_4_81.002.clm2.rh0.1979-01-06-00000.nc
ERROR: restart file /glade/scratch/mmills/f.e15.FWscAMIP.f19_f19.misc08_cam5_4_81.002/run/f.e15.FWscAMIP.f19_f19.misc08_cam5_4_81.002.clm2.h0.1979-01.nc does not exist

@rljacob rljacob changed the title problem with st_archive and short runs (< 1 month) problem with st_archive and short runs (< 1 month) Sep 28, 2016
@rljacob rljacob added the ready label Sep 28, 2016
@rljacob
Copy link
Member

rljacob commented Oct 7, 2016

Can this be reproduced using CIME only?

@drmikemills
Copy link

drmikemills commented Oct 27, 2016

I am still having this issue, and it also occurs when the run ends on an end-of-month boundary. I am using my copy of Francis’ Phase 3 sandbox, which I have here:

/glade/u/home/mmills/cesm/phase3_n03_cam5_4_90_compsets/

This is the latest case to fail:

/glade/p/work/mmills/case/f.e20.FSDW.f09_f09.phase3_n03_cam5_4_90.02

See f.e20.FSDW.f09_f09.phase3_n03_cam5_4_90.02.st_archive.259967

ERROR: restart file /glade/scratch/mmills/f.e20.FSDW.f09_f09.phase3_n03_cam5_4_90.02/run/f.e20.FSDW.f09_f09.phase3_n03_cam5_4_90.02.clm2.h0.1990-09.nc does not exist

I am not seeing any .clm2.h0. files in the run directory or the short-term archive:

/glade/scratch/mmills/f.e20.FSDW.f09_f09.phase3_n03_cam5_4_90.02/run

/glade/scratch/mmills/archive/f.e20.FSDW.f09_f09.phase3_n03_cam5_4_90.02

@drmikemills
Copy link

I have located the CLM history output. They are in the short-term archive under atm/hist, instead of under lnd/hist. So are the history files for CICE and MOSART.

See:

/glade/scratch/mmills/archive/f.e15.FWscAMIP.f19_f19.misc08_cam5_4_81.002/atm/hist
/glade/scratch/mmills/archive/f.e20.FSDW.f09_f09.phase3_n03_cam5_4_90.02/atm/hist
/glade/scratch/mmills/archive/f.e20.FWAMIP.f09_f09.phase3_n03_cam5_4_90.03/atm/hist

@rljacob rljacob added in progress and removed ready labels Dec 9, 2016
mvertens pushed a commit that referenced this issue Dec 9, 2016
This was referenced Dec 9, 2016
@rljacob rljacob closed this as completed Dec 16, 2016
@bertinia bertinia reopened this Aug 11, 2017
@bertinia
Copy link
Contributor Author

@drmikemills came across another problem with the STA where a cam restart history
file for a 1 day run was not copied correctly into the restart set. The file did remain
in the rundir. Here's the specifics for the case in order to reproduce the problem:

casedir: /glade/p/work/mmills/case/f.e20.FWHIST.f09_f09_mg17.20thC.190_02

I ran for 1 day. The short-term archive is here:

/glade2/scratch2/mmills/archive/f.e20.FWHIST.f09_f09_mg17.20thC.190_02/rest/1970-01-02-00000

This file did not get put in the short-term archive, but it is required for continuing the run:

f.e20.FWHIST.f09_f09_mg17.20thC.190_02.cam.rh0.1970-01-02-00000.nc

Source code: /glade/p/work/hannay/cesm_tags/cesm2_0_alpha07b_clm4_5_16_r251

@bertinia bertinia self-assigned this Aug 11, 2017
@bertinia bertinia added this to the cesm2 milestone Aug 11, 2017
@jedwards4b
Copy link
Contributor

jedwards4b commented Aug 11, 2017

In config_archive.xml I see <rest_file_extension>\.r[sh]\.*</rest_file_extension>
which would not match a file*.cam.rh0.*- should this include a \d?

@mvertens
Copy link
Contributor

mvertens commented Aug 11, 2017 via email

@jedwards4b
Copy link
Contributor

jedwards4b commented Aug 11, 2017

To be clear I think that this needs to be two seperate entries:

 <rest_file_extension>\.rh\d\.*</rest_file_extension>
 <rest_file_extension>\.rs\.*</rest_file_extension>

@mvertens
Copy link
Contributor

mvertens commented Aug 11, 2017 via email

@jedwards4b
Copy link
Contributor

See my previous comment.

<rest_file_extension>\.rh\d\.*</rest_file_extension>
<rest_file_extension>\.rs\.*</rest_file_extension>

@bertinia
Copy link
Contributor Author

bertinia commented Aug 12, 2017 via email

jgfouca added a commit that referenced this issue Aug 25, 2017
Add irt test

Adds IRT (Interum Restart Test) using the system_test_compare_two methodology
This test revealed a bug in the driver config_archive.xml file which I also fixed.
Changed cime_developer tests from ERS to IRT

Test suite: scripts_regression_tests.py
Test baseline:
Test namelist changes:
Test status: bit for bit

Fixes #594
Fixes #1731
User interface changes?:

Update gh-pages html (Y/N)?:

Code review:
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment