Automate update of consistency test baseline data #574

GeorgeGayno-NOAA · 2021-08-24T13:18:34Z

Code updates often change results, which means the consistency test baseline data must be updated. Currently, updates are done manually on five machines. But as the number of weekly PRs grows, we need an automated way to do this.

GeorgeGayno-NOAA · 2021-08-24T13:21:45Z

I think @junwang-noaa has done this for the ufs-weather-model.

kgerheiser · 2021-09-13T19:32:52Z

Wouldn't be too difficult to port this from the weather model:

https://github.com/ufs-community/ufs-weather-model/tree/develop/tests/auto

You can add a "run test" label to a PR and then the script will read it from Github and submit the job(s).

BrianCurtis-NOAA · 2021-09-27T13:57:05Z

@GeorgeGayno-NOAA Are you using the same systems that we are with UFS? I haven't ported it to WCOSS machines (yet, I think WCOSS2 will have pygithub package) because Mars/Luna/Surge/Venus didn't have pygithub.

The labels we've setup in GitHub are <machine>-<compiler>-<job>, if the <machine> matches the $HOSTNAME (with a wildcard search), it will start the <job> (from jobs/<job>.py) using the <compiler> you specify. The machines use a cronjob to check for open github PR's. It's easy to specify a different repo, the current one is just hard-coded.

There is still a level of manual work for the cases where a machine kills a job or it times out etc.. but the scripts should post in the PR with any issues that arise so someone can go take a look. For UFS the jobs submits the log from the machine as the signal that all went well because it uses the log file to determine if all jobs were successful or if even one failed.

Hopefully with a second group interested in a similar work flow, we can improve upon the current code.

GeorgeGayno-NOAA · 2021-09-27T16:02:31Z

@GeorgeGayno-NOAA Are you using the same systems that we are with UFS? I haven't ported it to WCOSS machines (yet, I think WCOSS2 will have pygithub package) because Mars/Luna/Surge/Venus didn't have pygithub.

The labels we've setup in GitHub are --, if the matches the $HOSTNAME (with a wildcard search), it will start the (from jobs/.py) using the you specify. The machines use a cronjob to check for open github PR's. It's easy to specify a different repo, the current one is just hard-coded.

There is still a level of manual work for the cases where a machine kills a job or it times out etc.. but the scripts should post in the PR with any issues that arise so someone can go take a look. For UFS the jobs submits the log from the machine as the signal that all went well because it uses the log file to determine if all jobs were successful or if even one failed.

Hopefully with a second group interested in a similar work flow, we can improve upon the current code.

We run on WCOSS and Hera, Jet and Orion.

We already run our tests off the cron. But I would be interested in how you do that. Also, how do you update the baseline data when updates change results?

BrianCurtis-NOAA · 2021-09-27T16:16:13Z

how do you update the baseline data when updates change results?
With the UFSWM if we know updates change the results we create baselines (BL job) which automatically calls the regression tests (RT job) after successful completion. New baselines are created --> script checks log files for errors --> baselines are moved to where we keep baselines --> RT's are run against new baselines.

BrianCurtis-NOAA · 2021-09-27T16:20:43Z

Cronjob for UFSWM (Orion example):

# Automated Regression Testing
MAILTO="brian.curtis@noaa.gov"
*/15 * * * * cd /work/noaa/nems/emc.nemspara/autort/tests/auto && /bin/bash --login start_rt_auto.sh >> rt_auto.out 2>&1

The start_rt_auto.sh script loads PYTHONPATH into $PATH and calls rt_auto.py
rt_auto.py (if machine matches label) gets all information for all jobs from GitHub and the machine, stores it into an object and passes it into the jobs.

to save a new set of baseline data. Fixes ufs-community#574

Update test scripts to call the update script. Fixes ufs-community#574

Fixes ufs-community#574.

Fixes ufs-community#574

Update the update_baseline.sh script to process the fix_sfc baseline subdirectory used by the grid_gen. Fixes ufs-community#574.

Fixes ufs-community#574

Fixes ufs-community#574.

the baseline is to be updated. Fixes ufs-community#574

Fixes ufs-community#574

Fixes ufs-community#574.

Fixes ufs-community#574

Fixes ufs-community#574.

Fixes ufs-community#574

Add logic to the consistency test scripts to automatically update the baseline data when code updates change results. Fixes #574

GeorgeGayno-NOAA added the enhancement New feature or request label Aug 24, 2021

GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Oct 13, 2021

An initial idea of updating the cycle reg test scripts

67bbcdb

to save a new set of baseline data. Fixes ufs-community#574

GeorgeGayno-NOAA self-assigned this Oct 14, 2021

GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Oct 14, 2021

New script - update.sh - that will update the baseline data directory.

c07db3a

Update test scripts to call the update script. Fixes ufs-community#574

GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Oct 14, 2021

Generalize the scripts and start testing them from chgres_cube.

4ef2152

Fixes ufs-community#574.

GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Oct 14, 2021

Rename some scripts

12106cf

Fixes ufs-community#574

GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Oct 15, 2021

Incorporate baseline scripts into grid_gen reg tests.

5d0ee75

Update the update_baseline.sh script to process the fix_sfc baseline subdirectory used by the grid_gen. Fixes ufs-community#574.

GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Oct 18, 2021

Updates to run the chgres and global_cycle tests on Hera.

a0efdd2

Fixes ufs-community#574

GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Oct 22, 2021

Merge branch 'develop' into feature/baseline

b0b12e4

Fixes ufs-community#574.

GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Nov 19, 2021

Merge branch 'develop' into feature/baseline

6c57168

Fixes ufs-community#574.

GeorgeGayno-NOAA mentioned this issue Nov 19, 2021

Automate update of consistency test baseline data. #603

Merged

GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Dec 17, 2021

Merge branch 'develop' into feature/baseline

3e3294f

Fixes ufs-community#574.

GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Jan 11, 2022

Add logic to run the 'get_hash.sh' script only when

a919025

the baseline is to be updated. Fixes ufs-community#574

GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Jan 12, 2022

Updates for the snow2mdl test.

a6bda24

Fixes ufs-community#574

GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Jan 12, 2022

Update for ice_blend test.

ae33ff7

Fixes ufs-community#574

GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Jan 12, 2022

Update chgres script for Orion.

94deb26

Fixes ufs-community#574

GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Jan 12, 2022

Update grid_gen scripts.

a24edc5

Fixes ufs-community#574

GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Jan 12, 2022

Update scripts for Jet.

86cac23

Fixes ufs-community#574.

GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Jan 13, 2022

Update snow and ice scripts on WCOSS-Dell.

f564cd7

Fixes ufs-community#574

GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Jan 13, 2022

Updates scripts on Hera.

c2db9c9

Fixes ufs-community#574.

GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Jan 13, 2022

Update scripts on WCOSS-Cray.

63b18ef

Fixes ufs-community#574.

GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Jan 13, 2022

Update remaining chgres regression test scripts.

ea4e891

Fixes ufs-community#574.

GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Jan 14, 2022

Minor cleanup of scripts.

41b550f

Fixes ufs-community#574.

GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Jan 14, 2022

Merge branch 'develop' into feature/baseline

216e9b1

Fixes ufs-community#574.

GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Jan 14, 2022

More minor script clean up.

a418c7b

Fixes ufs-community#574.

GeorgeGayno-NOAA added a commit to GeorgeGayno-NOAA/UFS_UTILS that referenced this issue Jan 14, 2022

Merge branch 'develop' into feature/baseline

bbc0fa5

Fixes ufs-community#574

GeorgeGayno-NOAA closed this as completed in #603 Jan 20, 2022

GeorgeGayno-NOAA added a commit that referenced this issue Jan 20, 2022

Automate update of consistency test baseline data. (#603)

04700f9

Add logic to the consistency test scripts to automatically update the baseline data when code updates change results. Fixes #574

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Automate update of consistency test baseline data #574

Automate update of consistency test baseline data #574

GeorgeGayno-NOAA commented Aug 24, 2021

GeorgeGayno-NOAA commented Aug 24, 2021

kgerheiser commented Sep 13, 2021

BrianCurtis-NOAA commented Sep 27, 2021 •

edited

Loading

GeorgeGayno-NOAA commented Sep 27, 2021

BrianCurtis-NOAA commented Sep 27, 2021 •

edited

Loading

BrianCurtis-NOAA commented Sep 27, 2021 •

edited

Loading

Automate update of consistency test baseline data #574

Automate update of consistency test baseline data #574

Comments

GeorgeGayno-NOAA commented Aug 24, 2021

GeorgeGayno-NOAA commented Aug 24, 2021

kgerheiser commented Sep 13, 2021

BrianCurtis-NOAA commented Sep 27, 2021 • edited Loading

GeorgeGayno-NOAA commented Sep 27, 2021

BrianCurtis-NOAA commented Sep 27, 2021 • edited Loading

BrianCurtis-NOAA commented Sep 27, 2021 • edited Loading

BrianCurtis-NOAA commented Sep 27, 2021 •

edited

Loading

BrianCurtis-NOAA commented Sep 27, 2021 •

edited

Loading

BrianCurtis-NOAA commented Sep 27, 2021 •

edited

Loading