Update rt.sh to allow creating new baselines only for a subset of tests #1834

DusanJovic-NOAA · 2023-07-12T15:00:56Z

Description

A new option has been added to rt.sh script. If option -b <file> is passed to rt.sh together with -c (create new baselines) then the script will run only tests specified in <file> (one test name per line) and create new baselines which will be saved in ${NEW_BASELINE} directory. For all other tests specified rt.conf current baseline directory will be linked to ${NEW_BASELINE}.
This allows developers to create new baselines only for the tests they expect to change the baselines while using current baselines for tests they expect to not change the outputs.

For example:

If file named 'new_baselines' contain:

$ cat new_baselines
control_CubedSphereGrid
control_CubedSphereGrid_parallel

then, running:

./rt.sh -e -c -b new_baselines

wil create new baselines only for these two tests and put them in ${NEW_BASELINE}, while symlinks will be created for all other tests.

Finally, running:

./rt.sh -e -m

will run a full test in verify mode against new baselines, which will confirm that no other tests other than those listed above change the outputs.

Input data additions/changes

No changes are expected to input data.
Changes are expected to input data:
- New input data.
- Updated input data.

Anticipated changes to regression tests:

No changes are expected to any regression test.
Changes are expected to the following tests:

Subcomponents involved:

Library Updates/Changes

Not Needed
Create separate issue in JCSDA/spack-stack asking for update to library. Include library name, library version.
Add issue link from JCSDA/spack-stack following this item

Combined with PR's (If Applicable):

Commit Queue Checklist:

Link PR's from all sub-components involved in section below
Confirm reviews completed in ALL sub-component PR's
Add all appropriate labels to this PR.
Run full RT suite on either Hera/Cheyenne AND attach log to a PR comment.
Add list of any failed regression tests to "Anticipated changes to regression tests" section.

Linked PR's and Issues:

Testing Day Checklist:

This PR is up-to-date with the top of all sub-component repositories except for those sub-components which are the subject of this PR.
Move new/updated input data on RDHPCS Hera and propagate input data changes to all supported systems.

Testing Log (for CM's):

RDHPCS
- Hera
- Orion
- Jet
- Gaea
- Cheyenne
WCOSS2
- Dogwood/Cactus
- Acorn
CI
- Completed
opnReqTest
- N/A
- Log attached to comment

BrianCurtis-NOAA · 2023-07-12T15:30:12Z

I would normally just cp rt.conf to rt.test and edit rt.test and run with -l. How does this improve on that?

DusanJovic-NOAA · 2023-07-12T15:49:51Z

I would normally just cp rt.conf to rt.test and edit rt.test and run with -l. How does this improve on that?

If you use '-l rt.test' in create mode (-c) you can not run full test afterwards verifying that all tests pass, some using new baselines, some using current baselines.

BrianCurtis-NOAA · 2023-07-13T19:29:03Z

I wonder if you could write something that upon baseline creation failure for a full rt suite, a file could be generated with the list of baseline generating tests that failed, making it easy to re-run the failed tests?

DusanJovic-NOAA · 2023-07-13T20:14:08Z

I wonder if you could write something that upon baseline creation failure for a full rt suite, a file could be generated with the list of baseline generating tests that failed, making it easy to re-run the failed tests?

I think this sentence:

This allows developers to create new baselines only for the tests they expect to change the baselines while using current baselines for tests they expect to not change the outputs.

describes that situation. How developers come up with the list of tests that need new baselines is up to them. Either by running full set of tests and finding which one failed or by reasoning based on the changes they made.

tests/rt.sh

…tion (-b option)

zach1221 · 2023-08-31T20:25:13Z

Changes combined into #1467 . Closing

Update rt.sh to allow creating new baselines only for a subset of tests

83f8712

DusanJovic-NOAA requested review from BrianCurtis-NOAA, DeniseWorthen and junwang-noaa July 12, 2023 15:01

BrianCurtis-NOAA reviewed Jul 17, 2023

View reviewed changes

tests/rt.sh Outdated Show resolved Hide resolved

DusanJovic-NOAA added 2 commits July 17, 2023 19:22

Update rt.sh to run compile tasks only if needed during baseline crea…

26b8c6a

…tion (-b option)

Revert MAX_BUILDS back to 10

b4ff12a

zach1221 mentioned this pull request Aug 29, 2023

Use optional chunksizes argument in register_restart_field calls (combining PRs #1853 & 1834) #1467

Merged

16 tasks

Merge remote-tracking branch 'origin/develop' into rt_baseline_subset

c372750

zach1221 closed this Aug 31, 2023

DusanJovic-NOAA deleted the rt_baseline_subset branch August 31, 2023 20:32

DusanJovic-NOAA mentioned this pull request Dec 19, 2023

Update PR template organization, adding commit message and priority section. #2053

Closed

41 tasks

DusanJovic-NOAA mentioned this pull request Aug 20, 2024

Allow use of downscaled warmstart files for cpld_control_sfs test #2375

Merged

14 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update rt.sh to allow creating new baselines only for a subset of tests #1834

Update rt.sh to allow creating new baselines only for a subset of tests #1834

DusanJovic-NOAA commented Jul 12, 2023 •

edited by BrianCurtis-NOAA

Loading

BrianCurtis-NOAA commented Jul 12, 2023

DusanJovic-NOAA commented Jul 12, 2023

BrianCurtis-NOAA commented Jul 13, 2023

DusanJovic-NOAA commented Jul 13, 2023

zach1221 commented Aug 31, 2023

Update rt.sh to allow creating new baselines only for a subset of tests #1834

Update rt.sh to allow creating new baselines only for a subset of tests #1834

Conversation

DusanJovic-NOAA commented Jul 12, 2023 • edited by BrianCurtis-NOAA Loading

Description

Input data additions/changes

Anticipated changes to regression tests:

Subcomponents involved:

Library Updates/Changes

Combined with PR's (If Applicable):

Commit Queue Checklist:

Linked PR's and Issues:

Testing Day Checklist:

Testing Log (for CM's):

BrianCurtis-NOAA commented Jul 12, 2023

DusanJovic-NOAA commented Jul 12, 2023

BrianCurtis-NOAA commented Jul 13, 2023

DusanJovic-NOAA commented Jul 13, 2023

zach1221 commented Aug 31, 2023

DusanJovic-NOAA commented Jul 12, 2023 •

edited by BrianCurtis-NOAA

Loading