load esmf/8.5.0 in gaea.intel.lua #1327
I don't think the automatic check failures are due to the changes in this PR. Several "GitHub Incident - Disruption with some GitHub services - 11 October 2024" notifications have arrived within the last hour. |
Confirmed that adding |
@RussTreadon-NOAA sorry I didn't catch this sooner, but I see that C5 is having an OS upgrade next week... I fear this will break a lot of spack-stack things, so perhaps it's best to pause this until after that maintenance/upgrade? |
Oops, that's unfortunate. I'll flip this PR to draft since what works today will likely fail next week. |
I just so happened to catch "Gaea C5 upgrade" on my calendar since I subscribe to the RDHPCS calendar. Email search says it is for an OS upgrade. If it is anything like the upgrade on other machines, it will be quite disruptive. |
Agreed. This PR may be in draft mode for quite some time. If the wait gets too long, we can simply close this PR and open a new one once the OS upgrade dust settles. |
Successfully built GDASApp on Gaea C5 using |
GDASApp ctests
Unfortunately, several GDASApp ctests fail on Gaea C5
|
Rerun
@DavidNew-NOAA , is this failure an example of what you mentioned during today's (10/24) JEDI-T2O meeting: develop: works for non-marine DA apps? Do we have a tentative fix? |
@RussTreadon-NOAA I thought that was in reference to the gdas.cd hash update PR that, while breaking aero and snow, fixes soca? |
test_gdasapp_util_prepdata failure
test_gdasapp_util_prepdata passes on Cactus, Hera, Hercules, and Orion. It fails on Gaea C5.
The above failures correspond to the executions
and
I am currently building GDASApp on Gaea C6 to see if the same error occurs on C6. Tagging @apchoiCMD for awareness. |
@RussTreadon-NOAA Thanks for letting me know. I have never used the Gaea machine. Yes, if a |
Failed test_gdasapp tests
Rerun failed jobs
Each of these jobs fails on Gaea C5 for the same reason
As a test copy I have not yet found the proper combination of environment variables to set to get these three jobs to run as ctests. Tagging @CoryMartin-NOAA , @danholdaway , @guillaumevernieres , and @DavidNew-NOAA for awareness |
test_gdasapp_util_2ioda failures
In looking at the
@apchoiCMD , do ctests
depend on output from
FYI, the Gaea C6 build is complete. The above jobs fail on both Gaea C5 and C6. |
Install
The reasons for the failures are the same as reported earlier
The
Each of these tests runs an MPI executable using
On Hercules and Orion we also specify
For the Gaea C5 tests,
Setting additional environment variables on Gaea C5 may allow |
@CoryMartin-NOAA , this PR is ready for review. The slurm / sbatch environment variable issue is outside the repo ... but I'm fine with keeping this PR open if we're concerned that we'll forget to follow up & resolve this after merging. |
Looks good @RussTreadon-NOAA, thanks also for the CRTM data fix.
I think we merge; the slurm
dependency is a larger issue and needs to be resolved for WCOSS and Gaea
Good point. We need to step away from the assumption that slurm is the workload manager. |
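To make the point about not assuming slurm concrete, here is a minimal sketch of how a test wrapper could pick an MPI launcher at run time rather than hard-coding one. This is not code from GDASApp: the function name pick_launcher and the LAUNCHER_OVERRIDE variable are hypothetical, and the candidate list (srun, mpiexec, mpirun) is only an assumption about what a given machine might provide.

```shell
#!/usr/bin/env bash
# Hypothetical helper, not part of GDASApp: choose an MPI launcher
# without assuming that slurm is the workload manager.
pick_launcher() {
  # LAUNCHER_OVERRIDE is an illustrative variable name (an assumption);
  # a site configuration could set it to force a specific launcher.
  if [ -n "${LAUNCHER_OVERRIDE:-}" ]; then
    echo "${LAUNCHER_OVERRIDE}"
    return 0
  fi
  # Otherwise use the first launcher found on PATH.
  for candidate in srun mpiexec mpirun; do
    if command -v "${candidate}" >/dev/null 2>&1; then
      echo "${candidate}"
      return 0
    fi
  done
  echo "no MPI launcher found on PATH" >&2
  return 1
}
```

A ctest wrapper could then call something like launcher="$(pick_launcher)" and build its command line from the result. Launcher-specific flags and any environment variables a machine needs would still have to be handled per platform.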
@RussTreadon-NOAA Even
I will look into this in more detail once my current priority is done. I have no idea on both |
Thank you @apchoiCMD . This is progress! |
This PR adds load("esmf/8.5.0") to modulefiles/GDAS/gaea.intel.lua. This is required for the GDASApp build to complete successfully on Gaea C5.
Resolves #1326