Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Excessive wall clock time for hera.gnu #1440

Closed
DeniseWorthen opened this issue Sep 24, 2022 · 8 comments
Closed

Excessive wall clock time for hera.gnu #1440

DeniseWorthen opened this issue Sep 24, 2022 · 8 comments
Assignees
Labels
bug Something isn't working

Comments

@DeniseWorthen
Copy link
Collaborator

DeniseWorthen commented Sep 24, 2022

Description

Run-times for the current develop on hera.intel/gnu vs cheyenne.intel/gnu taken from the most Regression test logs.

control_p8:
hera.intel 165s
hera.gnu 890s

cheyenne.intel 178s
cheyenne.gnu 300s

cpld_control_p8:

hera.intel 333s
hera.gnu 1720s

cheyenne.intel 321s
cheyenne.gnu 495s

Cheyenne intel/gnu times are similar for both tests (but gnu is slower).

On hera, both control_p8 and cpld_control_p8 take ~5x longer for GNU.

@FernandoAndrade-NOAA
Copy link
Collaborator

@natalie-perlin Are there any current issues with utilizing gnu 10.2? It was suggested that would be a solution to the slower hera/gnu times

@jkbk2004
Copy link
Collaborator

@natalie-perlin Are there any current issues with utilizing gnu 10.2? It was suggested that would be a solution to the slower hera/gnu times

@natalie-perlin can we make sure /scratch1/NCEPDEV/nems/role.epic/hpc-stack/libs/gnu-10.2/modulefiles/stack still up to date ? https://github.com/ufs-community/ufs-weather-model/blob/develop/modulefiles/ufs_common.lua

@natalie-perlin
Copy link
Collaborator

@jkbk2004 @FernandoAndrade-NOAA -
all verified, the /scratch1/NCEPDEV/nems/role.epic/hpc-stack/libs/gnu-10.2/modulefiles/stack is up-to-date.

The following compiler flag needs to be added for gnu>=10 versions:

FFLAGS+="-fallow-argument-mismatch -fallow-invalid-boz"

@zach1221
Copy link
Collaborator

I'm not certain when this was last tested, but I tried again with the current spack-stack 1.4.1 installation on Hera and gnu seemed faster.
control_p8:
hera.intel 175s
hera.gnu 302s

cpld_control_p8:
hera.intel 351s
hera.gnu 651s

@DeniseWorthen
Copy link
Collaborator Author

@zach1221 We have >9 for GNU now on Hera w/ spack-stack, right? I think that was the issue previously (the GNU version).

If it now runs w/in the standard wall clock, we should remove this from the relevant tests

if [[ $MACHINE_ID = hera ]] && [[ $RT_COMPILER = gnu ]]; then
  WLCLK=40
fi

@zach1221
Copy link
Collaborator

@zach1221 We have >9 for GNU now on Hera w/ spack-stack, right? I think that was the issue previously (the GNU version).

If it now runs w/in the standard wall clock, we should remove this from the relevant tests

if [[ $MACHINE_ID = hera ]] && [[ $RT_COMPILER = gnu ]]; then
  WLCLK=40
fi

Ok, let me try removing that from the cpld_control_p8 and control_p8 tests now then retesting. If the result is the same maybe we can remove it in the next ufs-wm PR.

@zach1221
Copy link
Collaborator

zach1221 commented Oct 12, 2023

Ok, I can confirm that cpld_control_p8 and control_p8 gnu continue to run between 600 - 650s for cpld_control_p8_gnu and between 300 - 260s for control_p8_gnu. This is with the standard wall clock. I have removed the hera.gnu wallclock setting from both test files in my ufs-wm #1912 . logs: /scratch1/NCEPDEV/nems/Zachary.Shrader/gnu_test

@zach1221
Copy link
Collaborator

PR 1946 has been merged. Closing this issue.

@github-project-automation github-project-automation bot moved this from In Progress to Done in Backlog: platforms and RT Nov 14, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
Status: Done
Development

No branches or pull requests

5 participants