-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Memory "management" issue with intel #1322
Comments
I'm labeling this as |
fyi @fmahebert |
Yes, we've seen the intel compiler produce executables that take up way more memory for a little while now. We haven't been able to pinpoint the cause yet. Thanks for opening an issue to track this and to share your measurements. Related issue (presumably): |
Running with map profiling: |
@fmahebert has there been any progress on reducing the memory footprint using intel? This has been blocking us for months now - can't run even a low resolution coupled DA experiment. |
@jswhit There's been no progress towards understanding this issue from the JCSDA core team, largely for lack of resources (not a lack of concern). |
If you set me up with a small (<= 32 nodes) reproducer on wcoss2, then I can take a look and work with Intel on a solution. I have a complete build of the global-workflow on dogwood. |
would using gcc for GDASapp (instead of intel) be a potential workaround for this in the short term? |
@jswhit yes, certain compilers/platform combinations certainly work significantly better. E.g. intel + hera doesn't have this issue as Guillaume pointed out. Using gnu on orion/hercules is another option, I think Guillaume had a lot more success with that than with intel. |
@shlyaeva I'm mainly interested in running on gaea, but I think the issue there is the same as on hercules/orion (they all use a newer intel compiler than hera). |
The soca variational application takes an insane amount of memory on Hecules and Gaea (~8TB for the simple 3DVAR), both use
intel 2021.9.0
. The same application on Hera requires ~0.8TB of memory, the intel compiler version on Hera is2021.5.0
.I have no idea if the compiler is the issue.
The text was updated successfully, but these errors were encountered: