You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is a very simple bug that likely is never encountered in practice, but was hit while investigating an issue scaling CTSM out to large processor counts while using native (ultra high-resolution) grids for the map algorithm for this streams file. The hope was that by switching to a native grid, we could avoid some of the communication patterns in the nearest neighbor mapping, but when testing that we noticed that the 'urbantvmapalgo' namelist variable is read, but not broadcast to other ranks. As a result, other ranks got the default 'nn' method, and the main rank didn't, leading to a hang in communication on the MPI communicator.
General bug information
CTSM version you are using:
ctsm5.2.003
Does this bug cause significantly incorrect results in the model's science?
No, but it causes a hang when changing the map algorithm from the default.
Configurations affected:
Details of bug
Simply put, a namelist variable is read, but not broadcast to other tasks. The result is that those other tasks try the default map algorithm vs the specified one. This is likely immaterial to 99.99% of users, but it's an easy fix, hence the upcoming one-line PR.
Important details of your setup / configuration so we can reproduce the bug
Admittedly I haven't tested too many configurations, but the change is a logical one-line one that has been confirmed to work in at least two cases.
Important output or errors that show the problem
No output, since it leads to a hang as N-1 processors call an MPI_Allreduce (when set to 'nn') and the main rank does not.
The text was updated successfully, but these errors were encountered:
Brief summary of bug
This is a very simple bug that likely is never encountered in practice, but was hit while investigating an issue scaling CTSM out to large processor counts while using native (ultra high-resolution) grids for the map algorithm for this streams file. The hope was that by switching to a native grid, we could avoid some of the communication patterns in the nearest neighbor mapping, but when testing that we noticed that the 'urbantvmapalgo' namelist variable is read, but not broadcast to other ranks. As a result, other ranks got the default 'nn' method, and the main rank didn't, leading to a hang in communication on the MPI communicator.
General bug information
CTSM version you are using:
ctsm5.2.003
Does this bug cause significantly incorrect results in the model's science?
No, but it causes a hang when changing the map algorithm from the default.
Configurations affected:
Details of bug
Simply put, a namelist variable is read, but not broadcast to other tasks. The result is that those other tasks try the default map algorithm vs the specified one. This is likely immaterial to 99.99% of users, but it's an easy fix, hence the upcoming one-line PR.
Important details of your setup / configuration so we can reproduce the bug
Admittedly I haven't tested too many configurations, but the change is a logical one-line one that has been confirmed to work in at least two cases.
Important output or errors that show the problem
No output, since it leads to a hang as N-1 processors call an MPI_Allreduce (when set to 'nn') and the main rank does not.
The text was updated successfully, but these errors were encountered: