-
Notifications
You must be signed in to change notification settings - Fork 18
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Something is wrong - GEOSldas #1226
Comments
It is this call that crashes GEOSldas . Line 615 in 60342c8
keep looking.... |
I'll add in @tclune just so he knows. Never even heard of |
Also adding @atrayano after seeing what's in |
Interesting: this is the section that I'm reverse engineering today. But had not planned to make any changes. |
Undoubtedly we will need to await wisdom from @atrayano , but my current understanding is that this procedure is the one that propagates imports and exports up from children. @weiyuan-jiang can you provide any more details about how it crashes? |
@tclune , No, I can't at this point. I spended whole day yesterday to figure that out but failed. I am keeping investigating...( Debug mode is successful) |
The error usually looks like Sometimes ( not all the time), it reports a little bit more details And |
Uh oh. I did make some nontrivial changes to VarConn a few weeks back. Worked with @atrayano to be sure we covered the necessary cases and the usual tests worked. But maybe LDAS is doing something unusual? We probably need to review the connectivities there. Looking for anything out of the ordinary. |
I can confirm that MAPL v2.10.0 works but v2.11.0 crashes. Can we squeeze in a tag between them for GEOSldas to fix pFIO_ShaveMantissa.c ? @mathomp4 @tclune @gmao-rreichle |
I am not sure how long it takes to fix our develop branch. For a quick fix for GEOSldas, a tag v2.10.1 seems easy. But I am not sure if it breaks rules and brings more complications. This is the branch off v2.10.0 https://github.com/GEOS-ESM/MAPL/tree/patch/wjiang%2Fv2.10.1 . @mathomp4 . |
@weiyuan-jiang Yes. We can make one. Let me make sure your branch is good and then I'll issue one. I'll probably need to push a few updates to it to make it "official" |
Of course, we'll want @tclune's blessing. But until then, can you provide me a good CHANGELOG entry for this? I can then update your branch with it ETA: Ah. Is this the bit-shaving-works-for-no-tiles fix:
|
@weiyuan-jiang Is that code in ShaveMantissa? |
No. That has nothing to do with ShaveMantissa. I am trying to compare Tom's new commit with his old commit on oomph |
because the name E2E seems export to export, but somehow there is import in it. |
Blessing given. (To patch release) |
E2E is a kludge itself, and I changed the kludge a bit to simplify the underlying logic and data types. Longer term the kludge evaporates. |
The E2E issue is almost certainly related the issues that @weiyuan-jiang is seeing - but we're still at a loss as to why the change would break LDAS but not the GCM. |
Also I noticed the memory is not released and leaked forever. With fix, ( only 4 nodes), it reports |
If the PR #1237 fixes the issue, we probably don't need a new tag v2.10.1 any more @mathomp4 @gmao-rreichle |
@weiyuan-jiang Well, it exists. Who knows, we might have a reason for it in the future. Did you test #1237 with GEOSgcm? |
@mathomp4 I tested it with GEOSldas and am confident it will work with GEOSgcm. There were memory leak in GEOSgcm too. It was just not as severe as GEOSldas's 24-member data assimilation run. |
Thanks, @weiyuan-jiang, for fixing this. I'm ok with using either MAPL v2.10.1 or v2.13.xx (to be created after merging this PR). |
@gmao-rreichle @biljanaorescanin For the upcoming MAPL 2.14.0, you'll want:
ETA: ecbuild hasn't changed in a looooong time. So whatever is there, use that! |
@gmao-rreichle @biljanaorescanin NOTE: For now you'll want to use env 3.7.0 and cmake 3.7.3. I'm planning on releasing two new non-zero-diff versions of each today as @sdrabenh wants to make an NZD release of GEOS. So don't go straight to the latest version without realizing that. We'll probably want @biljanaorescanin or @weiyuan-jiang to do tests with the new env and cmake to make sure the LDAS doesn't go wonky with them. The changes are ESMA_env is moving to Intel 2021.3 (which is NZD to 2021.2) and ESMA_cmake changes the Intel flags to make things more portable at NAS (see GEOS-ESM/ESMA_cmake#240) |
sloved |
With this latest components
env | (t) v3.7.0 (DH)
cmake | (t) v3.7.2 (DH)
ecbuild | (t) geos/v1.0.6 (DH)
GMAO_Shared | (t) v1.4.12 (DH)
MAPL | (b) develop (DH)
GEOSgcm_GridComp | (b) develop (DH)
GEOSldas crashes before finishing Setservice. But it can run with Debug flag..... Something is wrong but I am not sure what it is...
The text was updated successfully, but these errors were encountered: