Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The FC5L45BGC compset was failed in cori-knl with "column nbalance error" #1368

Closed
minxu74 opened this issue Apr 5, 2017 · 9 comments
Closed

Comments

@minxu74
Copy link
Contributor

minxu74 commented Apr 5, 2017

ACME tag: v1.0.0-beta.1-130-g7d39622

Case created by: ./create_newcase --case /global/homes/m/minxu/scratch/F_acmev1_enso_camse_clm45bgc_ne30 -res ne30_ne30 -compset FC5L45BGC -mach cori-knl --compiler intel

PES layout: 2560 x 2 , 64 MPI tasks per node.

Error message:
0014: ERROR in CNBalanceCheckMod.F90 at line 396
1863: column nbalance error = -3.511888906373967E-006 178098 3
0832: column nbalance error = -4.293514919911559E-008 78349 3
1023: column nbalance error = -2.734882737515850E-007 96536 3
2369: column nbalance error = -1.600117219243727E-006 222936 3
2501: column nbalance error = -7.471929814815514E-006 233928 3
0943: column nbalance error = -2.751315590089568E-007 88737 3
0943: Latdeg,Londeg = 59.9466134666123 93.7563801427842
0943: begnb = 2.80072383505868
0943: endnb = 2.80073718112814
0943: delta store = 1.334606945935235E-005
0943: input mass = 1.307093991055480E-005
0943: output mass = 2.010211410065240E-012
0943: net flux = 1.307093790034339E-005
0943: denit = 0.000000000000000E+000
0943: n2onit = 2.010114779247898E-012
0943: no3 leach = 2.137917904780868E-018
0943: no3 runof = 0.000000000000000E+000
0943: ndep = 1.307093991055480E-005
0943: nfix = 0.000000000000000E+000
0943: nsup = 0.000000000000000E+000
0943: fire = 0.000000000000000E+000
0943: dwt = 0.000000000000000E+000
0943: prod = 0.000000000000000E+000

@bishtgautam
Copy link
Contributor

I believe the CLM45%BGC has broken for a very long time. Do you require land BGC to be on for your study?

@minxu74
Copy link
Contributor Author

minxu74 commented Apr 5, 2017

@bishtgautam Could you recommend us a workable F-compset with the land biogeochemical cycle? Thanks!

@bishtgautam
Copy link
Contributor

The land team is presently only testing CLM45%CN, but it appears there is no corresponding F compset. Following the definition of FC5L45BGC, you can try adding a new compset FC5L45CN and see if it works.

  <compset>
    <alias>FC5L45CN</alias>
    <lname>2000_CAM5_CLM45%CN_CICE%PRES_DOCN%DOM_RTM_SGLC_SWAV</lname>
  </compset>

Btw, I'm not sure if CLM45%CN will meet the scientific requirments of your study.

@minxu74
Copy link
Contributor Author

minxu74 commented Apr 6, 2017

@bishtgautam Thanks a lot. I will evaluate the differences between the BGC and CN to see if it fits my research goal. By the way, since those BGCs have broken for a long time, shall we remove them from the config files? Because I used the manage-case to query F-compsets and get a lot of compsets with CLM45%BGC. They should not work. Is it possible to tag those workable or broken compsets?

@bishtgautam
Copy link
Contributor

Even though the CLM45%BGC compset has been broken for awhile (mid-2016 based on #974), at this I'm not in the favor of removing all compsets related to CLM45%BGC unless the land team wants to abandon its support.

Tagging: @thorntonpe, @wjriley

@rljacob
Copy link
Member

rljacob commented Apr 6, 2017

I think we still have a lot of compset definitions held over from cesm. Not sure all the science groups have gone through and removed the ones not being supported.

@thorntonpe
Copy link
Contributor

I have no interest in or plans for supporting these capabilities, but if BIll or others find them useful I'm not opposed to leaving them in place for the time being. We have many new ALM developments which are not currently protected by tests in the developer suites, and I would rather we direct our time and resources to making a robust testing package for the active and ongoing development efforts.

@minxu74
Copy link
Contributor Author

minxu74 commented Apr 6, 2017

Since the error message (N balance error) in this issue is same as that @dmricciuto mentioned in the open issue #974, I would like to close it to avoid the duplicates. Thanks for all suggestions and comments.

@minxu74
Copy link
Contributor Author

minxu74 commented Apr 10, 2017

Same error reported by @worleyph at the issue #1379 but with different land physics CN. The error was gone if I ran the model in a pure MPI process i.e. changing the number of threads from 2 to 1 following his suggestion. Since those issues are related, I reopened mine.

@minxu74 minxu74 reopened this Apr 10, 2017
jqyin added a commit that referenced this issue May 2, 2017
#1468)

Fixes land BGC computation for threads more than one

When nthreads > 1, the size of local variable (e.g. temp_sminp_to_plant)
is not equal to the global variable (e.g. phosphorusflux_vars%sminp_to_plant_col).

Fixes #1379
Fixes #1368
[BFB]
jqyin added a commit that referenced this issue Aug 22, 2017
When nthreads > 1, the size of local variable (e.g. temp_sminp_to_plant)
is not equal to the global variable (e.g. phosphorusflux_vars%sminp_to_plant_col).

Fixes #1379
Fixes #1368
[BFB]
mark-petersen pushed a commit to mark-petersen/E3SM that referenced this issue Jan 19, 2021
Update needed for ACME

Framework features brought in:

* f444d0f Merge PR E3SM-Project#1418 'matthewhoffman/framework/output_record_reference_time' into develop
* 263e14f Merge PR E3SM-Project#1428 'mark-petersen/framework/couple_fixes' into develop
* bcce31d Merge PR E3SM-Project#1424 'amametjanov:az/tools/cp-prebuilt-tools' into develop
* 98cfeea Merge PR# 1349 'akturner/framework/forcing_cleanup' into develop
* 9359319 Merge PR E3SM-Project#1347 'akturner/framework/forcing_restart_timestamp' into develop
* e9ce203 Merge PR E3SM-Project#1348 'akturner/framework/forcing_at_init' into develop
* 4974284 Merge PR E3SM-Project#1368 'akturner/framework/improved_messages_in_driver' into develop
* 86d50c5 Merge PR E3SM-Project#1417 'akturner/framework/forcing_multiple_blocks' into develop
* 9116da3 Merge branch 'framework/validation-of-streams-using-interval_in-interval_out' into develop
* e466b46 Merge branch 'framework/interval_in-interval_out-support-for-streams' into develop
* 30dc955 Merge branch 'az/framework/mpas_dmpar-race-fix' into develop
* b632938 Merge branch 'framework/i8_interval_division' into develop
* 6dac06c Merge branch 'framework/log_write_IBM_error' into develop
* 960a648 Merge branch 'framework/cleanup-logging-stream-manager' into develop
* 504c282 Merge branch 'framework/make-streams-with-direction-none-inactive' into develop
* 5903748 Merge branch 'framework/correctly_remove_blk_fields' into develop
* 3565965 Merge branch 'framework/iostreams-real4dfield-bug' into develop
* 8b60591 Merge branch 'framework/missing-deallocate-nEdgesOnCellField-bootstrapping' into develop
* 70b953b Merge branch 'master' into develop
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

5 participants