Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Job submission issue on Summit #3124

Closed
dqwu opened this issue Aug 14, 2019 · 9 comments · Fixed by #3132
Closed

Job submission issue on Summit #3124

dqwu opened this issue Aug 14, 2019 · 9 comments · Fixed by #3132
Assignees
Labels

Comments

@dqwu
Copy link
Contributor

dqwu commented Aug 14, 2019

On Summit, case.submit fails with the following error message:
ERROR: Command: '-env "all, ARGS_FOR_SCRIPT=--resubmit" bsub -q batch -W 02:00 -P cli115 .case.run' failed with error '/bin/sh: - : invalid option'

This issue seems to be caused by PR #3092

Before that PR was merged, preview_run shows:

    SUBMIT CMD:
      bsub -q batch -W 02:00 -P cli115 -env "all, ARGS_FOR_SCRIPT=--resubmit" .case.run

After that PR was merged, preview_run shows:

    SUBMIT CMD:
      -env "all, ARGS_FOR_SCRIPT=--resubmit" bsub -q batch -W 02:00 -P cli115 .case.run
@jgfouca
Copy link
Member

jgfouca commented Aug 14, 2019

Looking at it now.

@jgfouca
Copy link
Member

jgfouca commented Aug 14, 2019

@dqwu , does this only happen for resubmissions? I was unable to reproduce the error for a simple case.

@dqwu
Copy link
Contributor Author

dqwu commented Aug 14, 2019

@jgfouca
Below are detailed steps to reproduce it with a simple F case.
Use GNU compiler as there are some known build errors with default XL compiler.

git clone https://github.com/E3SM-Project/E3SM.git

cd E3SM

git submodule update --init

cd cime/scripts

./create_newcase --compiler=gnu --case FC5AV1C-H01B_ne4_ne4 --compset FC5AV1C-H01B --res ne4_ne4

cd FC5AV1C-H01B_ne4_ne4

./case.setup

./preview_run

./case.build

./case.submit

preview_run shows:

SUBMIT CMD:
  -env "all, ARGS_FOR_SCRIPT=--resubmit" bsub -q batch -W 02:00 -P cli115 .case.run

case.submit shows:

...
Check case OK
submit_jobs case.run
Submit job case.run
Submitting job script -env "all, ARGS_FOR_SCRIPT=--resubmit" bsub -q batch -W 02:00 -P cli115 .case.run
ERROR: Command: '-env "all, ARGS_FOR_SCRIPT=--resubmit" bsub -q batch -W 02:00 -P cli115 .case.run' failed with error '/bin/sh: - : invalid option'

@sarats , could you please try to reproduce it on Summit? Thanks.

@sarats
Copy link
Member

sarats commented Aug 14, 2019

Yes, I tried preview_run after setting up a case and it shows the extraneous stuff before bsub invocation.

@jgfouca
Copy link
Member

jgfouca commented Aug 15, 2019

@dqwu , ok, i see the problem now

jgfouca added a commit that referenced this issue Aug 15, 2019
Fix batch submission command for summit

A recent CIME update broke the case where an lsf batch
system was configured to pass arguments via batch_env.

This PR fixes the issue by changing the command formation logic
to not prepend lfs args if batch_env is set.

Fixes #3124

[BFB]

* origin/jgfouca/cime/fix_env_batch_lsf_env:
  Fix batch submission command for summit
jgfouca added a commit that referenced this issue Aug 15, 2019
Fix batch submission command for summit

A recent CIME update broke the case where an lsf batch
system was configured to pass arguments via batch_env.

This PR fixes the issue by changing the command formation logic
to not prepend lfs args if batch_env is set.

Fixes #3124

[BFB]

* origin/jgfouca/cime/fix_env_batch_lsf_env:
  Fix batch submission command for summit
@amametjanov
Copy link
Member

@jgfouca there is a similar issue on Theta with the latest master: env-vars are not getting resolved in preview_run output:

    MPIRUN (job=case.test):
      aprun -n 9280 -N 64 --cc depth -d 1 -j 1 $ENV{SMP_VARS} $ENV{labeling} /projects/ClimateEnergy_3/azamatm/SMS.ne120_oRRS18v3_ICG.A_WCYCL2000_H01AS.theta_intel.cam-cosplite.20190816_174816_6yenqk/bld/e3sm.exe   >> e3sm.log.$LID 2>&1

@jgfouca
Copy link
Member

jgfouca commented Aug 16, 2019

@amametjanov , ok, looking at it

@jgfouca
Copy link
Member

jgfouca commented Aug 16, 2019

@amametjanov , fixed.

@amametjanov
Copy link
Member

Worked, thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants