Terminate Experiment does not work as intended #7700
I was able to reproduce this (it needs 100 realizations and QUEUE_OPTION MAX_RUNNING set to 50). It might be related to #7704. For this poly-case it is probably difficult to cancel the experiment in time, because it evaluates very fast. After initiating termination the GUI appeared to hang, but it eventually caught up, and by then all four iterations had completed.
Even with the fix from #7704 included, there is still an issue where the next iteration is started even though the cancellation/termination has happened:
Then in logs/jobqueue* we can find entries around the time of termination:
Same problem with LSF and --enable-scheduler. Not reproducible with LSF and legacy jobqueue.
Adding a sleep to
The issue that remains after #7704 is merged is covered by #1250. The Scheduler does not technically perform worse than job_queue, but the problem is amplified because the Scheduler is faster, which makes it hard to click 'Terminate' in the short time window where an iteration is actually running. This is not a problem for users, as they do not run the poly-case.
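To illustrate the behaviour discussed in the comments above (the next iteration starting even though termination was requested), here is a minimal asyncio sketch. It is not ERT's Scheduler code: `run_realization`, `run_iteration`, `run_experiment`, `demo` and the `check_between` flag are all hypothetical names, and the timings are arbitrary. It only assumes that an experiment is a loop over iterations, each of which runs a batch of realizations.

```python
import asyncio


async def run_realization(delay: float) -> None:
    # Hypothetical stand-in for one forward-model run.
    await asyncio.sleep(delay)


async def run_iteration(realizations: int, delay: float, cancel: asyncio.Event) -> int:
    """Run one batch of realizations; return how many completed."""
    jobs = [asyncio.create_task(run_realization(delay)) for _ in range(realizations)]
    batch = asyncio.gather(*jobs, return_exceptions=True)
    waiter = asyncio.create_task(cancel.wait())
    # Return as soon as either the whole batch is done or a cancel arrives.
    await asyncio.wait({batch, waiter}, return_when=asyncio.FIRST_COMPLETED)
    waiter.cancel()
    for job in jobs:
        job.cancel()  # no effect on jobs that have already finished
    results = await batch
    return sum(1 for result in results if result is None)


async def run_experiment(
    iterations: int,
    realizations: int,
    delay: float,
    cancel: asyncio.Event,
    check_between: bool = True,
) -> list[int]:
    """Return the indices of the iterations that were started."""
    started: list[int] = []
    for iteration in range(iterations):
        # Without this check, a cancel that arrived during the previous
        # iteration is not noticed here and the next iteration starts anyway.
        if check_between and cancel.is_set():
            break
        started.append(iteration)
        await run_iteration(realizations, delay, cancel)
    return started


async def demo(check_between: bool) -> list[int]:
    cancel = asyncio.Event()

    async def click_terminate() -> None:
        # Simulates the user clicking 'Terminate' during iteration 0.
        await asyncio.sleep(0.01)
        cancel.set()

    clicker = asyncio.create_task(click_terminate())
    started = await run_experiment(4, 10, 0.05, cancel, check_between)
    await clicker
    return started


if __name__ == "__main__":
    print("with check:   ", asyncio.run(demo(check_between=True)))
    print("without check:", asyncio.run(demo(check_between=False)))
```

In this sketch, the run with the check starts only iteration 0, while the run without it still starts all four iterations after the cancel. Whether the real code path has an equivalent gap is something the logs would have to confirm.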
Running Poly-ert ES_MDA and clicking Terminate Experiment seems to do little. The cursor changes to a busy state, but the realizations all complete in the background.
I clicked terminate once I saw the first realization complete, so there should be plenty of time to stop other jobs.
This was run using bleeding, on a Mac with Python 3.11 and the local queue.
Testing this on RGS yielded:
Seems like something happens, but some realizations keep going regardless, see attachments.