You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Task (runnable) @0x00002b8be08c5cd0connecting to worker 1 out of 4
srun: job 13332841 queued and waiting for resources
julia> srun: job 13332841 has been allocated resources
connecting to worker 2 out of 4
connecting to worker 3 out of 4
connecting to worker 4 out of 4
However, if I have a lot of workers, the corresponding number of output files appears in the working directory. I decided to add the -o argument and log everything into one file
julia_worker:9007#131.169.193.109
julia_worker:9006#131.169.193.109
julia_worker:9008#131.169.193.109
julia_worker:9009#131.169.193.109
Master process (id 1) could not connect within 60.0 seconds.
exiting.
Master process (id 1) could not connect within 60.0 seconds.
exiting.
Master process (id 1) could not connect within 60.0 seconds.
exiting.
Master process (id 1) could not connect within 60.0 seconds.
exiting.
but does not give any workers
Task (runnable) @0x00002b8be01f7260connecting to worker 1 out of 4
srun: job 13332876 queued and waiting for resources
julia> srun: job 13332876 has been allocated resources
srun: error: max-wn009: tasks 0-3: Exited with exit code 1
I decided to have a look at the source code. If I understand correctly, it specifies values for -o and -D independently of my choice. Maybe, it causes trouble
Hello everyone,
To add workers and schedule jobs on the cluster, I'm using the addprocs_slurm function from ClasterManagers
It works as intended
However, if I have a lot of workers, the corresponding number of output files appears in the working directory. I decided to add the -o argument and log everything into one file
It indeed creates this log file
but does not give any workers
I decided to have a look at the source code. If I understand correctly, it specifies values for -o and -D independently of my choice. Maybe, it causes trouble
The text was updated successfully, but these errors were encountered: