Moneo allows for the assignment of job IDs to a subset of nodes. The running command /tmp/moneo-worker/jobIdUpdate.sh <jobid>
on each compute node accomplishes this.
Doing this manually may be tedious if the frequency of job deployment is high. Integrating job updates into Slurm's Prolog/Epilog scripts will offload the effort from the individual user launching a job on Slurm.
- Moneo must be deployed to the compute nodes prior to job creation.
Configure Moneo Linux service: Moneo services
This makes managing Moneo easier.
-
Place the prolog and epilog scripts in a directory which will be pointed to by the Slurm configuration.
-
Next modify the /etc/slurm.conf file by adding the following variables:
Prolog=/mnt/sched/slurm/etc/prologMoneo.sh PrologFlags=Alloc Epilog=/mnt/sched/slurm/etc/epilogMoneo.sh
Note: This will allow the scripts to be run on each compute node
-
Restart slurmctld and slurmd:
sudo systemctl restart slurmctl
parallel-ssh -h hostfile -i -t 0 "sudo systemctl restart slurmd"
Note: This will allow for the new config to be loaded
-
At this point Moneo will now update job Ids when slurm jobs start.
- If there are issues please check that Moneo is running on all nodes.
- Check the "log_file" to check output from the job id update command.