Clarification on job process location #148

Answered by Darkness4
JossWhittle asked this question in Q&A
Hello, when a SLURM job is submitted from the login node, the SLURM controller (slurmctld) receives the allocation request and allocates resources on a compute node where a SLURM daemon (slurmd) is running.

This compute node runs the actual job. Compute nodes are bare-metal servers provisioned by the containerized xCAT service. They can also be VMs provisioned via Terraform (we are working on this). For both VMs and bare-metal servers, we use Packer to build our images.
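
If you want to check for yourself where a job's processes end up, here is a minimal sketch (not part of the project, just an illustration). It assumes the script is launched through `srun` or `sbatch`, in which case SLURM sets `SLURM_JOB_ID` and `SLURMD_NODENAME` in the job environment. The helper name `describe_job_location` is made up for this example.

```python
import os
import socket


def describe_job_location(env=None):
    """Report which host this process runs on and any SLURM job context."""
    # Allow passing a custom mapping so the function can be tried outside a job.
    env = os.environ if env is None else env
    return {
        "hostname": socket.gethostname(),
        # Set by SLURM inside a job; None when the script is run
        # directly on the login node, outside any allocation.
        "job_id": env.get("SLURM_JOB_ID"),
        "node_name": env.get("SLURMD_NODENAME"),
    }


if __name__ == "__main__":
    for key, value in describe_job_location().items():
        print(f"{key}: {value}")
```

Run directly on the login node, it should report the login node's hostname with no job ID. Run through `srun`, it should report the hostname of the compute node that received the allocation.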

SLURM jobs are resource-constrained by cgroups, but the user running the job can still access the "host" filesystem. If you want to get the same filesystem isolation as a container, you can use the Pyxis plugin or add an OCI runtime…
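
To see the cgroup confinement from inside a job, a small sketch like the one below can help. It only relies on the standard Linux `/proc/self/cgroup` file. The function names are hypothetical, and the exact cgroup paths SLURM creates depend on the cluster's cgroup version and configuration, so treat the output as something to inspect rather than a fixed format.

```python
import os
from pathlib import Path


def parse_cgroup_lines(text):
    """Parse /proc/<pid>/cgroup content into (hierarchy_id, controllers, path) tuples."""
    entries = []
    for line in text.splitlines():
        if not line.strip():
            continue
        # Each line has the form "hierarchy-ID:controller-list:cgroup-path".
        hierarchy_id, controllers, path = line.split(":", 2)
        entries.append((hierarchy_id, controllers, path))
    return entries


def current_cgroups(proc_file="/proc/self/cgroup"):
    """Return the cgroup entries of the current process, or [] if unavailable."""
    path = Path(proc_file)
    if not path.exists():
        return []
    return parse_cgroup_lines(path.read_text())


if __name__ == "__main__":
    for hierarchy_id, controllers, path in current_cgroups():
        print(f"hierarchy={hierarchy_id} controllers={controllers or '-'} path={path}")
    # Without Pyxis or an OCI runtime, this lists the compute node's own root.
    print(sorted(os.listdir("/")))
```

Inside a job, the cgroup paths should differ from those of a login shell, while the root directory listing is simply the compute node's filesystem.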

Answer selected by JossWhittle
Category
Q&A
Labels
kind/discussion, area/slurm
2 participants
This discussion was converted from issue #147 on November 18, 2022 12:29.