Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support Slurm Heterogeneous Job #1741

Open
sunshine-syz opened this issue Sep 15, 2023 · 2 comments
Open

Support Slurm Heterogeneous Job #1741

sunshine-syz opened this issue Sep 15, 2023 · 2 comments

Comments

@sunshine-syz
Copy link

Does submitit support Slurm Heterogeneous Job?
If so, how can we submit heterogeneous job?
If not, could you enhance the code to support it?

@gwenzek
Copy link
Contributor

gwenzek commented Sep 18, 2023

it's not supported atm, and from an API perspective I'm not sure how to handle this.
currently the api assumes there is one configuration per job, while here you want several configurations in the same job.
Not impossible, but also non trivial. What's the use case ? can you approximate this by starting two jobs ?

@sunshine-syz
Copy link
Author

For example, if you want to start a distributed job running on two different GPUs or CPUs with different specs, and they need to communicate with each other and they cannot be started separately.

Here is one example:
https://research-computing.git-pages.rit.edu/docs/slurm_tutorial_2.html
https://slurm.schedmd.com/heterogeneous_jobs.html#submitting

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants