-
-
Notifications
You must be signed in to change notification settings - Fork 144
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add LSF #4
Comments
Happy to give this a go. We use LSF at work. So this should be pretty useful. |
@jakirkham if it helps, I tried making a dask-distributed LSF script a while back. I encountered an error I couldn't fix, but maybe you can? Here's a gist. |
Just to split the task in smaller chunks:
Once implemented, it would be great to test it on a LSFCluster that you have at your disposal. Looking at dask_jobqueue/slurm.py, dask_jobqueue/pbs.py, dask_jobqueue/sge.py is a good way to get started too. |
Sorry have been busy with other things lately. If someone has interest and time, they should feel free to go ahead. |
Yep, that was what I had in mind by specifying the steps in more details. |
@lesteve I can give this a go; thanks for the direction. Is there a specific fixture-based test implemented for other cluster methods, or am I free to try some of my own tasks? |
It'd be great if you could give this a go! For the first iteration, I think you can try to put together and run a small snippet, e.g. something along these lines and get it to work on your local LSFCluster: from dask_jobqueue import LSFCluster, Client
cluster = LSFCluster(...) # use some arguments that make sense on your local cluster
client = Client(cluster)
result = client.map(lambda x: x + 1, range(10))
client.gather(result) For tests, you could probably took some inspiration from the existing tests from test_slurm.py, test_pbs.py, etc ... |
Still working through my PR and trying to get it to work. Subsequent testing (once working) on other LSF clusters will be great. |
@lesteve Thanks for the heads up. Looks like @raybellwaves is making good headway, I'll step aside. |
While #78 is almost finished, i stumbled across https://github.com/IBMSpectrumComputing/lsf-python-api. May be of interest for the future e.g. implementing something like |
This has been merged now. Happy to know if it works on other LSF schedulers. Not just UM Pegasus. |
I am going to close this one since the associated PR has been merged. @raybellwaves note you can use "Fix #issueNumber" in your PR description, this way the associated issue gets closed automatically when the PR is merged. For more details, look at this. About the psutil problem that you bumped into, I think it's quite hard to guess what the root cause is but I would bet that it was a problem with your environment (somehow you ended up with a broken Note |
It might be valuable to extend this repository with a solution for LSF. My hope is that this is relatively easy for someone with modest LSF experience. Looking at the current solutions for PBS or SLURM might be helpful (they're about 100 lines, mostly docstrings)
The text was updated successfully, but these errors were encountered: