
Update examples.rst #211

Merged: 4 commits into dask:master on Jan 3, 2019

Conversation

@leej3 (Contributor) commented Dec 17, 2018

Add an example of passing arguments to workers using the extra keyword.

@guillaumeeb (Member) left a comment

LGTM, just fix the spacing and this could go in, thanks!


cluster = SLURMCluster(
    queue='norm',
    memory = '8g',

@guillaumeeb (Member):

Be careful with spaces around the equals sign.
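For reference, the full call with the spacing fixed would look roughly like this (a sketch assembled from the lines quoted in this review; only the keyword-argument spacing changes):

from dask_jobqueue import SLURMCluster

cluster = SLURMCluster(
    queue='norm',
    memory='8g',           # no spaces around '=' in keyword arguments
    processes=1,
    cores=8,
    extra=['--resources foo=2'],
    job_extra=["--time=03:00:00 "],
)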

@guillaumeeb (Member):

Test failure seems unrelated, something must be broken with PBS.

    processes=1,
    cores=8,
    extra=['--resources foo=2'],
    job_extra=["--time=03:00:00 "],

@guillaumeeb (Member):

Is the space after the time needed?

@lesteve (Member) commented Dec 18, 2018

Would it be possible to show in the example, through some code, what the resources can be useful for, or maybe just mention it in the text? At the moment it looks quite abstract...

@leej3 (Contributor, Author) commented Dec 18, 2018

@guillaumeeb you are correct, apologies.

@lesteve, what do you think of the updated version? I did want to emphasize that the resources are indeed abstract, but I agree that an example makes it much clearer why someone might wish to do this. I also wanted it to be clear what the extra keyword is doing. Is this clearer now?

@guillaumeeb (Member):

@lesteve glad to see you chiming in! It's really helpful to have another look at all this.

@leej3 could you maybe add a line of code on how you submit jobs/tasks using the resources?
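For example, something roughly along these lines (just a sketch; process is a placeholder, and the cluster is assumed to have been created with extra=['--resources foo=2'] as in the snippet above):

from dask import delayed
from distributed import Client

client = Client(cluster)  # cluster created with extra=['--resources foo=2']

def process(x):
    return x * 2

# A single task constrained to workers that advertise the 'foo' resource:
future = client.submit(process, 1, resources={'foo': 1})

# Or a batch of delayed tasks with the same constraint:
processed = [delayed(process)(i) for i in range(10)]
futures = client.compute(processed, resources={'foo': 1})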

@leej3 (Contributor, Author) commented Dec 18, 2018

@guillaumeeb how about this? I could make it a minimal example by setting it up with something like:

from dask import delayed

def load(fn):
    pass

def process(data):  # needs some foo
    import time
    time.sleep(5)

def reduce(*data):  # foo intensive
    return "Processing finished..."

raw = [delayed(load)(i) for i in range(10)]
processed = [delayed(process)(r) for r in raw]
reduced = delayed(reduce)(*processed)

That would be useful to people. Perhaps too much to digest though...

@willirath (Collaborator):

I might be wrong about this, but I don't see the point of having resources specified when dask-jobqueue does not yet support heterogeneous clusters.

Having said that, I'd be very interested in joining any effort towards implementing / documenting heterogeneous clusters.

@guillaumeeb (Member):

@willirath, this is more or less needed right now to nicely limit the number of tasks on a given node while still reserving all the computing resources, see #181 (comment).
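For illustration, a minimal sketch of that pattern (the resource name 'slots' and the node sizes are placeholders): reserve a full node per worker, but advertise only one unit of an artificial resource so that at most one tagged task runs per node at a time.

from dask_jobqueue import SLURMCluster
from distributed import Client

# One worker per node, reserving all cores and memory, but exposing a single
# unit of an artificial 'slots' resource to cap concurrent tagged tasks at one.
cluster = SLURMCluster(cores=24, processes=1, memory='120GB',
                       extra=['--resources slots=1'])
cluster.scale(4)
client = Client(cluster)

def heavy_step(i):
    return i  # placeholder for a memory-hungry computation

futures = [client.submit(heavy_step, i, resources={'slots': 1}) for i in range(20)]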

> Having said that, I'd be very interested in joining any effort towards implementing / documenting heterogeneous clusters.

That would be really cool!

I don't know if something is feasible by hand right now, maybe it is if we modify JobQueueCluster object between scale calls. But the real idea is to properly implement that in ClusterManager. See dask/distributed#2118 and dask/distributed#2208 (comment) if you've not already.

@guillaumeeb (Member):

@leej3 your example feels hard to understand. Maybe we can start with something easier? I'm not sure.

@leej3 (Contributor, Author) commented Dec 18, 2018

Is something like this more along the lines of what you mean?


processed = [delayed(process)(i) for i in range(10)]
futures = client.compute(processed,
                         resources={tuple(processed): {'foo': 1}})

Member:

Why do you use {tuple(processed): {'foo': 1}}, and not directly {'foo': 1}?

@leej3 (Contributor, Author):

Hmmm. This may just be due to my own ignorance, so correct me if so. I was under the impression that since resources is passed as a dictionary, its keys need to be a hashable type; passed as a list, the call fails.

Or are you referring to the fact that, in general, the entire task graph could be executed with a single resource specification? That would work for the example of constraining resources for the full task graph, but it would not generalize beyond that (where one specifies different constraints at different steps of the pipeline).

I found few examples of specifying resource constraints (at the level of futures), which I felt was not easy to intuit. Perhaps this is not the place to demonstrate it, though. Happy to simplify it further.
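In code, the two forms would look roughly like this (a sketch; process, the 'foo' resource, and the client are the placeholders from earlier in the thread):

from dask import delayed

def process(x):
    return x + 1

processed = [delayed(process)(i) for i in range(10)]

# Single specification applied to everything being computed:
futures = client.compute(processed, resources={'foo': 1})

# Per-collection specification; dict keys must be hashable, hence tuple(...).
# This form allows different constraints at different stages of a pipeline:
futures = client.compute(processed,
                         resources={tuple(processed): {'foo': 1}})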

@willirath (Collaborator) commented Dec 18, 2018

> @willirath, this is more or less needed right now to nicely limit the number of tasks on a given node while still reserving all the computing resources, see #181 (comment).

My suggestion would be not to try and come up with an abstract example but present one or two from the real world. So what about:

from dask_jobqueue import SLURMCluster
from distributed import Client
from dask import delayed

cluster = SLURMCluster(memory='8g',
                       processes=1,
                       cores=8,
                       extra=['--resources "ssdGB=200 GPU=2"'])

cluster.start_workers(2)
client = Client(cluster)


def step_1_w_single_GPU(data):
    return "Step 1 done for: %s" % data


def step_2_w_local_IO(data):
    return "Step 2 done for: %s" % data


stage_1 = [delayed(step_1_w_single_GPU)(i) for i in range(10)]
stage_2 = [delayed(step_2_w_local_IO)(s2) for s2 in stage_1]

result_stage_2 = client.compute(stage_2,
                                resources={tuple(stage_1): {'GPU': 1},
                                           tuple(stage_2): {'ssdGB': 100}})

(I did not test this!)

@willirath (Collaborator):

> I don't know if something is feasible by hand right now, maybe it is if we modify JobQueueCluster object between scale calls. But the real idea is to properly implement that in ClusterManager. See dask/distributed#2118 and dask/distributed#2208 (comment) if you've not already.

Thanks for these pointers! I'll definitely read into this.

@leej3 (Contributor, Author) commented Dec 18, 2018

Sounds good. I tested it to confirm it works.

@guillaumeeb (Member) left a comment

Remove the part on the "foo" resource.

argument is for the specification of abstract resources, described `here
<http://distributed.dask.org/en/latest/resources.html>`_. This might be to
specify special hardware availability that the scheduler is not aware of, for
example GPUs. Below, an arbitrary resource "foo" is specified. Notice that the

@guillaumeeb (Member):

It is not an arbitrary resource anymore!

@leej3 (Contributor, Author) commented Jan 3, 2019

Apologies, missed that. Fixed now. Thanks!

@guillaumeeb (Member):

Thanks @leej3! Merging.

@guillaumeeb merged commit fa62c0e into dask:master on Jan 3, 2019