Consider high water mark for sending messages #161

Closed

mschubert opened this issue Jul 11, 2019 · 3 comments

mschubert (Owner) commented Jul 11, 2019

From ropensci/drake#933 (comment):

The problem is that all of these transfers are being started concurrently. If Qsys$private$send() blocked execution until the transfer was complete, then the memory requirements would be limited to the dependencies for one target. However, the current situation is that Qsys$private$send() returns as soon as the transfer is scheduled (because rzmq::send.socket() returns as soon as the transfer is scheduled), and so drake schedules the next target, which means filling the buffer for the next transfer before the first has finished. The result is that dependencies for many targets (whether they are the same data or not) are buffered on the master at the same time.

This could be addressed using ZMQ_HWM.
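For reference, in current libzmq the old ZMQ_HWM option is split into ZMQ_SNDHWM and ZMQ_RCVHWM, set per socket via zmq_setsockopt() before bind/connect. A minimal C sketch of capping the send queue (plain libzmq, not clustermq code; the endpoint and limit are made up):

```c
#include <zmq.h>

int main(void) {
    void *ctx  = zmq_ctx_new();
    void *sock = zmq_socket(ctx, ZMQ_PUSH);

    /* Keep at most 2 outstanding messages in this socket's send queue
     * (the limit counts messages, not bytes). */
    int hwm = 2;
    zmq_setsockopt(sock, ZMQ_SNDHWM, &hwm, sizeof(hwm));
    zmq_bind(sock, "tcp://*:5555");          /* made-up endpoint */

    /* With the default (blocking) flags, zmq_send() waits here once the
     * queue is full instead of buffering more data on the sender. */
    const char msg[] = "work chunk";
    zmq_send(sock, msg, sizeof(msg), 0);

    zmq_close(sock);
    zmq_ctx_term(ctx);
    return 0;
}
```

That would bound how many serialized targets the master holds in memory at any one time.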

An alternative is using blocking sends: ropensci/drake#933 (comment)

It looks like pbdZMQ uses blocking connections by default. This is also the behavior of the rzmq-compatibility wrapper function, so blocking will come along for the ride by default if clustermq switches to using pbdZMQ.
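At the libzmq level, the difference between the two options comes down to the flags passed to zmq_send(): the default is blocking, while ZMQ_DONTWAIT returns immediately. A rough C sketch (assuming an already-connected socket; this is not how rzmq or pbdZMQ wrap the calls, just the underlying behaviour):

```c
#include <zmq.h>
#include <errno.h>
#include <stdio.h>
#include <string.h>

/* Sketch only: 'sock' is an already-connected ZeroMQ socket. */
void send_blocking(void *sock, const char *buf) {
    /* Default flags: blocks until the message can be queued, i.e. until
     * the socket is below its send high water mark again. */
    zmq_send(sock, buf, strlen(buf), 0);
}

void send_nonblocking(void *sock, const char *buf) {
    /* ZMQ_DONTWAIT: fails with EAGAIN if the message cannot be queued
     * right now, leaving it to the caller to retry later. */
    if (zmq_send(sock, buf, strlen(buf), ZMQ_DONTWAIT) == -1 &&
        zmq_errno() == EAGAIN)
        fprintf(stderr, "send queue full, retry later\n");
}
```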

wlandau (Contributor) commented Jul 12, 2019

Do you plan to expose the high water mark? Some workflows are not memory intensive and could still benefit from the existing non-blocking behavior.

mschubert (Owner, Author) commented

I wasn't planning to, but I will have to investigate exactly how ZeroMQ handles this with its I/O threads.

Generally, I wouldn't expect this to be an issue because we're blocking on receiving anyway, and the workers will be saturated in either case.

If there's a problem with that approach that needs user or package intervention, I will expose it; otherwise not.
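To illustrate the point about blocking on receiving: if the main loop only hands out the next chunk of work after a worker has reported in, every send is preceded by a blocking receive, so making the send itself blocking should add little extra waiting. A generic sketch (plain C over a REP-style socket; this is not the actual clustermq event loop):

```c
#include <zmq.h>
#include <string.h>

/* Generic request/reply loop: block until a worker asks for work,
 * then reply with the next chunk. */
void serve(void *rep_sock, const char *const *chunks, int n) {
    char ready[64];
    for (int i = 0; i < n; i++) {
        zmq_recv(rep_sock, ready, sizeof(ready), 0);          /* wait for a worker */
        zmq_send(rep_sock, chunks[i], strlen(chunks[i]), 0);  /* blocking send */
    }
}
```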

mschubert (Owner, Author) commented

@brendanf This may already be fixed in the v0.9 branch; please test it if you have the time.
