
Add asyncio.streams Comm #2165

Closed
wants to merge 7 commits into from

Conversation


@mrocklin mrocklin commented Aug 7, 2018

This only works with Python 3.7. I had to pull out and modify some asyncio code. It also makes no attempt at TLS.

infos = [socket.getaddrinfo(host, port, family=family, flags=flags)
         for host in hosts]
infos = set(itertools.chain.from_iterable(infos))
infos = [info for info in infos if info[1] == socket.SocketKind.SOCK_STREAM]

@mrocklin mrocklin Aug 7, 2018


I changed the first two of these lines to use the blocking socket API rather than asyncio's non-blocking API.

The last line (the SOCK_STREAM filter) is a hack. I added it to get past errors like the following:

    def _start_serving(self):
        if self._serving:
            return
        self._serving = True
        for sock in self._sockets:
>           sock.listen(self._backlog)
E           OSError: [Errno 95] Operation not supported

I suspect that there is a way to handle this by passing in the right arguments to start_server, but I haven't found it yet.
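For context, this failure is easy to reproduce outside the patch: when getaddrinfo is called without a socket type it returns datagram (and sometimes raw) entries alongside the stream entries, and listen() is only valid on stream sockets. A rough illustration, not part of this PR, using an arbitrary host and port:

    import socket

    # With no type argument, getaddrinfo may return SOCK_STREAM, SOCK_DGRAM
    # and SOCK_RAW entries for the same host/port.
    for family, kind, proto, canonname, sockaddr in socket.getaddrinfo("127.0.0.1", 8786):
        print(kind, sockaddr)

    # Calling listen() on a datagram socket is what produces the
    # "Operation not supported" error above.
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    try:
        sock.listen(128)
    except OSError as exc:
        print(exc)  # on Linux: [Errno 95] Operation not supported
    finally:
        sock.close()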


mrocklin commented Aug 8, 2018

This gets us down to about a 600us roundtrip time, from 800us, which is a bit of a win.

In [1]: from dask.distributed import Client
   ...: client = Client()
   ...: client
   ...: 
   ...: 
Out[1]: <Client: scheduler='tcp://127.0.0.1:35289' processes=4 cores=4>

In [2]: async def f():
   ...:     for i in range(10000):
   ...:         await client.scheduler.identity()
   ...: %time client.sync(f)
   ...: 
   ...: 
CPU times: user 5.8 s, sys: 467 ms, total: 6.27 s
Wall time: 6.31 s
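(For reference, 6.31 s of wall time over 10,000 round trips works out to roughly 630us per request, consistent with the ~600us figure above.)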


mrocklin commented Aug 8, 2018

Oooh, I can get this down to 470us if

  1. I bundle all of the writes into one for small messages (this is definitely doable; see the sketch below)
  2. I engage the asyncio equivalent of call_soon (tornadoweb/tornado#2463)

At this point about 70% of scheduler time is spent in socket.send (see https://stackoverflow.com/questions/51731690/why-does-asyncio-spend-time-in-socket-senddata)
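To make the first item concrete, here is a rough sketch of what bundling looks like: coalesce a small message's frames into one buffer so the stream writer is called once per message instead of once per frame. The function name and length-prefix layout are illustrative, not distributed's actual wire format:

    import struct

    def write_small_message(writer, frames):
        # writer is an asyncio.StreamWriter, frames is a list of bytes objects.
        # Pack a frame count and the frame lengths, then issue a single write()
        # for the whole message rather than one write() per frame.
        header = struct.pack("Q", len(frames))
        header += struct.pack("%dQ" % len(frames), *map(len, frames))
        writer.write(header + b"".join(frames))

For large messages the extra copy from joining the frames would hurt, which is one more reason to benchmark big transfers separately.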

@mrocklin mrocklin changed the title from "WIP prototype of an Asyncio streams comm" to "Add asyncio.streams Comm" on Aug 8, 2018

mrocklin commented Aug 8, 2018

OK, this could use review by someone who knows this stack better than I do.

This currently implements a functional Comm using asyncio streams. This seems to perform a bit better when doing many small fast messages. I haven't yet checked with larger messages (though that should be interesting).

There are a few challenges:

  1. asyncio's start_server function is a coroutine, which means that Listener.start would need to be asynchronous, which in turn means that all of the Dask Server.listen methods would need to become asynchronous (they currently are not); that would be a bit of a pain. (A minimal sketch of that asynchronous shape follows this list.)

    Currently my solution to this is to copy over and patch a couple of functions in asyncio (see distributed/comm/_asyncio_utils.py), but this is fragile and only works for specific versions (the Server class changed a bit between Python 3.6 and 3.7).

  2. I haven't figured out how to do IPv6. I suspect that this is a simple matter of passing a different flag when creating a connection or starting a server.

  3. Currently we fail distributed/comm/tests/test_comms.py::test_tcp_comm_closed_implicit. When the server severs our connection we don't notice it on the client side.
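To illustrate the shape of the first problem, this is roughly what an asyncio-native listener wants to look like if starting is allowed to be asynchronous (the class and method names here are illustrative, not the API in this PR):

    import asyncio

    class AsyncioListener:
        def __init__(self, host, port, comm_handler):
            self.host = host
            self.port = port
            self.comm_handler = comm_handler
            self.server = None

        async def start(self):
            # asyncio.start_server is a coroutine, so start() must be awaited;
            # Tornado's TCPServer.listen, by contrast, is a plain method call.
            self.server = await asyncio.start_server(
                self.comm_handler, host=self.host, port=self.port)

        def stop(self):
            if self.server is not None:
                self.server.close()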

@pitrou if you have time to look things over here I would appreciate it.


mrocklin commented Aug 8, 2018

Current tests on the CI systems aren't that meaningful. This only works in Python 3.7 because it copies over and modifies asyncio code.


mrocklin commented Aug 8, 2018

Oh, and I'm also somewhat blocked on this problem: https://stackoverflow.com/questions/51731690/why-does-asyncio-spend-time-in-socket-senddata

A very simple benchmark that sends a small message back and forth currently spends up to about 70% of its time in socket.send.


pitrou commented Aug 9, 2018

I also suggest you run a benchmark with large messages, because asyncio doesn't have all the optimizations that we added to Tornado.


pitrou commented Aug 9, 2018

A very simple benchmark that sends a small message back and forth currently spends up to about 70% of its time in socket.send.

Just for the record, which profiler are you using? cProfile or a sampling profiler? Do you have other threads going on in the background?

And how many send calls is that per second? Or, to phrase it another way, what is the average duration of a send call?


mrocklin commented Aug 9, 2018

Just for the record, which profiler are you using? cProfile or a sampling profiler? Do you have other threads going on in the background?

Sampling profiler. The same one that we use for worker threads. See #2144

I also suggest you run a benchmark with large messages, because asyncio doesn't have all the optimizations that we added to Tornado.

I agree. My short-term plan is to include both, but have Tornado be the default. I'm dealing with some workloads now that need relatively low-latency communication of many small messages. Using this and many other tricks I can get round-trip message latency down to about 450us (see #2156).

And how many send calls is that per second? Or, to phrase it another way, what is the average duration of a send call?

That's a good question. I'll find out.
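One way to find out, sketched here as a hypothetical measurement harness rather than anything in this PR, is to wrap socket.socket.send with a timer while the benchmark runs:

    import socket
    import time

    _orig_send = socket.socket.send
    _stats = {"calls": 0, "seconds": 0.0}

    def _timed_send(self, data, *args):
        # Count send() calls and accumulate the time spent inside them.
        start = time.perf_counter()
        try:
            return _orig_send(self, data, *args)
        finally:
            _stats["calls"] += 1
            _stats["seconds"] += time.perf_counter() - start

    socket.socket.send = _timed_send
    # ... run the benchmark, then:
    # print(_stats["calls"], _stats["seconds"] / _stats["calls"])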

@@ -0,0 +1,186 @@
"""
Member Author

This stuff is probably no longer necessary. Since doing this work we've made Listeners optionally awaitable, so we should be able to handle things appropriately on the Dask side.
