Async read from socket #17868
Conversation
a96eb7e to faa5b71
src/IO/ReadBufferFromPocoSocket.cpp (outdated)
```cpp
{
    //fiber->fd = socket.impl()->sockfd();
    //fiber->timeout = socket.impl()->getReceiveTimeout();
    *fiber = std::move(*fiber).resume();
```
So when the read would block, we (1) resume the caller fiber. Before that, we save into the processor state:
- the current call state as a fiber
- the socket descriptor for the caller to do epoll on
- a mark that the processor result is Async

(2) After that, the caller sees that the processor result is Async and adds the corresponding file descriptor to its epoll set.
(3) When epoll succeeds, we see for which descriptor it fired, and resume the corresponding fiber we created at step (1).
Am I getting this right?
Yes.
Also, epoll is executed in the PipelineExecutor, in a separate thread. When it succeeds, we continue execution of the processor, which continues fiber execution. And the next read should return some bytes.
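A minimal sketch of this flow, assuming a boost::context::fiber design like the snippet above; the names `ReadContext`, `onWouldBlock`, `registerInEpoll` and `onSocketReady` are illustrative, not the PR's actual code:

```cpp
#include <boost/context/fiber.hpp>
#include <sys/epoll.h>
#include <utility>

namespace ctx = boost::context;

/// State saved into the processor when a read would block.
struct ReadContext
{
    ctx::fiber fiber;       /// the suspended read call (step 1)
    int fd = -1;            /// socket descriptor for the caller to epoll on
    bool is_async = false;  /// "the processor result is Async"
};

/// (1) Inside the reading fiber: the socket would block, so we save the
/// descriptor, mark the result as Async and resume the caller fiber.
/// The resume() call suspends us right here until step (3).
void onWouldBlock(ReadContext & rc, int sockfd, ctx::fiber & caller)
{
    rc.fd = sockfd;
    rc.is_async = true;
    caller = std::move(caller).resume();
}

/// (2) The caller sees Async and adds the descriptor to its epoll set.
void registerInEpoll(int epoll_fd, ReadContext & rc)
{
    epoll_event ev{};
    ev.events = EPOLLIN;
    ev.data.ptr = &rc;
    epoll_ctl(epoll_fd, EPOLL_CTL_ADD, rc.fd, &ev);
}

/// (3) epoll reported this descriptor as ready: resume the saved fiber;
/// the suspended read retries and should now return some bytes.
void onSocketReady(ReadContext & rc)
{
    rc.fiber = std::move(rc.fiber).resume();
}
```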
```cpp
read_context.is_read_in_progress = true;
read_context.packet = connections.receivePacketUnlocked(&sink);
read_context.is_read_in_progress = false;
```
Never thought I'd suggest adding a callback instead of removing one, but there's always a first time: did you consider injecting a callback into the connection's read, one that would set this flag and also save the corresponding socket descriptor? You're injecting the fiber there anyway.
I can barely understand this control flow... At least with a callback you'd be saving all the resume data (socket, flag, fiber) in one place, not in three different ones.
Hm, ok. I will try it.
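A rough sketch of what such an injected callback could look like; the names `AsyncCallback`, `ResumeData` and `makeAsyncCallback` are assumptions, not the PR's code. The point is that the flag, the descriptor and (eventually) the fiber all end up saved in one place:

```cpp
#include <functional>

/// All the resume data kept together, as the reviewer suggests.
struct ResumeData
{
    bool is_read_in_progress = false;
    int fd = -1;
    /// the suspended fiber would be stored here as well
};

/// Callback invoked by the connection's read path just before it would block.
using AsyncCallback = std::function<void(int /* socket fd */)>;

/// The caller installs one lambda that captures everything needed to resume.
AsyncCallback makeAsyncCallback(ResumeData & data)
{
    return [&data](int fd)
    {
        data.is_read_in_progress = true;
        data.fd = fd;
    };
}
```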
```cpp
{
    std::lock_guard guard(fiber_lock);
    if (!fiber)
        return false;
```
No reading fiber to resume means 'end of data', so we return false, which also means 'end of data', right?
Yes.
Also, we can delete the fiber in the `cancel` call, so this is an indication that we should finish.
```cpp
return false;

{
    std::lock_guard guard(fiber_lock);
```
Why is this one needed? The fiber saves the state of one exchange with the remote server, which can only be logically processed in a sequential fashion. Where does the multi-threaded concurrent access arise?
That's in the `cancel` method, which destroys the fiber.
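A sketch of the locking scheme being discussed, assuming a boost::context::fiber and the `fiber_lock` mutex from the snippet above (`resumeIfAlive` and `cancel` are illustrative names):

```cpp
#include <boost/context/fiber.hpp>
#include <mutex>
#include <utility>

std::mutex fiber_lock;
boost::context::fiber fiber;

/// The resume path: no fiber left means cancel() already ran,
/// which we treat the same as 'end of data'.
bool resumeIfAlive()
{
    std::lock_guard guard(fiber_lock);
    if (!fiber)
        return false;
    fiber = std::move(fiber).resume();
    return true;
}

/// cancel() may be called from another thread, hence the lock:
/// it destroys the suspended fiber so the next resume attempt returns false.
void cancel()
{
    std::lock_guard guard(fiber_lock);
    fiber = {};
}
```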
```cpp
if (is_pipe_alarmed)
    return false;

if (has_timer_alarm && !is_socket_ready)
```
Ah, so the timer is for checking the timeouts, OK. Maybe one giant timer per the entire pipeline executor would suffice (i.e. it would go to the RemoteIOQueue aka AsyncTaskQueue). This would save us some wakeups -- I imagine this might matter if you have a big cluster with a lot of queries running.
Another thing that would require periodic wakeups is our main task -- switching over to other connections if the first one is slow. But you're not implementing it here yet, right?
> Maybe one giant timer per the entire pipeline executor would suffice

I had the same idea, but there are considerations:
- different sockets may have different timeouts
- if the timeout for one socket is exceeded, we may not notice it in case the other sockets are fine
- there may be other async processors with different logic

> This would save us some wakeups

Actually, I think quite the opposite. Now we restart the timer after every read, so the timer only signals if the timeout is really exceeded.

> switching over to other connections if the first one is slow. But you're not implementing it here yet, right?

Right)
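A sketch of the per-socket timer behavior described above, assuming a Linux timerfd that is re-armed after every read (function names are illustrative, not the PR's code):

```cpp
#include <sys/timerfd.h>
#include <cstdint>
#include <ctime>

/// One non-blocking timer per socket; its descriptor can live in the same
/// epoll set as the socket itself.
int createTimeoutTimer()
{
    return timerfd_create(CLOCK_MONOTONIC, TFD_NONBLOCK | TFD_CLOEXEC);
}

/// Re-arming after every read means the timer fires only when the receive
/// timeout for this particular socket is really exceeded.
void restartTimer(int timer_fd, uint64_t timeout_sec)
{
    itimerspec spec{};  /// it_interval left zero: one-shot timer
    spec.it_value.tv_sec = static_cast<time_t>(timeout_sec);
    timerfd_settime(timer_fd, 0, &spec, nullptr);
}
```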
```diff
@@ -207,7 +204,7 @@ class IProcessor
      * Note that it can fire many events in EventCounter while doing its job,
      * and you have to wait for next event (or do something else) every time when 'prepare' returned Wait.
      */
-    virtual void schedule(EventCounter & /*watch*/)
+    virtual int schedule()
```
The comment should be updated to reflect the new return value.
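One possible wording for the updated comment (an assumption about intent, not the author's final text), reflecting that `schedule()` now returns a descriptor for the executor's epoll set instead of firing events on an EventCounter:

```cpp
/** Called when 'prepare' returned Async.
  * Returns a file descriptor; the executor adds it to its epoll set and
  * resumes the processor once the descriptor becomes ready.
  */
virtual int schedule();
```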
I get the general idea; looks OK to merge after adding the comments about the things we discussed above. Also, there are some commented-out lines here and there.
```diff
@@ -28,10 +28,23 @@ bool ReadBufferFromPocoSocket::nextImpl()
     ssize_t bytes_read = 0;
     Stopwatch watch;
 
+    int flags = 0;
+    if (async_callback)
+        flags |= MSG_DONTWAIT;
```
Note that `async_socket_for_remote` is completely broken with unbundled poco, since it cannot handle `MSG_DONTWAIT` correctly.
But IIUC the arcadia build does not support distributed queries, so it is not a problem there?
Indeed.
@azat, does it cause any problem for you? I think I can just read from the socket manually and avoid using poco in this case.
> @azat does it cause any problem for you?

For development I'm using an unbundled build (since it is faster), and I was using unbundled poco there, but now I've switched to bundled poco, so it is not a problem anymore.

> I think I can just read from the socket manually and avoid using poco in this case.

Maybe it is worth it, since this patch already has some low-level code anyway (`epoll_ctl` and similar), although I'm not sure.
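A sketch of what "reading from the socket manually" could look like, bypassing poco's recv wrapper (illustrative names; assumes a POSIX socket descriptor):

```cpp
#include <sys/socket.h>
#include <cerrno>
#include <cstddef>

enum class ReadResult { Ok, Eof, WouldBlock, Error };

/// Non-blocking read: on EAGAIN/EWOULDBLOCK the caller would suspend the
/// fiber and add sockfd to the epoll set, exactly as in the async path.
ReadResult tryReadNonBlocking(int sockfd, char * buf, size_t size, ssize_t & bytes_read)
{
    bytes_read = ::recv(sockfd, buf, size, MSG_DONTWAIT);
    if (bytes_read > 0)
        return ReadResult::Ok;
    if (bytes_read == 0)
        return ReadResult::Eof;
    if (errno == EAGAIN || errno == EWOULDBLOCK)
        return ReadResult::WouldBlock;
    return ReadResult::Error;
}
```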
Will it close #9900?
I hereby agree to the terms of the CLA available at: https://yandex.ru/legal/cla/?lang=en
Changelog category (leave one):
Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):
Support for async tasks in `PipelineExecutor`. Initial support of async sockets for remote queries.