[C++] Wait until event loop terminates when closing the Client #15316

BewareMyPower · 2022-04-25T15:56:58Z

Motivation

Unlike Java client, the Client of C++ client has a shutdown method
that is responsible to execute the following steps:

Call shutdown on all internal producers and consumers
Close all connections in the pool
Close all executors of the executor providers.

When an executor is closed, it call io_service::stop(), which makes
the event loop (io_service::run()) in another thread return as soon as
possible. However, there is no wait operation. If a client failed to
create a producer or consumer, the close method will call shutdown
and close all executors immediately and exits the application. In this
case, the detached event loop thread might not exit ASAP, then valgrind
will detect the memory leak.

This memory leak can be avoided by sleeping for a while after
Client::close returns or there are still other things to do after
that. However, we should still adopt the semantics that after
Client::shutdown returns, all event loop threads should be terminated.

Modifications

Add a timeout parameter to the close method of ExecutorService and
ExecutorServiceProvider as the max blocking timeout if it's
non-negative.
Add a TimeoutProcessor helper class to update the left timeout after
calling all methods that accept the timeout parameter.
Call close on all ExecutorServiceProviders in
ClientImpl::shutdown with 500ms timeout, which could be long enough.
In addition, in handleClose method, call shutdown in another
thread to avoid the deadlock.

Verifying this change

After applying this patch, the reproduce code in #13267 will pass the
valgrind check.

==3013== LEAK SUMMARY:
==3013==    definitely lost: 0 bytes in 0 blocks
==3013==    indirectly lost: 0 bytes in 0 blocks
==3013==      possibly lost: 0 bytes in 0 blocks

Documentation

Check the box below or label this PR directly.

Need to update docs?

doc-required
(Your PR needs to update docs and you will update later)
no-need-doc
(Please explain why)
doc
(Your PR contains doc changes)
doc-added
(Docs have been already added)

Fixes apache#13267 ### Motivation Unlike Java client, the `Client` of C++ client has a `shutdown` method that is responsible to execute the following steps: 1. Call `shutdown` on all internal producers and consumers 2. Close all connections in the pool 3. Close all executors of the executor providers. When an executor is closed, it call `io_service::stop()`, which makes the event loop (`io_service::run()`) in another thread return as soon as possible. However, there is no wait operation. If a client failed to create a producer or consumer, the `close` method will call `shutdown` and close all executors immediately and exits the application. In this case, the detached event loop thread might not exit ASAP, then valgrind will detect the memory leak. This memory leak can be avoided by sleeping for a while after `Client::close` returns or there are still other things to do after that. However, we should still adopt the semantics that after `Client::shutdown` returns, all event loop threads should be terminated. ### Modifications - Add a timeout parameter to the `close` method of `ExecutorService` and `ExecutorServiceProvider` as the max blocking timeout if it's non-negative. - Add a `TimeoutProcessor` helper class to update the left timeout after calling all methods that accept the timeout parameter. - Call `close` on all `ExecutorServiceProvider`s in `ClientImpl::shutdown` with 500ms timeout, which could be long enough. In addition, in `handleClose` method, call `shutdown` in another thread to avoid the deadlock. ### Verifying this change After applying this patch, the reproduce code in apache#13627 will pass the valgrind check. ``` ==3013== LEAK SUMMARY: ==3013== definitely lost: 0 bytes in 0 blocks ==3013== indirectly lost: 0 bytes in 0 blocks ==3013== possibly lost: 0 bytes in 0 blocks ```

…in different threads

* [C++] Wait until event loops terminates when closing the Client Fixes #13267 ### Motivation Unlike Java client, the `Client` of C++ client has a `shutdown` method that is responsible to execute the following steps: 1. Call `shutdown` on all internal producers and consumers 2. Close all connections in the pool 3. Close all executors of the executor providers. When an executor is closed, it call `io_service::stop()`, which makes the event loop (`io_service::run()`) in another thread return as soon as possible. However, there is no wait operation. If a client failed to create a producer or consumer, the `close` method will call `shutdown` and close all executors immediately and exits the application. In this case, the detached event loop thread might not exit ASAP, then valgrind will detect the memory leak. This memory leak can be avoided by sleeping for a while after `Client::close` returns or there are still other things to do after that. However, we should still adopt the semantics that after `Client::shutdown` returns, all event loop threads should be terminated. ### Modifications - Add a timeout parameter to the `close` method of `ExecutorService` and `ExecutorServiceProvider` as the max blocking timeout if it's non-negative. - Add a `TimeoutProcessor` helper class to update the left timeout after calling all methods that accept the timeout parameter. - Call `close` on all `ExecutorServiceProvider`s in `ClientImpl::shutdown` with 500ms timeout, which could be long enough. In addition, in `handleClose` method, call `shutdown` in another thread to avoid the deadlock. ### Verifying this change After applying this patch, the reproduce code in #13627 will pass the valgrind check. ``` ==3013== LEAK SUMMARY: ==3013== definitely lost: 0 bytes in 0 blocks ==3013== indirectly lost: 0 bytes in 0 blocks ==3013== possibly lost: 0 bytes in 0 blocks ``` (cherry picked from commit cd78f39)

…e#15316) * [C++] Wait until event loops terminates when closing the Client Fixes apache#13267 ### Motivation Unlike Java client, the `Client` of C++ client has a `shutdown` method that is responsible to execute the following steps: 1. Call `shutdown` on all internal producers and consumers 2. Close all connections in the pool 3. Close all executors of the executor providers. When an executor is closed, it call `io_service::stop()`, which makes the event loop (`io_service::run()`) in another thread return as soon as possible. However, there is no wait operation. If a client failed to create a producer or consumer, the `close` method will call `shutdown` and close all executors immediately and exits the application. In this case, the detached event loop thread might not exit ASAP, then valgrind will detect the memory leak. This memory leak can be avoided by sleeping for a while after `Client::close` returns or there are still other things to do after that. However, we should still adopt the semantics that after `Client::shutdown` returns, all event loop threads should be terminated. ### Modifications - Add a timeout parameter to the `close` method of `ExecutorService` and `ExecutorServiceProvider` as the max blocking timeout if it's non-negative. - Add a `TimeoutProcessor` helper class to update the left timeout after calling all methods that accept the timeout parameter. - Call `close` on all `ExecutorServiceProvider`s in `ClientImpl::shutdown` with 500ms timeout, which could be long enough. In addition, in `handleClose` method, call `shutdown` in another thread to avoid the deadlock. ### Verifying this change After applying this patch, the reproduce code in apache#13627 will pass the valgrind check. ``` ==3013== LEAK SUMMARY: ==3013== definitely lost: 0 bytes in 0 blocks ==3013== indirectly lost: 0 bytes in 0 blocks ==3013== possibly lost: 0 bytes in 0 blocks ``` (cherry picked from commit cd78f39) (cherry picked from commit c0c67db)

…e#15316) * [C++] Wait until event loops terminates when closing the Client Fixes apache#13267 ### Motivation Unlike Java client, the `Client` of C++ client has a `shutdown` method that is responsible to execute the following steps: 1. Call `shutdown` on all internal producers and consumers 2. Close all connections in the pool 3. Close all executors of the executor providers. When an executor is closed, it call `io_service::stop()`, which makes the event loop (`io_service::run()`) in another thread return as soon as possible. However, there is no wait operation. If a client failed to create a producer or consumer, the `close` method will call `shutdown` and close all executors immediately and exits the application. In this case, the detached event loop thread might not exit ASAP, then valgrind will detect the memory leak. This memory leak can be avoided by sleeping for a while after `Client::close` returns or there are still other things to do after that. However, we should still adopt the semantics that after `Client::shutdown` returns, all event loop threads should be terminated. ### Modifications - Add a timeout parameter to the `close` method of `ExecutorService` and `ExecutorServiceProvider` as the max blocking timeout if it's non-negative. - Add a `TimeoutProcessor` helper class to update the left timeout after calling all methods that accept the timeout parameter. - Call `close` on all `ExecutorServiceProvider`s in `ClientImpl::shutdown` with 500ms timeout, which could be long enough. In addition, in `handleClose` method, call `shutdown` in another thread to avoid the deadlock. ### Verifying this change After applying this patch, the reproduce code in apache#13627 will pass the valgrind check. ``` ==3013== LEAK SUMMARY: ==3013== definitely lost: 0 bytes in 0 blocks ==3013== indirectly lost: 0 bytes in 0 blocks ==3013== possibly lost: 0 bytes in 0 blocks ``` (cherry picked from commit cd78f39) (cherry picked from commit 6d365c9)

BewareMyPower added component/client-c++ release/2.9.3 release/2.8.4 release/2.10.1 labels Apr 25, 2022

BewareMyPower added this to the 2.11.0 milestone Apr 25, 2022

BewareMyPower requested review from merlimat, rdhabalia, jiazhai, aahmed-se, k2la and massakam April 25, 2022 15:56

BewareMyPower self-assigned this Apr 25, 2022

BewareMyPower changed the title ~~[C++] Wait until event loops terminates when closing the Client~~ [C++] Wait until event loop terminates when closing the Client Apr 25, 2022

github-actions bot added the doc-not-needed Your PR changes do not impact docs label Apr 25, 2022

BewareMyPower marked this pull request as draft April 25, 2022 16:25

Fix testCustomLogger segfault because a shared Logger object is used …

036c1e3

…in different threads

BewareMyPower marked this pull request as ready for review April 25, 2022 16:54

Add missed header

10eeb88

lhotari approved these changes Apr 27, 2022

View reviewed changes

BewareMyPower merged commit cd78f39 into apache:master Apr 27, 2022

BewareMyPower deleted the bewaremypower/cpp-executor-wait branch April 27, 2022 07:10

codelipenghui added the cherry-picked/branch-2.10 label Apr 28, 2022

codelipenghui added the cherry-picked/branch-2.8 Archived: 2.8 is end of life label Apr 28, 2022

codelipenghui added the cherry-picked/branch-2.9 Archived: 2.9 is end of life label Apr 29, 2022

BewareMyPower mentioned this pull request Nov 19, 2022

Pulsar CPP client mem leak #13267

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[C++] Wait until event loop terminates when closing the Client #15316

[C++] Wait until event loop terminates when closing the Client #15316

BewareMyPower commented Apr 25, 2022 •

edited

Loading

[C++] Wait until event loop terminates when closing the Client #15316

[C++] Wait until event loop terminates when closing the Client #15316

Conversation

BewareMyPower commented Apr 25, 2022 • edited Loading

Motivation

Modifications

Verifying this change

Documentation

BewareMyPower commented Apr 25, 2022 •

edited

Loading