Don't drain mode: worker threads are interrupted on shutdown #559

dannpopescu · 2023-02-23T17:58:42Z

Hi,

I’ve already written about a bug we’ve discovered with the DRAIN shutdown mode here: #552. After we realized the PC keeps polling messages in the DRAIN mode we chose the DONT_DRAIN mode to gracefully shutdown the PC, but it behaves unexpectedly too.

The version in use is 0.5.2.4.

Our use case is pretty standard I would say. We use only the core module and message processing involves network calls to other services. When calling them, errors are expected, so we commit the offsets in both the happy case and not-so-happy case.

The problem comes at the shutdown. In the DONT_DRAIN mode the PC will shutdown the worker threads via interrupts. This means that all in-progress messages will fail on network calls. Of course, we don’t want to commit the failed messages in this case. At the moment, to avoid committing the message’s offset, we catch all the exceptions thrown by the processing code and check if the root cause is an InterruptedException. If it’s the case then throw the exception out of the user function to the PC. This way the PC puts the message in the retry queue and doesn’t commit it. This approach works at the moment, but it’s kinda hacky I would say.

All this said, do you think it’s reasonable to change the way in which the worker thread pool is shutdown so that it doesn’t interrupt the worker threads, but instead lets the already scheduled tasks to finish gracefully? This is basically adhering to DONT_DRAIN's current description:

Stop downloading more messages, and stop procesing more messages in the queue, but finish processing messages already being processed locally.

The text was updated successfully, but these errors were encountered:

antonmos · 2023-07-25T21:37:53Z

@niamhthornbury I noticed that you closed #593 that was trying to address this issue. Could you share any plans to address this?

rkolesnev · 2023-08-03T16:14:34Z

@dannpopescu, @antonmos - We just released 0.5.2.6 build of PC - this and #552 are both fixed in it.
Closing the issue.

dannpopescu mentioned this issue May 22, 2023

Drain mode: PC keeps polling messages after closeDrainFirst #552

Closed

eddyv mentioned this issue Jun 27, 2023

Pl 176/dont drain issue #593

Closed

2 tasks

rkolesnev mentioned this issue Aug 2, 2023

PL-176 handle close dont drain mode gracefully #615

Merged

2 tasks

rkolesnev closed this as completed Aug 3, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Don't drain mode: worker threads are interrupted on shutdown #559

Don't drain mode: worker threads are interrupted on shutdown #559

dannpopescu commented Feb 23, 2023

antonmos commented Jul 25, 2023 •

edited

Loading

rkolesnev commented Aug 3, 2023

Don't drain mode: worker threads are interrupted on shutdown #559

Don't drain mode: worker threads are interrupted on shutdown #559

Comments

dannpopescu commented Feb 23, 2023

antonmos commented Jul 25, 2023 • edited Loading

rkolesnev commented Aug 3, 2023

antonmos commented Jul 25, 2023 •

edited

Loading