Finish the watch stream connection when exiting #278

agrare · 2018-08-23T13:23:34Z

Prevent thread joins always timing out due to the watch stream blocking
for far longer than the join timeout.

agrare · 2018-08-23T13:23:51Z

cben · 2018-08-23T14:48:43Z

app/models/manageiq/providers/kubernetes/container_manager/streaming_refresh_mixin.rb

@@ -108,7 +108,13 @@ def ensure_watch_threads
  def stop_watch_threads
    safe_log("#{log_header} Stopping watch threads...")

-    finish.value = true
+    # First call WatchStream#finish to forcibly terminate the loop, this
+    # closes the HTTP connection and will cause the #each method to raise an


To be precise, the #each method crashes and rescues internally.
It's an awkward implementation, but it should be invisible.
If you see any exception leaking out of it, that's a kubeclient bug (we already escalated the rescue a couple of times, ManageIQ/kubeclient#280 and ManageIQ/kubeclient#315).

Oh, I see, I think manageiq is still using the old kubeclient 2.5.2, where some exceptions are uncaught :-(
It includes 280 but not 315.
Bumping kubeclient to 3.x / 4.x is still blocked on several things that are basically ready; I just need to test image scanning.
I could also release 2.5.3 with 315 backported.

OK to catch HTTP::ConnectionError for now, but let's have a more precise comment, and do nag me to bump or backport.

Okay added that PR to the comment re: the exception
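
For readers following along, here is a minimal sketch of why calling finish before joining matters. It does not use kubeclient: FakeWatchStream is a hypothetical stand-in whose #each blocks on a queue much like a watch blocked on the network, and stop_watch_thread is an illustrative helper, not the method from this PR. Unlike kubeclient 2.5.2, the stand-in ends its loop quietly, so there is no HTTP::ConnectionError to rescue here.

```ruby
require "thread"

# Hypothetical stand-in for a watch stream (not the kubeclient class):
# #each blocks waiting for notices until #finish is called.
class FakeWatchStream
  def initialize
    @notices = Queue.new
  end

  def push(notice)
    @notices << notice
  end

  # Blocks on the queue; the loop ends once the queue is closed and drained.
  def each
    while (notice = @notices.pop)
      yield notice
    end
  end

  # Closing the queue wakes any blocked #pop, which then returns nil.
  def finish
    @notices.close
  end
end

# Illustrative helper: terminate the stream first, then join with a timeout.
# Returns true if the thread exited within the timeout.
def stop_watch_thread(stream, thread, join_timeout)
  stream.finish
  !thread.join(join_timeout).nil?
end

stream   = FakeWatchStream.new
received = Queue.new
thread   = Thread.new { stream.each { |notice| received << notice } }

stream.push("ADDED pod-1")
first_notice = received.pop

# Without finish, the thread is still blocked in #each, so join times out.
joined_without_finish = !thread.join(0.1).nil?

# With finish, the blocked #each returns and the join succeeds.
joined_after_finish = stop_watch_thread(stream, thread, 5)
```

The first join returns nil because the thread is still waiting for the next notice; only after finish does the join complete, which is the behaviour the PR description is after.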

cben · 2018-08-23T14:52:35Z

app/models/manageiq/providers/kubernetes/container_manager/streaming_refresh_mixin.rb

+    watch_streams[entity_type] = watch_stream
+
+    begin
+      loop do


Is the loop for restarting if it disconnects (ManageIQ/kubeclient#275)? Have you actually observed that, and does this work?
I think you need a fresh start_watch() to reconnect (with a fresh resource_version, of course).

I haven't seen it yet, but yes, that's exactly why I had this loop. If this won't work I'll drop the loop and just let the thread restart.

👍 thread restarting looks good, fewer code paths is good
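
A rough sketch of the "just let the thread restart" approach agreed on above, under stated assumptions: WatchSupervisor and its starts counter are hypothetical names for illustration, and the block stands in for a watch body that, in the real mixin, would presumably call start_watch with a fresh resource_version on each start. Only the name ensure_watch_threads is taken from the diff.

```ruby
# Hypothetical supervisor: everything except the method name
# ensure_watch_threads is illustrative and not taken from the PR.
class WatchSupervisor
  attr_reader :watch_threads, :starts

  def initialize(entity_types, &watch_body)
    @entity_types  = entity_types
    @watch_body    = watch_body
    @watch_threads = {}
    @starts        = Hash.new(0)
  end

  # Start a thread for every entity type whose thread is missing or dead.
  # A disconnected watch simply lets its thread exit; the next call here
  # replaces it, so there is no reconnect loop inside the thread.
  def ensure_watch_threads
    @entity_types.each do |entity_type|
      thread = @watch_threads[entity_type]
      next if thread&.alive?

      @starts[entity_type] += 1
      @watch_threads[entity_type] = Thread.new { @watch_body.call(entity_type) }
    end
  end
end

# The body returns immediately, standing in for a watch that disconnected.
supervisor = WatchSupervisor.new(%i[pods services]) { |_entity_type| :disconnected }

supervisor.ensure_watch_threads
supervisor.watch_threads.each_value(&:join) # wait for both "disconnects"
supervisor.ensure_watch_threads             # dead threads get replaced
supervisor.watch_threads.each_value(&:join)
```

Each entity type ends up started twice: once initially and once after its thread died. Keeping reconnection in one place is the "fewer code paths" benefit mentioned in the thread.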

cben

LGTM 👍
merging

cben · 2018-08-23T15:07:29Z

app/models/manageiq/providers/kubernetes/container_manager/streaming_refresh_mixin.rb

    self.initial           = true
    self.queue             = Queue.new
    self.resource_versions = {}
-    self.watch_threads     = {}
+    self.watch_streams     = Concurrent::Map.new
+    self.watch_threads     = Concurrent::Map.new


Reminder, not necessarily in this PR: you also planned for resource_versions to be a Concurrent::Map.

Is this https://ruby-concurrency.github.io/concurrent-ruby/master/Concurrent/Hash.html ?

"locks against the object itself for every method call, ensuring only one thread can be reading or writing at a time"

Will lock contention be a problem?
I'd guess this is minor compared to the overhead of reading notices from the network.
Anyway, I'm cool with erring on the side of safety and profiling later.

miq-bot · 2018-08-23T15:08:21Z

Checked commit agrare@c216551 with ruby 2.3.3, rubocop 0.52.1, haml-lint 0.20.0, and yamllint 1.10.0
1 file checked, 1 offense detected

app/models/manageiq/providers/kubernetes/container_manager/streaming_refresh_mixin.rb

⚠️ - Line 144, Col 5 - Lint/HandleExceptions - Do not suppress exceptions.

Ladas self-assigned this Aug 23, 2018

Ladas added the enhancement label Aug 23, 2018

cben reviewed Aug 23, 2018

agrare force-pushed the terminate_watch_stream_connection branch 2 times, most recently from a55057d to a72b2d8 Compare August 23, 2018 15:04

Finish the watch stream connection when exiting

c216551

agrare force-pushed the terminate_watch_stream_connection branch from a72b2d8 to c216551 Compare August 23, 2018 15:05

cben approved these changes Aug 23, 2018

cben added this to the Sprint 93 Ending Aug 27, 2018 milestone Aug 23, 2018

cben added the inventory label Aug 23, 2018

cben merged commit ff072cd into ManageIQ:master Aug 23, 2018

agrare deleted the terminate_watch_stream_connection branch August 23, 2018 16:49
