Vnode proxy stuck in overload state #767

jonmeredith · 2015-08-12T16:57:00Z

Fix for #760 (RIAK-1914).

Previously, if a heavily loaded vnode received more than check_threshold msgs after it hit the direct mailbox check, the ping/pong would leave it in overload state. In cases where the load was removed (e.g. primary recovered) the vnode proxy would stay in overload state until enough messages were received to trigger pinging the vnode or rechecking the mailbox.

This PR does a couple of things

Stops counting messages skipped due to overload in the mailbox estimate.
Changes the ping/pong mechanism to use a reference so we can be resubmit safely.
After doing a direct check, immediately trigger a ping/pong so that once the mailbox is cleared the proxy will be immediately notified.
Adds an overloaded/1 function to make testing easier.

There is corresponding riak_test on branch bugfix/jdm/vnode-proxy-stuck-in-overload-v2

Customer ticket zd://11440

…box size. When starting up fallbacks that receive a lot of load (e.g. in riak_kv when a node running bitcask restarts it only starts primaries before marking the service as up, fallbacks are started on demand and if there is a large keydir already on the node, it may take several minutes to start while queuing up requests). The vnode is receiving messages from other sources than the vnode proxy (possibly messages to trigger handoff from the vnode manager if the primary recovers) and does not respond to the ping message before the direct message queue length check is invoked. The msgq is over the threshold and leaves the vnode proxy in overload, but also ignores the eventual response from the ping message. If enough requests go through the vnode proxy it will do another direct check, however if this happens after the primary is back in service then the message volume is much lower (probably just handoff folds), significantly delaying (possibly by many hours) the start of a handoff. This fix immediately sends a ping message after a direct check as the vnode was overloaded enough not to respond to the ping in 2500 requests. When the vnode responds to the ping message it will adjust the mailbox check and reset the counter for a new soft check. This should bring the vnode proxy out of overload much sooner. Here's a reproducer to play with the proxy code. ``` %% Provoke the nastiness PPid = whereis(proxy_riak_kv_vnode_0). {ok, VPid} = riak_core_vnode_proxy:command_return_vnode({riak_kv_vnode,0,node()}, timeout). Report = fun() -> io:format("PPid = ~p with ~p messages. VPid = ~p with ~p messages\n", [PPid, element(2, process_info(PPid, message_queue_len)), VPid, element(2, process_info(VPid, message_queue_len))]) end. sys:suspend(PPid), sys:suspend(VPid), [spawn(fun() -> riak_kv_vnode:local_get(0, {<<"b">>,<<"k">>}) end) || X <- lists:seq(1,15000)]. Report(). [spawn(fun() -> catch gen_fsm:sync_send_event(VPid, {ohai, X}, 1) end) || X <- lists:seq(1,15000)]. Report(). sys:resume(PPid). %%% Pause here and enjoy the misery. lists:usort([riak_kv_vnode:local_get(0, {<<"b">>,<<"k">>}) || X <- lists:seq(1,2498)]). sys:resume(VPid). Report(). Report(). Report(). Report(). Report(). rr(riak_core_vnode_proxy). sys:get_status(PPid). % look for check_mailbox > 0 ```

after recovery until possibly thousands of messages are sent. As the mailbox is only an estimate, simplify by not including the vnode proxy ping/pong in it - there is no need. The ping/pong mechanism will now just track proxied requests being completed. Previously the mailbox estimate grew incorrectly when in the overload state. Now the estimate is only updated when messages are proxied. If the proxy has to perform a hard check on the msgq len, *do* include the ping message as part of it so that the estimate will be correct once the ping/pong happens. QuickCheck test added for riak_test to exercise more than the unit tests.

cmeiklejohn · 2015-08-13T16:25:07Z

You can't retry until you 👍 a commit.

cmeiklejohn · 2015-08-13T16:25:33Z

👍 c322af3

cmeiklejohn · 2015-08-13T16:26:07Z

src/riak_core_vnode_proxy.erl

@@ -317,6 +326,10 @@ fake_loop() ->
        {get_count, Pid} ->
            Pid ! {count, erlang:get(count)},
            fake_loop();
+        %% Original tests do not expect replies


Remove commented code.

Actually added this for future souls on purpose as I couldn't work out where my replies were. I'll improve the comment.

…erload-v2 Vnode proxy stuck in overload state Reviewed-by: cmeiklejohn

jonmeredith · 2015-08-13T17:38:27Z

@cmeiklejohn updated comments per review. Would you mind +1ing the new SHA?

jonmeredith · 2015-08-13T18:00:38Z

Stupid racey test

**error:{assertEqual_failed,[{module,riak_core_vnode_proxy},
                     {line,397},
                     {expression,"Count"},
                     {expected,20005},
                     {value,19569}]}

Put the limit back so it cannot overload. EQC test coverage is much better anyway.

On the builders, sending more than threshold messages could cause overload. This should *never* overload.

jonmeredith · 2015-08-13T18:02:15Z

@cmeiklejohn once more, with feeling please :)

cmeiklejohn · 2015-08-13T23:06:57Z

👍 63ff909

…erload-v2 Vnode proxy stuck in overload state Reviewed-by: cmeiklejohn

jonmeredith · 2015-08-14T13:58:28Z

@borshop merge

Jon Meredith added 4 commits August 11, 2015 10:58

Refactored core vnode proxy ping message to take ref.

c4d32a5

Added overloaded call for testing.

e6e64b1

jonmeredith assigned cmeiklejohn Aug 12, 2015

jonmeredith mentioned this pull request Aug 12, 2015

Quickcheck test for vnode proxy overload recovery. basho/riak_test#827

Merged

cmeiklejohn reviewed Aug 13, 2015
View reviewed changes

borshop added a commit that referenced this pull request Aug 13, 2015

Merge pull request #767 from basho/bugfix/jdm/vnode-proxy-stuck-in-ov…

481c5dd

…erload-v2 Vnode proxy stuck in overload state Reviewed-by: cmeiklejohn

borshop added a commit that referenced this pull request Aug 13, 2015

Merge pull request #767 from basho/bugfix/jdm/vnode-proxy-stuck-in-ov…

3b55b61

…erload-v2 Vnode proxy stuck in overload state Reviewed-by: cmeiklejohn

Updated for review comments.

ce54b0d

Dropped the messages sent in the happy path case.

63ff909

On the builders, sending more than threshold messages could cause overload. This should *never* overload.

borshop added a commit that referenced this pull request Aug 13, 2015

Merge pull request #767 from basho/bugfix/jdm/vnode-proxy-stuck-in-ov…

a80486b

…erload-v2 Vnode proxy stuck in overload state Reviewed-by: cmeiklejohn

borshop added a commit that referenced this pull request Aug 13, 2015

Merge pull request #767 from basho/bugfix/jdm/vnode-proxy-stuck-in-ov…

806e857

…erload-v2 Vnode proxy stuck in overload state Reviewed-by: cmeiklejohn

borshop merged commit 63ff909 into 2.0 Aug 14, 2015

hazen deleted the bugfix/jdm/vnode-proxy-stuck-in-overload-v2 branch September 28, 2016 12:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vnode proxy stuck in overload state #767

Vnode proxy stuck in overload state #767

jonmeredith commented Aug 12, 2015

cmeiklejohn commented Aug 13, 2015

cmeiklejohn commented Aug 13, 2015

cmeiklejohn Aug 13, 2015

jonmeredith Aug 13, 2015

jonmeredith commented Aug 13, 2015

jonmeredith commented Aug 13, 2015

jonmeredith commented Aug 13, 2015

cmeiklejohn commented Aug 13, 2015

jonmeredith commented Aug 14, 2015

Vnode proxy stuck in overload state #767

Vnode proxy stuck in overload state #767

Conversation

jonmeredith commented Aug 12, 2015

cmeiklejohn commented Aug 13, 2015

cmeiklejohn commented Aug 13, 2015

cmeiklejohn Aug 13, 2015

Choose a reason for hiding this comment

jonmeredith Aug 13, 2015

Choose a reason for hiding this comment

jonmeredith commented Aug 13, 2015

jonmeredith commented Aug 13, 2015

jonmeredith commented Aug 13, 2015

cmeiklejohn commented Aug 13, 2015

jonmeredith commented Aug 14, 2015