
Evaluate usage of tracker clock to ensure it's logically correct #21

Open
bitwalker opened this issue Nov 2, 2016 · 6 comments

@bitwalker
Owner

To recap, the tracker uses an implementation of an Interval Tree Clock for resolving the causal history of replicated events. My understanding of ITC is that a scenario plays out like so:

  • Node A is created as a seed clock
  • Node B is created by forking A's clock
  • Event X occurs on A, incrementing its event clock
  • When A syncs with B (or vice versa), we can compare the clocks and see that B's clock is strictly in the past of A's clock, so we can safely update B's state
  • If events occur on both A and B, their clocks will conflict, which requires us to resolve the conflicting state in some way and then make sure the clocks are synchronized before proceeding
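
For reference, here is a minimal sketch of that scenario in code. The alias and the exact function shapes (in particular seed/0 and fork/1 returning a pair) are assumptions based on how the clock is referenced in this thread, not a verified API:

# scenario above, assuming an ITC module exposing seed/0, fork/1, event/1 and leq/2
alias Swarm.IntervalTreeClock, as: Clock

seed = Clock.seed()
{clock_a, clock_b} = Clock.fork(seed)   # node B forks node A's clock

clock_a = Clock.event(clock_a)          # event X occurs on A

Clock.leq(clock_b, clock_a)             # => true: B is strictly in A's past, safe to sync
Clock.leq(clock_a, clock_b)             # => false

clock_b = Clock.event(clock_b)          # an event also occurs on B
Clock.leq(clock_a, clock_b)             # => false
Clock.leq(clock_b, clock_a)             # => false: neither dominates, so the clocks conflict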

In the case of Swarm, we can deterministically resolve conflicts, because at any given point in time we know which node in the cluster "owns" a given process/registration, with the exception of processes which are registered but not managed by Swarm. We can ask the conflicted process to shut down, or kill it, and the registration will remain with the "correct" version. However, since synchronization today works by sending the registry to the remote node, I'm not sure a clock is strictly needed. If we assume that it does help us, though, we should ensure that our logic for handling clock forking/peeking/incrementing is correct.
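
Purely to illustrate that resolution idea (this is a sketch, not Swarm's implementation: owner_node/1 stands in for whatever the distribution strategy provides, and the :shutdown exit is an arbitrary choice):

# sketch: on a pid conflict for `name`, keep the registration on the node the
# strategy assigns as owner and shut the other pid down; owner_node/1 is hypothetical
def resolve_pid_conflict(name, local_pid, remote_pid) do
  if owner_node(name) == Node.self() do
    # we own the name, so the remote side should yield; keep our registration
    {:keep, local_pid}
  else
    # the remote node owns the name: stop our local process and adopt theirs
    Process.exit(local_pid, :shutdown)
    {:keep, remote_pid}
  end
end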

@tschmittni
Contributor

I have noticed some failures in Swarm when ITCs are compared. I realized that the Clock.peek() operation is broken: it returns an invalid structure which is not compatible with the other operations, which means that peeked clocks cannot be compared or joined. I put up a pull request with a fix: #77

While I was investigating this problem, I reviewed the general use of ITCs and it seems like the current implementation is flawed.

Here are a few rules which have to be implemented to make ITCs work:

  • Never store a remote clock in the local registry. Remote clocks have no identity (or the wrong identity).
    There are many places which store remote clocks in the local registry, which is always wrong, because remote clocks either have no identity (if peeked) or carry the identity of another node, e.g.:
    Registry.update(name, [meta: meta, clock: rclock])
    Peeked clocks should also never be stored, because otherwise the identity gets lost, e.g.:
    Registry.new!(entry(name: name, pid: pid, ref: ref, meta: meta, clock: Clock.peek(new_clock)))

  • Use Clock.join() to receive causal information from remote (peeked) clocks.
    Clock.join() is not used anywhere in the Tracker, which means that causal information from other nodes is never updated after a broadcast. In order to use Clock.join() with peeked clocks, the ITC implementation has to be fixed first.

  • Always call Clock.peek() before sending clocks to other replicas. This avoids a couple of bugs where a node ends up storing a clock with the wrong identity.

  • Make sure that every node gets a properly forked clock when joining the cluster.
    This is actually the most complicated piece to fix (a rough sketch of the idea follows below).
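
To make that last rule concrete, here is a very rough sketch of the hand-off, purely as an illustration: the registered name, message shape, and state fields are made up, and this is not Swarm's actual join protocol. The idea is that an existing node forks its own clock and gives the new node one half, so every replica holds a unique ITC identity.

# sketch only: assumes a state map with a :clock key and a registered tracker process
alias Swarm.IntervalTreeClock, as: Clock

# on an existing node, when new_node joins the cluster:
{lclock, clock_for_new_node} = Clock.fork(state.clock)
state = %{state | clock: lclock}
send({:tracker, new_node}, {:sync_clock, clock_for_new_node})

# on the joining node, the handed-off half becomes its local clock:
receive do
  {:sync_clock, forked_clock} -> %{state | clock: forked_clock}
end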

Let's take a look at an example (handle_replica_event for :update_meta). I think the code should look like this:

cond do
  Clock.leq(lclock, rclock) ->
    # the remote clock dominates: join the peeked remote clock to take over
    # causal information from the other node, then apply the update
    nclock = Clock.join(lclock, rclock)
    Registry.update(name, [meta: new_meta, clock: nclock])

  Clock.leq(rclock, lclock) ->
    # this node has data which causally dominates the remote,
    # so ignore the request when the local clock wins
    :ok

  :else ->
    # the clocks conflict: take the last-writer-wins approach for resolution for now,
    # join and bump our local clock, and re-broadcast the update to ensure we converge
    new_meta = Map.merge(old_meta, new_meta)
    nclock = Clock.join(lclock, rclock)
    nclock = Clock.event(nclock)
    Registry.update(name, [meta: new_meta, clock: nclock])
    debug "conflicting meta for #{inspect name}, updating and notifying other nodes"
    broadcast_event(state.nodes, Clock.peek(nclock), {:update_meta, new_meta, pid})
end

@h4cc

h4cc commented Apr 5, 2018

I found this issue because there is one process in our swarm of 3 nodes that does not start and we get this warning:

09:11:40.932 [warn] [swarm on node1@49800a15e1cc] [tracker:handle_replica_event] received track event for "device-1234", mismatched pids, local clock conflicts with remote clock, event unhandled

We have 3 nodes running, but this warning only occurs on one (49800a15e1cc) of them.

What is strange is that Swarm.whereis_name always returns the same PID, even though the corresponding process is not running (anymore).

Is this a problem that will be fixed by Swarm?
Is there a workaround for this?

@bitwalker
Owner Author

I've got some pending work which addresses the usages of clocks inside Swarm, but I'm currently in the middle of a big move, so I've had to shelve stuff for a bit until I get settled in my new house - this is definitely on my list of priorities!

@h4cc

h4cc commented May 7, 2018

This issue is starting to occur more frequently in our system. I hope it can be fixed in the next release so we can keep using this nice package :)

@andersonmcook

We're running into the same warning/error as @h4cc.

@bitwalker
Owner Author

I'm currently working on a new causally consistent model for Swarm, but in the meantime I believe we recently merged a PR which included fixes to the clock usage; you may want to give that a shot until I can get a release out.
