Problems/limitations with network busses #28

usedbytes · 2018-01-02T12:31:32Z

I want to bridge two busses over the network, giving the illusion of a single bus. This isn't currently possible as it deadlocks (and also you'll end up with an infinite loop of duplicate events).

The simple program below will show the deadlock:

package main

import (
	"fmt"
	"time"

	"github.com/asaskevich/EventBus"
)

func main() {

	networkBusA := EventBus.NewNetworkBus(":2035", "/_net_bus_A")
	networkBusA.Start()

	networkBusB := EventBus.NewNetworkBus(":2030", "/_net_bus_B")
	networkBusB.Start()

	networkBusA.Subscribe("topic-A", func(a int) { fmt.Println("A handler:", a) }, ":2030", "/_net_bus_B")

	networkBusB.Subscribe("topic-A", func(a int) { fmt.Println("B handler:", a) }, ":2035", "/_net_bus_A")

	fmt.Println("Publishing on A...")
	networkBusA.EventBus().Publish("topic-A", 10)
	fmt.Println("Done.")

	time.Sleep(2 * time.Second)

	networkBusA.Stop()
	networkBusB.Stop()
}

This is similar to #25, with a slightly different cause.

networkBusA.EventBus().Publish("topic-A", 20) causes the networkBusA lock to be taken
The rpc callback for networkBusB is called to publish the event there.
This goes through networkBusB's normal Publish() path
- Which will call the rpc callback for networkBusA, and now we're back trying to take the networkBusA lock again.

Avoiding the deadlock by spawning the Publish() in a goroutine shows the infinite loop (run the same program as above):

diff --git a/client.go b/client.go
index a9e9e69..831431a 100644
--- a/client.go
+++ b/client.go
@@ -116,7 +116,7 @@ type ClientService struct {
 
 // PushEvent - exported service to listening to remote events
 func (service *ClientService) PushEvent(arg *ClientArg, reply *bool) error {
-	service.client.eventBus.Publish(arg.Topic, arg.Args...)
+	go service.client.eventBus.Publish(arg.Topic, arg.Args...)
 	*reply = true
 	return nil
 }

I'm looking for any suggestions on how to fix this, or perhaps just a definition of how network busses should work (e.g. if there's multiple network clients subscribed to a bus, should events received over the network be propagated to them?)

Just avoiding sending an event back to the client that published it seems like OK behaviour, but I'm not entirely sure how to plumb that in - Publish() probably needs to gain an understanding of where a publish request came from, and handlers need to know where they are going to

The text was updated successfully, but these errors were encountered:

This reworks NetworkBus in an attempt to address asaskevich/EventBus/asaskevich#28 Previously, remote subscriptions were handled via regular handlers in the bus, which would publish the event in the remote bus by invoking its Publish() method over RPC. If the bus originally publishing the event was also subscribed to the same topic on the remote bus, then its "remote" handler would be called in the remote bus, calling back into the original caller, and deadlocking (or looping forever). To remedy this, NetworkBus is split into two separate buses - one only holds handlers which correspond to remote subscribers, and one only handling "local" subscribers. When an event is published (locally), it gets broadcast to both local and remote subscribers. However, when an event is received over the network, it is _only_ published to local subscribers. This prevents the call back to the remote originator. The downside to this approach is that: a) Two busses are needed (which uses resources) b) Events received over the network are not propagated to other remote subscribers. i.e. If BusA subscribes to "topic1" in BusB, and BusB subscribes to "topic1" in BusC, then if someone publishes "topic1" in BusC, then it gets received in BusB, but _not_ in BusA. BusA would need to subscribe directly to BusC in order to receive the event. *NOTE*: This breaks compatibility by renaming Client.Subscribe[Once] to Client.Subscribe[Once]Remote. This allows NetworkBus to expose the "normal" Bus interface, but might be unacceptable.

usedbytes · 2018-01-02T22:38:14Z

Having chewed on this a bit, I came up with the commit above. It doesn't commute events across chained network subscriptions, but is fairly neat/pragmatic. Looking for feedback if there is any 😄

bennAH · 2018-01-03T00:07:56Z

hi @usedbytes will ponder over this in the next few days.

Just to get a better understanding, what is your use case for this?

usedbytes · 2018-01-03T10:58:37Z

I'm using the bus to communicate between subsystems on a robot I'm building. e.g. the sensor subsystem(s) can publish events with sensor readings.

It's useful to send telemetry data from the robot to my laptop, which is easily achieved by letting the laptop "sniff" the bus - it'll receive the internal traffic by subscribing to the bus over the network. That works fine with the existing code, as the laptop only subscribes, and doesn't publish.

As an extension, it's nice from a development perspective if the laptop can inject events into the robot's on-board bus (e.g. to "fake" sensor readings, or to develop a subsystem on the laptop without needing to deploy it to the 'bot) - that's what leads to the bidirectional subscriptions. These could theoretically be published on a different topic, but it's obviously nicer if it can be on the same topic as if it were a "local" subsystem - i.e. the robot code doesn't need to know whether the event came from something on-board, or over the network.

This reworks NetworkBus in an attempt to address asaskevich/EventBus/asaskevich#28 Previously, remote subscriptions were handled via regular handlers in the bus, which would publish the event in the remote bus by invoking its Publish() method over RPC. If the bus originally publishing the event was also subscribed to the same topic on the remote bus, then its "remote" handler would be called in the remote bus, calling back into the original caller, and deadlocking (or calling back and forth forever). To remedy this, NetworkBus is split into two separate busses - one only holds handlers which correspond to remote subscribers, and one only handling "local" subscribers. When an event is published (locally), it gets broadcast to both local and remote subscribers. However, when an event is received over the network, it is _only_ published to local subscribers. This prevents the call back to the remote originator. The downside to this approach is that: a) Two busses are needed (which uses resources) b) Events received over the network are not propagated to other remote subscribers. i.e. If BusA subscribes to "topic1" in BusB, and BusB subscribes to "topic1" in BusC, then if someone publishes "topic1" in BusC, then it gets received in BusB, but _not_ in BusA. BusA would need to subscribe directly to BusC in order to receive the event. *NOTE*: This breaks compatibility by renaming Client.Subscribe[Once] to Client.Subscribe[Once]Remote. This allows NetworkBus to expose the "normal" Bus interface, but might be unacceptable.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Problems/limitations with network busses #28

Problems/limitations with network busses #28

usedbytes commented Jan 2, 2018 •

edited

Loading

usedbytes commented Jan 2, 2018

bennAH commented Jan 3, 2018

usedbytes commented Jan 3, 2018

Problems/limitations with network busses #28

Problems/limitations with network busses #28

Comments

usedbytes commented Jan 2, 2018 • edited Loading

usedbytes commented Jan 2, 2018

bennAH commented Jan 3, 2018

usedbytes commented Jan 3, 2018

usedbytes commented Jan 2, 2018 •

edited

Loading