Agent Communication Channels #1839

Gamenot · 2023-02-07T14:06:01Z

Is your feature request related to a problem? Please describe.
There is a need to provide utility to allow for cooperative agents which can communicate with each other over top what SMARTS is responsible for.

The remaining question is how immediate and continuous the communication needs to be:

Immediacy (choose 1):

instant?
1 frame delay

Frequency (choose 1):

1 time per step
multiple times per step

Signal (as many that apply):

single hand-shake
stream
channel
resource request

Describe the solution you'd like
We want a channel subscription utility which sits above the SMARTS layer to allow agents to communicate with each-other. These should not just be vehicle related agents.

Some options:

apache kafka (probably not since it requires Java)
ros (large dependencies)
ray (problematic but easy)
pyzmq (this is already a dependency in some options and does not cause issues)
lcm

Describe alternatives you've considered
Subscribe to other agents through the agent interface where data is passed to other agents through observations (this has the issue of a 1 step delay.)

Gamenot · 2023-02-14T22:13:27Z

Concept 1 (Agent exclusive communication)

Use a messaging system that assumes unordered messages. Unlimited messages may be too much, it is unclear how to trigger the start of the communication.

class CooperativeAgent(Agent):
   def __init__(self, message_interface):
      self._message_interface = message_interface
      self._state_lock = Lock()
      self.awake = False
      message_interface.register("agents_broadcast", on_agents_message_callback)

   def on_agents_message_callback(self, msg):
      """Keep communicating until messages are done."""
      with self._state_lock:
         if msg.id == "__all__" and msg.wake:
            self.awake = True
         if self.self.open_to_communicate:
            return
         if stop_condition(msg):
            self.awake = False
            self._message_interface.send("agents_broadcast", Msg(..., id=self.id, closed=True))
         reply_msg = self.gen_msg(msg)
         self._message_interface.send("agents_broadcast", reply_msg)

agents: List[CooperativeAgent]

while not terms["__all__"]:
   ## Trigger start of agent communication with a `wake` message
   message_interface.send("agents_broadcast", Msg(..., id="__all__" wake=True))
   ## block until all agents are done communicating?
   wait_on_agents(agents)
   for i in range(num_agents):
      actions[AGENT_ID+i] = agents[AGENT_ID+i].act(obs[AGENT_ID+i])

   obs, rewards, terms, truncs, infos = env.step(actions)

Concept 2 (Gym wrapper Action->Observation message)

Use a wrapper to pass messages between agents. This has the downside of a 1 step delay for passing information.

class MessageAction(NamedTuple):
   action: Any
   msg: Any

class MessageObservation(NamedTuple):
   observation: Observation
   msgs: Dict[str, Any]

class MessagePasser(gymnasium.Wrapper):
   def __init__(self, env: gymnasium.Env):
      super.__init__(env)
      # update action_space and observation_space
      self.action_space =

   def step(self, actions):
      actions = {}
      msgs = {}
      for a_id, ma in actions.items():
         msgs[a_id] = ma.msg
         actions[a_id] = ma.action

      obs, rewards, terms, truncs, infos = self.env.step(actions)
      obs_with_msgs = {a_id: MessageObservation(ob, msgs) for a_id, ob in obs.items()}
      return obs_with_msgs, rewards, terms, truncs, infos

   def reset(self,*, seed, options):
      ...

# same standard interface
actions = {}
for i in range(num_agents):
   actions[AGENT_ID+i] = agents[AGENT_ID+i].act(obs[AGENT_ID+i])
obs, rewards, terms, truncs, infos = env.step(actions)

Concept 3 (Agent&Wrapper)

Take concept 1 and 2 and combine so that the communication occurs inside the wrapper.

agents = {a_id: Transmitter(Receiver(agent, on_receive), on_transmit) for a_id, agent in agents.items()}
env = MessagePasser(env, agents)

obs, rew, term, trunc, info = env.step({"transmit": False, "receive": True}) # communication and actions occur internally
# Actions are returned in `info`

Concept 4 (Sensor based)

In this concept messages are passed out of SMARTS about the vehicles.

class Sensitivity(Enum):
    LOW = 0
    STANDARD = 1
    HIGH = 2


class V2XTransmitter(NamedTuple):
    """A configuration utility to set up agents to transmit messages."""

    # TODO MTA: move this to agent interface.

    bands: Sequence[int]
    range: float
    mission: bool
    position: bool
    heading: bool
    breaking: bool
    throttle: bool
    steering: bool


class V2XReceiver(NamedTuple):
    """A configuratoin utility to set up agent to receive messages."""

    # TODO MTA: move this to agent interface.
    bands: Sequence[int]
    sensitivity: Sensitivity = Sensitivity.HIGH

# This is where I see a problem.
## SMARTS manages sensors but the agent must be able to send info.
## Specifically, the agent lives outside SMARTS and the sensors within.
class MessageTransmitterSensor():
   def __call__(self, sim_state):
      raise NotImplementedError

class MessageReceiverSensor():
   def __call__(self, sim_state):
      raise NotImplementedError

# Otherwise, the sensor interface is good
## It means there are no extra cases needed to modify the action/observation space.
class AgentInterface():
   ...
   v2x_receiver = V2XReceiver(bands=[4,5,8], sensitivity=Sensitivity.STANDARD)
   v2x_transmitter = V2XTransmitter(bands=[4,8], range=200, mission=True, position=True, heading=True)

Concept 5 (Hybrid Message)

In this concept SMARTS handles some of the message and the agent has some space to insert its own specific message.

class V2XTransmitter(NamedTuple):
   """A configuration utility to set up agents to transmit messages."""

   # TODO MTA: move this to agent interface.

   bands: Sequence[int]
   range: float
   ...
   custom_message_blob_bytes: int # custom agent message

class AgentInterface:
   ...
   v2x_receiver = V2XReceiver(bands=[4,5,8], sensitivity=Sensitivity.STANDARD)
   v2x_transmitter = V2XTransmitter(bands=[4,8], ..., custom_message_blob_bytes=10000)

class Agent

# env interface is then same as normal except action and observation space
# > env.action_space[@0]
# Tuple(base_action, message)
# > env.observation_space[@0]

Adaickalavan · 2023-02-15T15:40:22Z

Some thoughts on simulating communication.

Why is pyzmq a dependency of SMARTS?
In simulating Vehicle-to-Everything (V2X) communication, we could follow one of the V2X standards being developed. For example IEEE 802.11p and IEEE 802.11bd.
To mimic real life, the communication transmitter and receiver should be inside each vehicle model, and act like an additional sensor (i.e., transceiver) which is stepped (i.e., transmit and receive data) at each time point.
Given we have step based environment, we could begin by allowing communication frequency of once per time step.
Wireless communication systems incur some amount of delay. Hence, we can begin by considering a system with 1 step delay.
Vehicles may communicate directly with other objects (i.e., vehicles, traffic lights, etc) by sending and receiving messages. This could for example be achieved by MPI scatter to selected objects within some neighbourhood radius (i.e., communication range).
Message size should be limited perhaps to ~tens of Mbps.

Gamenot · 2023-02-15T15:58:36Z

@Adaickalavan

Some thoughts on simulating communication.

Why is pyzmq a dependency of SMARTS?

From pipdeptree it looks like it is not a core dependency. It appears from pytest, jupyter, and visdom related dependencies.

In simulating Vehicle-to-Everything (V2X) communication, we could follow one of the V2X standards being developed. For example IEEE 802.11p and IEEE 802.11bd.

Do you have any suggestions for papers?

To mimic real life, the communication transmitter and receiver should be inside each vehicle model, and act like an additional sensor (i.e., transceiver) which is stepped (i.e., transmit and receive data) at each time point.

I overlooked this, it is a good approach probably. I have some concern over how the sensor response is programmed. We generally do not want to inject user code into SMARTS, and would need to work from a remote (roughly data only.) It would not be breaking but if the response must go through the action that would be a very different behaviour than we currently have.

Given we have step based environment, we could begin by allowing communication frequency of once per time step.

Wireless communication systems incur some amount of delay. Hence, we can begin by considering a system with 1 step delay.

That was the reason for mocking up the wrapper version.

Vehicles may communicate directly with other objects (i.e., vehicles, traffic lights, etc) by sending and receiving messages. This could for example be achieved by MPI scatter to selected objects within some neighbourhood radius (i.e., communication range).

Message size should be limited perhaps to ~tens of Mbps.

Right, we can assume packet size can be capped at 10Mbps for now.

Gamenot · 2023-02-21T14:29:41Z

I am going to be implementing this through a gym.Wrapper for now.

Gamenot added the enhancement New feature or request label Feb 7, 2023

Gamenot added this to the `develop` branch close-down milestone Feb 14, 2023

Gamenot mentioned this issue Feb 22, 2023

Add agent communication wrapper. #1881

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Agent Communication Channels #1839

Agent Communication Channels #1839

Gamenot commented Feb 7, 2023 •

edited

Loading

Gamenot commented Feb 14, 2023 •

edited

Loading

Adaickalavan commented Feb 15, 2023

Gamenot commented Feb 15, 2023

Gamenot commented Feb 21, 2023

Agent Communication Channels #1839

Agent Communication Channels #1839

Comments

Gamenot commented Feb 7, 2023 • edited Loading

Gamenot commented Feb 14, 2023 • edited Loading

Concept 1 (Agent exclusive communication)

Concept 2 (Gym wrapper Action->Observation message)

Concept 3 (Agent&Wrapper)

Concept 4 (Sensor based)

Concept 5 (Hybrid Message)

Adaickalavan commented Feb 15, 2023

Gamenot commented Feb 15, 2023

Gamenot commented Feb 21, 2023

Gamenot commented Feb 7, 2023 •

edited

Loading

Gamenot commented Feb 14, 2023 •

edited

Loading