Evolved Communication Strategies and Emergent Behaviour
of Multi-Agents in Pursuit Domains

by Gina Grossi
Supervisor: Brian Ross

Screen capture of pursuit domain simulation with 4 predator agents and 1 prey.

This study investigates how genetic programming (GP) can be effectively used in a multi-agent system to allow agents to learn to communicate. Using the predator-prey scenario and a co-operative learning strategy, communication protocols are compared as multiple predator agents learn the meaning of commands in order to achieve their common goal of first finding, and then tracking prey. This work is divided into three parts. Part 1 uses a simple GP language in the Pursuit Domain Development Kit (PDP) to investigate several communication protocols, and compares the predators' ability to find and track prey when the prey moves both linearly and randomly. Part 2, again in the PDP environment, enhances the GP language and fitness measure in search of a better solution for when the prey moves randomly. Part 3 uses the Ms. Pac-Man Development Toolkit to test how the enhanced GP language performs in a game environment. The outcome of each part of this study reveals emergent behaviours in different forms of message sending patterns. The results from Part 1 reveal a general synchronization behaviour emerging from simple message passing among agents. Additionally, the results show a learned behaviour in the best result which resembles the behaviour of guards and reinforcements found in popular stealth video games. The outcomes from Part 2 reveal an emergent message sending pattern such that one agent is designated as the "sending" agent and the remaining agents are designated as "receiving" agents. Evolved agents in the Ms. Pac-Man simulator show an emergent sending pattern in which there is one agent that sends messages when it is in view of the prey. In addition, it is shown that evolved agents in both Part 2 and Part 3 are able to learn a language. For example, "sending" agents are able to make decisions about when and what type of command to send and "receiving" agents are able to associate the intended meaning to commands.

Downloads and publications

Learning Strategies for Evolved Co-operating Multi-Agent Teams in Pursuit Domain,
Gina Grossi, MSc Thesis, 2017.
"Evolved Communication Strategies and Emergent Behaviour of Multi-Agents in Pursuit Domains",
G. Grossi and B.J. Ross, Proc. IEEE Conf. in Computational Intelligence and Games (CIG), NY, USA, July 2017.
CIG 2017 Presentation.

Videos

Synchronized Message Sending -- Staircase Pattern:

Evolved guard reinforcement behaviour:

COmpare the above to guard reinforcement behaviour in Metal Gear Solid (MGS):

Back up: http://www.cosc.brocku.ca/~bross/

Evolved Communication Strategies and Emergent Behaviour of Multi-Agents in Pursuit Domains

Downloads and publications

Videos

Evolved Communication Strategies and Emergent Behaviour
of Multi-Agents in Pursuit Domains