Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Emergence of Communication in an Interactive World with Consistent Speakers

Sep 03, 2018

Ben Bogin, Mor Geva, Jonathan Berant

Figure 1 for Emergence of Communication in an Interactive World with Consistent Speakers

Figure 2 for Emergence of Communication in an Interactive World with Consistent Speakers

Figure 3 for Emergence of Communication in an Interactive World with Consistent Speakers

Figure 4 for Emergence of Communication in an Interactive World with Consistent Speakers

Share this with someone who'll enjoy it:

Abstract:Training agents to communicate with one another given task-based supervision only has attracted considerable attention recently, due to the growing interest in developing models for human-agent interaction. Prior work on the topic focused on simple environments, where training using policy gradient was feasible despite the non-stationarity of the agents during training. In this paper, we present a more challenging environment for testing the emergence of communication from raw pixels, where training using policy gradient fails. We propose a new model and training algorithm, that utilizes the structure of a learned representation space to produce more consistent speakers at the initial phases of training, which stabilizes learning. We empirically show that our algorithm substantially improves performance compared to policy gradient. We also propose a new alignment-based metric for measuring context-independence in emerged communication and find our method increases context-independence compared to policy gradient and other competitive baselines.

View paper on

Share this with someone who'll enjoy it:

Title:Emergence of Communication in an Interactive World with Consistent Speakers

Paper and Code