Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Haris Ceribasic

On Information Asymmetry in Competitive Multi-Agent Reinforcement Learning: Convergence and Optimality

Oct 21, 2020

Ezra Tampubolon, Haris Ceribasic, Holger Boche

Figure 1 for On Information Asymmetry in Competitive Multi-Agent Reinforcement Learning: Convergence and Optimality

Figure 2 for On Information Asymmetry in Competitive Multi-Agent Reinforcement Learning: Convergence and Optimality

Abstract:In this work, we study the system of interacting non-cooperative two Q-learning agents, where one agent has the privilege of observing the other's actions. We show that this information asymmetry can lead to a stable outcome of population learning, which does not occur in an environment of general independent learners. Furthermore, we discuss the resulted post-learning policies, show that they are almost optimal in the underlying game sense, and provide numerical hints of almost welfare-optimal of the resulted policies.

* Preprint

Via

Access Paper or Ask Questions