Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards

May 30, 2019

Yuhang Song, Jianyi Wang, Thomas Lukasiewicz, Zhenghua Xu, Shangtong Zhang, Mai Xu

Figure 1 for Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards

Figure 2 for Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards

Figure 3 for Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards

Figure 4 for Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards

Share this with someone who'll enjoy it:

Abstract:Intrinsic rewards are introduced to simulate how human intelligence works; they are usually evaluated by intrinsically-motivated play, i.e., playing games without extrinsic rewards but evaluated with extrinsic rewards. However, none of the existing intrinsic reward approaches can achieve human-level performance under this very challenging setting of intrinsically-motivated play. In this work, we propose a novel megalomania-driven intrinsic reward (called \emph{mega-reward}), which, to our knowledge, is the first approach that achieves human-level performance in intrinsically-motivated play. Intuitively, mega-reward comes from the observation that infants' intelligence develops when they try to gain more control on entities in an environment; therefore, mega-reward aims to maximize the control capabilities of agents on given entities in a given environment. To formalize mega-reward, a relational transition model is proposed to bridge the gaps between direct and latent control. Experimental studies show that mega-reward can (i) greatly outperform all state-of-the-art intrinsic reward approaches, (ii) generally achieves the same level of performance as Ex-PPO and professional human-level scores; and (iii) has also superior performance when it is incorporated with extrinsic reward.

View paper on

Share this with someone who'll enjoy it:

Title:Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards

Paper and Code