Alert button
Picture for Shimon Whiteson

Shimon Whiteson

Alert button

UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 06, 2020
Tarun Gupta, Anuj Mahajan, Bei Peng, Wendelin Böhmer, Shimon Whiteson

Figure 1 for UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning
Figure 2 for UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning
Figure 3 for UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning
Figure 4 for UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning
Viaarxiv icon

My Body is a Cage: the Role of Morphology in Graph-Based Incompatible Control

Add code
Bookmark button
Alert button
Oct 05, 2020
Vitaly Kurin, Maximilian Igl, Tim Rocktäschel, Wendelin Boehmer, Shimon Whiteson

Figure 1 for My Body is a Cage: the Role of Morphology in Graph-Based Incompatible Control
Figure 2 for My Body is a Cage: the Role of Morphology in Graph-Based Incompatible Control
Figure 3 for My Body is a Cage: the Role of Morphology in Graph-Based Incompatible Control
Figure 4 for My Body is a Cage: the Role of Morphology in Graph-Based Incompatible Control
Viaarxiv icon

RODE: Learning Roles to Decompose Multi-Agent Tasks

Add code
Bookmark button
Alert button
Oct 04, 2020
Tonghan Wang, Tarun Gupta, Anuj Mahajan, Bei Peng, Shimon Whiteson, Chongjie Zhang

Figure 1 for RODE: Learning Roles to Decompose Multi-Agent Tasks
Figure 2 for RODE: Learning Roles to Decompose Multi-Agent Tasks
Figure 3 for RODE: Learning Roles to Decompose Multi-Agent Tasks
Figure 4 for RODE: Learning Roles to Decompose Multi-Agent Tasks
Viaarxiv icon

A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms

Add code
Bookmark button
Alert button
Oct 02, 2020
Shangtong Zhang, Romain Laroche, Harm van Seijen, Shimon Whiteson, Remi Tachet des Combes

Figure 1 for A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms
Figure 2 for A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms
Figure 3 for A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms
Figure 4 for A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms
Viaarxiv icon

Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 02, 2020
Luisa Zintgraf, Leo Feng, Maximilian Igl, Kristian Hartikainen, Katja Hofmann, Shimon Whiteson

Figure 1 for Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning
Figure 2 for Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning
Figure 3 for Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning
Figure 4 for Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning
Viaarxiv icon

Exploiting Submodular Value Functions For Scaling Up Active Perception

Add code
Bookmark button
Alert button
Sep 21, 2020
Yash Satsangi, Shimon Whiteson, Frans A. Oliehoek, Matthijs T. J. Spaan

Figure 1 for Exploiting Submodular Value Functions For Scaling Up Active Perception
Figure 2 for Exploiting Submodular Value Functions For Scaling Up Active Perception
Figure 3 for Exploiting Submodular Value Functions For Scaling Up Active Perception
Figure 4 for Exploiting Submodular Value Functions For Scaling Up Active Perception
Viaarxiv icon

Real-Time Resource Allocation for Tracking Systems

Add code
Bookmark button
Alert button
Sep 21, 2020
Yash Satsangi, Shimon Whiteson, Frans A. Oliehoek, Henri Bouma

Figure 1 for Real-Time Resource Allocation for Tracking Systems
Figure 2 for Real-Time Resource Allocation for Tracking Systems
Figure 3 for Real-Time Resource Allocation for Tracking Systems
Figure 4 for Real-Time Resource Allocation for Tracking Systems
Viaarxiv icon

WordCraft: An Environment for Benchmarking Commonsense Agents

Add code
Bookmark button
Alert button
Jul 17, 2020
Minqi Jiang, Jelena Luketina, Nantas Nardelli, Pasquale Minervini, Philip H. S. Torr, Shimon Whiteson, Tim Rocktäschel

Figure 1 for WordCraft: An Environment for Benchmarking Commonsense Agents
Figure 2 for WordCraft: An Environment for Benchmarking Commonsense Agents
Figure 3 for WordCraft: An Environment for Benchmarking Commonsense Agents
Figure 4 for WordCraft: An Environment for Benchmarking Commonsense Agents
Viaarxiv icon

Learning Retrospective Knowledge with Reverse Reinforcement Learning

Add code
Bookmark button
Alert button
Jul 09, 2020
Shangtong Zhang, Vivek Veeriah, Shimon Whiteson

Figure 1 for Learning Retrospective Knowledge with Reverse Reinforcement Learning
Figure 2 for Learning Retrospective Knowledge with Reverse Reinforcement Learning
Figure 3 for Learning Retrospective Knowledge with Reverse Reinforcement Learning
Viaarxiv icon

Weighted QMIX: Expanding Monotonic Value Function Factorisation

Add code
Bookmark button
Alert button
Jun 18, 2020
Tabish Rashid, Gregory Farquhar, Bei Peng, Shimon Whiteson

Figure 1 for Weighted QMIX: Expanding Monotonic Value Function Factorisation
Figure 2 for Weighted QMIX: Expanding Monotonic Value Function Factorisation
Figure 3 for Weighted QMIX: Expanding Monotonic Value Function Factorisation
Figure 4 for Weighted QMIX: Expanding Monotonic Value Function Factorisation
Viaarxiv icon