Shimon Whiteson

Deep Coordination Graphs

Sep 27, 2019
Wendelin Böhmer, Vitaly Kurin, Shimon Whiteson

Via arXiv

Improving SAT Solver Heuristics with Graph Networks and Reinforcement Learning

Sep 26, 2019
Vitaly Kurin, Saad Godil, Shimon Whiteson, Bryan Catanzaro

Via arXiv

Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Estimators for Reinforcement Learning

Sep 23, 2019
Gregory Farquhar, Shimon Whiteson, Jakob Foerster

Via arXiv

Growing Action Spaces

Jun 28, 2019
Gregory Farquhar, Laura Gustafson, Zeming Lin, Shimon Whiteson, Nicolas Usunier, Gabriel Synnaeve

Via arXiv

A Survey of Reinforcement Learning Informed by Natural Language

Jun 10, 2019
Jelena Luketina, Nantas Nardelli, Gregory Farquhar, Jakob Foerster, Jacob Andreas, Edward Grefenstette, Shimon Whiteson, Tim Rocktäschel

Via arXiv

Exploration with Unreliable Intrinsic Reward in Multi-Agent Reinforcement Learning

Jun 05, 2019
Wendelin Böhmer, Tabish Rashid, Shimon Whiteson

Via arXiv

DAC: The Double Actor-Critic Architecture for Learning Options

May 16, 2019
Shangtong Zhang, Shimon Whiteson

Via arXiv

Deep Residual Reinforcement Learning

May 03, 2019
Shangtong Zhang, Wendelin Boehmer, Shimon Whiteson

Via arXiv

Multitask Soft Option Learning

Apr 01, 2019
Maximilian Igl, Andrew Gambardella, Nantas Nardelli, N. Siddharth, Wendelin Böhmer, Shimon Whiteson

Via arXiv

Generalized Off-Policy Actor-Critic

Mar 27, 2019
Shangtong Zhang, Wendelin Boehmer, Shimon Whiteson

Via arXiv