Alert button
Picture for Joseph Modayil

Joseph Modayil

Alert button

Towards model-free RL algorithms that scale well with unstructured data

Add code
Bookmark button
Alert button
Nov 03, 2023
Joseph Modayil, Zaheer Abbas

Figure 1 for Towards model-free RL algorithms that scale well with unstructured data
Figure 2 for Towards model-free RL algorithms that scale well with unstructured data
Figure 3 for Towards model-free RL algorithms that scale well with unstructured data
Figure 4 for Towards model-free RL algorithms that scale well with unstructured data
Viaarxiv icon

Loss of Plasticity in Continual Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 13, 2023
Zaheer Abbas, Rosie Zhao, Joseph Modayil, Adam White, Marlos C. Machado

Figure 1 for Loss of Plasticity in Continual Deep Reinforcement Learning
Figure 2 for Loss of Plasticity in Continual Deep Reinforcement Learning
Figure 3 for Loss of Plasticity in Continual Deep Reinforcement Learning
Figure 4 for Loss of Plasticity in Continual Deep Reinforcement Learning
Viaarxiv icon

The Frost Hollow Experiments: Pavlovian Signalling as a Path to Coordination and Communication Between Agents

Add code
Bookmark button
Alert button
Mar 17, 2022
Patrick M. Pilarski, Andrew Butcher, Elnaz Davoodi, Michael Bradley Johanson, Dylan J. A. Brenneis, Adam S. R. Parker, Leslie Acker, Matthew M. Botvinick, Joseph Modayil, Adam White

Figure 1 for The Frost Hollow Experiments: Pavlovian Signalling as a Path to Coordination and Communication Between Agents
Figure 2 for The Frost Hollow Experiments: Pavlovian Signalling as a Path to Coordination and Communication Between Agents
Figure 3 for The Frost Hollow Experiments: Pavlovian Signalling as a Path to Coordination and Communication Between Agents
Figure 4 for The Frost Hollow Experiments: Pavlovian Signalling as a Path to Coordination and Communication Between Agents
Viaarxiv icon

Pavlovian Signalling with General Value Functions in Agent-Agent Temporal Decision Making

Add code
Bookmark button
Alert button
Jan 11, 2022
Andrew Butcher, Michael Bradley Johanson, Elnaz Davoodi, Dylan J. A. Brenneis, Leslie Acker, Adam S. R. Parker, Adam White, Joseph Modayil, Patrick M. Pilarski

Figure 1 for Pavlovian Signalling with General Value Functions in Agent-Agent Temporal Decision Making
Figure 2 for Pavlovian Signalling with General Value Functions in Agent-Agent Temporal Decision Making
Figure 3 for Pavlovian Signalling with General Value Functions in Agent-Agent Temporal Decision Making
Figure 4 for Pavlovian Signalling with General Value Functions in Agent-Agent Temporal Decision Making
Viaarxiv icon

Assessing Human Interaction in Virtual Reality With Continually Learning Prediction Agents Based on Reinforcement Learning Algorithms: A Pilot Study

Add code
Bookmark button
Alert button
Dec 14, 2021
Dylan J. A. Brenneis, Adam S. Parker, Michael Bradley Johanson, Andrew Butcher, Elnaz Davoodi, Leslie Acker, Matthew M. Botvinick, Joseph Modayil, Adam White, Patrick M. Pilarski

Figure 1 for Assessing Human Interaction in Virtual Reality With Continually Learning Prediction Agents Based on Reinforcement Learning Algorithms: A Pilot Study
Figure 2 for Assessing Human Interaction in Virtual Reality With Continually Learning Prediction Agents Based on Reinforcement Learning Algorithms: A Pilot Study
Figure 3 for Assessing Human Interaction in Virtual Reality With Continually Learning Prediction Agents Based on Reinforcement Learning Algorithms: A Pilot Study
Figure 4 for Assessing Human Interaction in Virtual Reality With Continually Learning Prediction Agents Based on Reinforcement Learning Algorithms: A Pilot Study
Viaarxiv icon

Adapting the Function Approximation Architecture in Online Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 17, 2021
John D. Martin, Joseph Modayil

Figure 1 for Adapting the Function Approximation Architecture in Online Reinforcement Learning
Figure 2 for Adapting the Function Approximation Architecture in Online Reinforcement Learning
Figure 3 for Adapting the Function Approximation Architecture in Online Reinforcement Learning
Figure 4 for Adapting the Function Approximation Architecture in Online Reinforcement Learning
Viaarxiv icon

On Inductive Biases in Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Jul 05, 2019
Matteo Hessel, Hado van Hasselt, Joseph Modayil, David Silver

Figure 1 for On Inductive Biases in Deep Reinforcement Learning
Figure 2 for On Inductive Biases in Deep Reinforcement Learning
Figure 3 for On Inductive Biases in Deep Reinforcement Learning
Figure 4 for On Inductive Biases in Deep Reinforcement Learning
Viaarxiv icon

Ray Interference: a Source of Plateaus in Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Apr 25, 2019
Tom Schaul, Diana Borsa, Joseph Modayil, Razvan Pascanu

Figure 1 for Ray Interference: a Source of Plateaus in Deep Reinforcement Learning
Figure 2 for Ray Interference: a Source of Plateaus in Deep Reinforcement Learning
Figure 3 for Ray Interference: a Source of Plateaus in Deep Reinforcement Learning
Figure 4 for Ray Interference: a Source of Plateaus in Deep Reinforcement Learning
Viaarxiv icon

Deep Reinforcement Learning and the Deadly Triad

Add code
Bookmark button
Alert button
Dec 06, 2018
Hado van Hasselt, Yotam Doron, Florian Strub, Matteo Hessel, Nicolas Sonnerat, Joseph Modayil

Figure 1 for Deep Reinforcement Learning and the Deadly Triad
Figure 2 for Deep Reinforcement Learning and the Deadly Triad
Figure 3 for Deep Reinforcement Learning and the Deadly Triad
Figure 4 for Deep Reinforcement Learning and the Deadly Triad
Viaarxiv icon

The Barbados 2018 List of Open Issues in Continual Learning

Add code
Bookmark button
Alert button
Nov 16, 2018
Tom Schaul, Hado van Hasselt, Joseph Modayil, Martha White, Adam White, Pierre-Luc Bacon, Jean Harb, Shibl Mourad, Marc Bellemare, Doina Precup

Viaarxiv icon