Alert button
Picture for Dhruva Tirumala

Dhruva Tirumala

Alert button

Replay across Experiments: A Natural Extension of Off-Policy RL

Add code
Bookmark button
Alert button
Nov 28, 2023
Dhruva Tirumala, Thomas Lampe, Jose Enrique Chen, Tuomas Haarnoja, Sandy Huang, Guy Lever, Ben Moran, Tim Hertweck, Leonard Hasenclever, Martin Riedmiller, Nicolas Heess, Markus Wulfmeier

Viaarxiv icon

Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Apr 26, 2023
Tuomas Haarnoja, Ben Moran, Guy Lever, Sandy H. Huang, Dhruva Tirumala, Markus Wulfmeier, Jan Humplik, Saran Tunyasuvunakool, Noah Y. Siegel, Roland Hafner, Michael Bloesch, Kristian Hartikainen, Arunkumar Byravan, Leonard Hasenclever, Yuval Tassa, Fereshteh Sadeghi, Nathan Batchelor, Federico Casarini, Stefano Saliceti, Charles Game, Neil Sreendra, Kushal Patel, Marlon Gwira, Andrea Huber, Nicole Hurley, Francesco Nori, Raia Hadsell, Nicolas Heess

Figure 1 for Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning
Figure 2 for Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning
Figure 3 for Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning
Figure 4 for Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning
Viaarxiv icon

SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration

Add code
Bookmark button
Alert button
Dec 03, 2022
Giulia Vezzani, Dhruva Tirumala, Markus Wulfmeier, Dushyant Rao, Abbas Abdolmaleki, Ben Moran, Tuomas Haarnoja, Jan Humplik, Roland Hafner, Michael Neunert, Claudio Fantacci, Tim Hertweck, Thomas Lampe, Fereshteh Sadeghi, Nicolas Heess, Martin Riedmiller

Figure 1 for SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration
Figure 2 for SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration
Figure 3 for SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration
Figure 4 for SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration
Viaarxiv icon

MO2: Model-Based Offline Options

Add code
Bookmark button
Alert button
Sep 05, 2022
Sasha Salter, Markus Wulfmeier, Dhruva Tirumala, Nicolas Heess, Martin Riedmiller, Raia Hadsell, Dushyant Rao

Figure 1 for MO2: Model-Based Offline Options
Figure 2 for MO2: Model-Based Offline Options
Figure 3 for MO2: Model-Based Offline Options
Figure 4 for MO2: Model-Based Offline Options
Viaarxiv icon

Learning Transferable Motor Skills with Hierarchical Latent Mixture Policies

Add code
Bookmark button
Alert button
Dec 09, 2021
Dushyant Rao, Fereshteh Sadeghi, Leonard Hasenclever, Markus Wulfmeier, Martina Zambelli, Giulia Vezzani, Dhruva Tirumala, Yusuf Aytar, Josh Merel, Nicolas Heess, Raia Hadsell

Figure 1 for Learning Transferable Motor Skills with Hierarchical Latent Mixture Policies
Figure 2 for Learning Transferable Motor Skills with Hierarchical Latent Mixture Policies
Figure 3 for Learning Transferable Motor Skills with Hierarchical Latent Mixture Policies
Figure 4 for Learning Transferable Motor Skills with Hierarchical Latent Mixture Policies
Viaarxiv icon

Pick Your Battles: Interaction Graphs as Population-Level Objectives for Strategic Diversity

Add code
Bookmark button
Alert button
Oct 08, 2021
Marta Garnelo, Wojciech Marian Czarnecki, Siqi Liu, Dhruva Tirumala, Junhyuk Oh, Gauthier Gidel, Hado van Hasselt, David Balduzzi

Viaarxiv icon

Behavior Priors for Efficient Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 27, 2020
Dhruva Tirumala, Alexandre Galashov, Hyeonwoo Noh, Leonard Hasenclever, Razvan Pascanu, Jonathan Schwarz, Guillaume Desjardins, Wojciech Marian Czarnecki, Arun Ahuja, Yee Whye Teh, Nicolas Heess

Figure 1 for Behavior Priors for Efficient Reinforcement Learning
Figure 2 for Behavior Priors for Efficient Reinforcement Learning
Figure 3 for Behavior Priors for Efficient Reinforcement Learning
Figure 4 for Behavior Priors for Efficient Reinforcement Learning
Viaarxiv icon

Data-efficient Hindsight Off-policy Option Learning

Add code
Bookmark button
Alert button
Jul 30, 2020
Markus Wulfmeier, Dushyant Rao, Roland Hafner, Thomas Lampe, Abbas Abdolmaleki, Tim Hertweck, Michael Neunert, Dhruva Tirumala, Noah Siegel, Nicolas Heess, Martin Riedmiller

Figure 1 for Data-efficient Hindsight Off-policy Option Learning
Figure 2 for Data-efficient Hindsight Off-policy Option Learning
Figure 3 for Data-efficient Hindsight Off-policy Option Learning
Figure 4 for Data-efficient Hindsight Off-policy Option Learning
Viaarxiv icon

V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control

Add code
Bookmark button
Alert button
Sep 26, 2019
H. Francis Song, Abbas Abdolmaleki, Jost Tobias Springenberg, Aidan Clark, Hubert Soyer, Jack W. Rae, Seb Noury, Arun Ahuja, Siqi Liu, Dhruva Tirumala, Nicolas Heess, Dan Belov, Martin Riedmiller, Matthew M. Botvinick

Figure 1 for V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Figure 2 for V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Figure 3 for V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Figure 4 for V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Viaarxiv icon