Dhruva TB

On Multi-objective Policy Optimization as a Tool for Reinforcement Learning

Jun 15, 2021
Abbas Abdolmaleki, Sandy H. Huang, Giulia Vezzani, Bobak Shahriari, Jost Tobias Springenberg, Shruti Mishra, Dhruva TB, Arunkumar Byravan, Konstantinos Bousmalis, Andras Gyorgy, Csaba Szepesvari, Raia Hadsell, Nicolas Heess, Martin Riedmiller

Distributed Distributional Deterministic Policy Gradients

Apr 23, 2018
Gabriel Barth-Maron, Matthew W. Hoffman, David Budden, Will Dabney, Dan Horgan, Dhruva TB, Alistair Muldal, Nicolas Heess, Timothy Lillicrap

Probing Physics Knowledge Using Tools from Developmental Psychology

Apr 03, 2018
Luis Piloto, Ari Weinstein, Dhruva TB, Arun Ahuja, Mehdi Mirza, Greg Wayne, David Amos, Chia-chun Hung, Matt Botvinick

Emergence of Locomotion Behaviours in Rich Environments

Jul 10, 2017
Nicolas Heess, Dhruva TB, Srinivasan Sriram, Jay Lemmon, Josh Merel, Greg Wayne, Yuval Tassa, Tom Erez, Ziyu Wang, S. M. Ali Eslami, Martin Riedmiller, David Silver

Learning human behaviors from motion capture by adversarial imitation

Jul 10, 2017
Josh Merel, Yuval Tassa, Dhruva TB, Sriram Srinivasan, Jay Lemmon, Ziyu Wang, Greg Wayne, Nicolas Heess
