Alert button
Picture for Abbas Abdolmaleki

Abbas Abdolmaleki

Alert button

Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach

Add code
Bookmark button
Alert button
Apr 22, 2022
Bobak Shahriari, Abbas Abdolmaleki, Arunkumar Byravan, Abe Friesen, Siqi Liu, Jost Tobias Springenberg, Nicolas Heess, Matt Hoffman, Martin Riedmiller

Figure 1 for Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach
Figure 2 for Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach
Figure 3 for Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach
Figure 4 for Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach
Viaarxiv icon

Revisiting Gaussian mixture critic in off-policy reinforcement learning: a sample-based approach

Add code
Bookmark button
Alert button
Apr 21, 2022
Bobak Shahriari, Abbas Abdolmaleki, Arunkumar Byravan, Abe Friesen, Siqi Liu, Jost Tobias Springenberg, Nicolas Heess, Matt Hoffman, Martin Riedmiller

Figure 1 for Revisiting Gaussian mixture critic in off-policy reinforcement learning: a sample-based approach
Figure 2 for Revisiting Gaussian mixture critic in off-policy reinforcement learning: a sample-based approach
Figure 3 for Revisiting Gaussian mixture critic in off-policy reinforcement learning: a sample-based approach
Figure 4 for Revisiting Gaussian mixture critic in off-policy reinforcement learning: a sample-based approach
Viaarxiv icon

Offline Distillation for Robot Lifelong Learning with Imbalanced Experience

Add code
Bookmark button
Alert button
Apr 12, 2022
Wenxuan Zhou, Steven Bohez, Jan Humplik, Abbas Abdolmaleki, Dushyant Rao, Markus Wulfmeier, Tuomas Haarnoja, Nicolas Heess

Figure 1 for Offline Distillation for Robot Lifelong Learning with Imbalanced Experience
Figure 2 for Offline Distillation for Robot Lifelong Learning with Imbalanced Experience
Figure 3 for Offline Distillation for Robot Lifelong Learning with Imbalanced Experience
Figure 4 for Offline Distillation for Robot Lifelong Learning with Imbalanced Experience
Viaarxiv icon

Beyond Pick-and-Place: Tackling Robotic Stacking of Diverse Shapes

Add code
Bookmark button
Alert button
Nov 03, 2021
Alex X. Lee, Coline Devin, Yuxiang Zhou, Thomas Lampe, Konstantinos Bousmalis, Jost Tobias Springenberg, Arunkumar Byravan, Abbas Abdolmaleki, Nimrod Gileadi, David Khosid, Claudio Fantacci, Jose Enrique Chen, Akhil Raju, Rae Jeong, Michael Neunert, Antoine Laurens, Stefano Saliceti, Federico Casarini, Martin Riedmiller, Raia Hadsell, Francesco Nori

Figure 1 for Beyond Pick-and-Place: Tackling Robotic Stacking of Diverse Shapes
Figure 2 for Beyond Pick-and-Place: Tackling Robotic Stacking of Diverse Shapes
Figure 3 for Beyond Pick-and-Place: Tackling Robotic Stacking of Diverse Shapes
Figure 4 for Beyond Pick-and-Place: Tackling Robotic Stacking of Diverse Shapes
Viaarxiv icon

Evaluating model-based planning and planner amortization for continuous control

Add code
Bookmark button
Alert button
Oct 07, 2021
Arunkumar Byravan, Leonard Hasenclever, Piotr Trochim, Mehdi Mirza, Alessandro Davide Ialongo, Yuval Tassa, Jost Tobias Springenberg, Abbas Abdolmaleki, Nicolas Heess, Josh Merel, Martin Riedmiller

Figure 1 for Evaluating model-based planning and planner amortization for continuous control
Figure 2 for Evaluating model-based planning and planner amortization for continuous control
Figure 3 for Evaluating model-based planning and planner amortization for continuous control
Figure 4 for Evaluating model-based planning and planner amortization for continuous control
Viaarxiv icon

On Multi-objective Policy Optimization as a Tool for Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 15, 2021
Abbas Abdolmaleki, Sandy H. Huang, Giulia Vezzani, Bobak Shahriari, Jost Tobias Springenberg, Shruti Mishra, Dhruva TB, Arunkumar Byravan, Konstantinos Bousmalis, Andras Gyorgy, Csaba Szepesvari, Raia Hadsell, Nicolas Heess, Martin Riedmiller

Figure 1 for On Multi-objective Policy Optimization as a Tool for Reinforcement Learning
Figure 2 for On Multi-objective Policy Optimization as a Tool for Reinforcement Learning
Figure 3 for On Multi-objective Policy Optimization as a Tool for Reinforcement Learning
Figure 4 for On Multi-objective Policy Optimization as a Tool for Reinforcement Learning
Viaarxiv icon

From Motor Control to Team Play in Simulated Humanoid Football

Add code
Bookmark button
Alert button
May 25, 2021
Siqi Liu, Guy Lever, Zhe Wang, Josh Merel, S. M. Ali Eslami, Daniel Hennes, Wojciech M. Czarnecki, Yuval Tassa, Shayegan Omidshafiei, Abbas Abdolmaleki, Noah Y. Siegel, Leonard Hasenclever, Luke Marris, Saran Tunyasuvunakool, H. Francis Song, Markus Wulfmeier, Paul Muller, Tuomas Haarnoja, Brendan D. Tracey, Karl Tuyls, Thore Graepel, Nicolas Heess

Figure 1 for From Motor Control to Team Play in Simulated Humanoid Football
Figure 2 for From Motor Control to Team Play in Simulated Humanoid Football
Figure 3 for From Motor Control to Team Play in Simulated Humanoid Football
Figure 4 for From Motor Control to Team Play in Simulated Humanoid Football
Viaarxiv icon

Rethinking Exploration for Sample-Efficient Policy Learning

Add code
Bookmark button
Alert button
Jan 23, 2021
William F. Whitney, Michael Bloesch, Jost Tobias Springenberg, Abbas Abdolmaleki, Martin Riedmiller

Figure 1 for Rethinking Exploration for Sample-Efficient Policy Learning
Figure 2 for Rethinking Exploration for Sample-Efficient Policy Learning
Figure 3 for Rethinking Exploration for Sample-Efficient Policy Learning
Figure 4 for Rethinking Exploration for Sample-Efficient Policy Learning
Viaarxiv icon

"What, not how": Solving an under-actuated insertion task from scratch

Add code
Bookmark button
Alert button
Oct 30, 2020
Giulia Vezzani, Michael Neunert, Markus Wulfmeier, Rae Jeong, Thomas Lampe, Noah Siegel, Roland Hafner, Abbas Abdolmaleki, Martin Riedmiller, Francesco Nori

Figure 1 for "What, not how": Solving an under-actuated insertion task from scratch
Figure 2 for "What, not how": Solving an under-actuated insertion task from scratch
Figure 3 for "What, not how": Solving an under-actuated insertion task from scratch
Figure 4 for "What, not how": Solving an under-actuated insertion task from scratch
Viaarxiv icon