Alert button
Picture for Abbas Abdolmaleki

Abbas Abdolmaleki

Alert button

Augmenting learning using symmetry in a biologically-inspired domain

Add code
Bookmark button
Alert button
Oct 01, 2019
Shruti Mishra, Abbas Abdolmaleki, Arthur Guez, Piotr Trochim, Doina Precup

Figure 1 for Augmenting learning using symmetry in a biologically-inspired domain
Figure 2 for Augmenting learning using symmetry in a biologically-inspired domain
Viaarxiv icon

V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control

Add code
Bookmark button
Alert button
Sep 26, 2019
H. Francis Song, Abbas Abdolmaleki, Jost Tobias Springenberg, Aidan Clark, Hubert Soyer, Jack W. Rae, Seb Noury, Arun Ahuja, Siqi Liu, Dhruva Tirumala, Nicolas Heess, Dan Belov, Martin Riedmiller, Matthew M. Botvinick

Figure 1 for V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Figure 2 for V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Figure 3 for V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Figure 4 for V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Viaarxiv icon

Regularized Hierarchical Policies for Compositional Transfer in Robotics

Add code
Bookmark button
Alert button
Jun 27, 2019
Markus Wulfmeier, Abbas Abdolmaleki, Roland Hafner, Jost Tobias Springenberg, Michael Neunert, Tim Hertweck, Thomas Lampe, Noah Siegel, Nicolas Heess, Martin Riedmiller

Figure 1 for Regularized Hierarchical Policies for Compositional Transfer in Robotics
Figure 2 for Regularized Hierarchical Policies for Compositional Transfer in Robotics
Figure 3 for Regularized Hierarchical Policies for Compositional Transfer in Robotics
Figure 4 for Regularized Hierarchical Policies for Compositional Transfer in Robotics
Viaarxiv icon

Robust Reinforcement Learning for Continuous Control with Model Misspecification

Add code
Bookmark button
Alert button
Jun 18, 2019
Daniel J. Mankowitz, Nir Levine, Rae Jeong, Abbas Abdolmaleki, Jost Tobias Springenberg, Timothy Mann, Todd Hester, Martin Riedmiller

Figure 1 for Robust Reinforcement Learning for Continuous Control with Model Misspecification
Figure 2 for Robust Reinforcement Learning for Continuous Control with Model Misspecification
Figure 3 for Robust Reinforcement Learning for Continuous Control with Model Misspecification
Figure 4 for Robust Reinforcement Learning for Continuous Control with Model Misspecification
Viaarxiv icon

Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup

Add code
Bookmark button
Alert button
Feb 18, 2019
Devin Schwab, Tobias Springenberg, Murilo F. Martins, Thomas Lampe, Michael Neunert, Abbas Abdolmaleki, Tim Hertweck, Roland Hafner, Francesco Nori, Martin Riedmiller

Figure 1 for Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup
Figure 2 for Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup
Figure 3 for Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup
Figure 4 for Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup
Viaarxiv icon

Value constrained model-free continuous control

Add code
Bookmark button
Alert button
Feb 12, 2019
Steven Bohez, Abbas Abdolmaleki, Michael Neunert, Jonas Buchli, Nicolas Heess, Raia Hadsell

Figure 1 for Value constrained model-free continuous control
Figure 2 for Value constrained model-free continuous control
Figure 3 for Value constrained model-free continuous control
Figure 4 for Value constrained model-free continuous control
Viaarxiv icon

Relative Entropy Regularized Policy Iteration

Add code
Bookmark button
Alert button
Dec 05, 2018
Abbas Abdolmaleki, Jost Tobias Springenberg, Jonas Degrave, Steven Bohez, Yuval Tassa, Dan Belov, Nicolas Heess, Martin Riedmiller

Figure 1 for Relative Entropy Regularized Policy Iteration
Figure 2 for Relative Entropy Regularized Policy Iteration
Figure 3 for Relative Entropy Regularized Policy Iteration
Figure 4 for Relative Entropy Regularized Policy Iteration
Viaarxiv icon

Model-Free Trajectory-based Policy Optimization with Monotonic Improvement

Add code
Bookmark button
Alert button
Jul 02, 2018
Riad Akrour, Abbas Abdolmaleki, Hany Abdulsamad, Jan Peters, Gerhard Neumann

Figure 1 for Model-Free Trajectory-based Policy Optimization with Monotonic Improvement
Figure 2 for Model-Free Trajectory-based Policy Optimization with Monotonic Improvement
Figure 3 for Model-Free Trajectory-based Policy Optimization with Monotonic Improvement
Figure 4 for Model-Free Trajectory-based Policy Optimization with Monotonic Improvement
Viaarxiv icon

Maximum a Posteriori Policy Optimisation

Add code
Bookmark button
Alert button
Jun 14, 2018
Abbas Abdolmaleki, Jost Tobias Springenberg, Yuval Tassa, Remi Munos, Nicolas Heess, Martin Riedmiller

Figure 1 for Maximum a Posteriori Policy Optimisation
Figure 2 for Maximum a Posteriori Policy Optimisation
Figure 3 for Maximum a Posteriori Policy Optimisation
Figure 4 for Maximum a Posteriori Policy Optimisation
Viaarxiv icon

Guide Actor-Critic for Continuous Control

Add code
Bookmark button
Alert button
Feb 22, 2018
Voot Tangkaratt, Abbas Abdolmaleki, Masashi Sugiyama

Figure 1 for Guide Actor-Critic for Continuous Control
Figure 2 for Guide Actor-Critic for Continuous Control
Figure 3 for Guide Actor-Critic for Continuous Control
Figure 4 for Guide Actor-Critic for Continuous Control
Viaarxiv icon