Alert button
Picture for Martin Riedmiller

Martin Riedmiller

Alert button

Continuous-Discrete Reinforcement Learning for Hybrid Control in Robotics

Add code
Bookmark button
Alert button
Jan 02, 2020
Michael Neunert, Abbas Abdolmaleki, Markus Wulfmeier, Thomas Lampe, Jost Tobias Springenberg, Roland Hafner, Francesco Romano, Jonas Buchli, Nicolas Heess, Martin Riedmiller

Figure 1 for Continuous-Discrete Reinforcement Learning for Hybrid Control in Robotics
Figure 2 for Continuous-Discrete Reinforcement Learning for Hybrid Control in Robotics
Figure 3 for Continuous-Discrete Reinforcement Learning for Hybrid Control in Robotics
Figure 4 for Continuous-Discrete Reinforcement Learning for Hybrid Control in Robotics
Viaarxiv icon

Quinoa: a Q-function You Infer Normalized Over Actions

Add code
Bookmark button
Alert button
Nov 05, 2019
Jonas Degrave, Abbas Abdolmaleki, Jost Tobias Springenberg, Nicolas Heess, Martin Riedmiller

Figure 1 for Quinoa: a Q-function You Infer Normalized Over Actions
Figure 2 for Quinoa: a Q-function You Infer Normalized Over Actions
Viaarxiv icon

Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models

Add code
Bookmark button
Alert button
Oct 09, 2019
Arunkumar Byravan, Jost Tobias Springenberg, Abbas Abdolmaleki, Roland Hafner, Michael Neunert, Thomas Lampe, Noah Siegel, Nicolas Heess, Martin Riedmiller

Figure 1 for Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models
Figure 2 for Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models
Figure 3 for Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models
Figure 4 for Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models
Viaarxiv icon

V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control

Add code
Bookmark button
Alert button
Sep 26, 2019
H. Francis Song, Abbas Abdolmaleki, Jost Tobias Springenberg, Aidan Clark, Hubert Soyer, Jack W. Rae, Seb Noury, Arun Ahuja, Siqi Liu, Dhruva Tirumala, Nicolas Heess, Dan Belov, Martin Riedmiller, Matthew M. Botvinick

Figure 1 for V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Figure 2 for V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Figure 3 for V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Figure 4 for V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Viaarxiv icon

Regularized Hierarchical Policies for Compositional Transfer in Robotics

Add code
Bookmark button
Alert button
Jun 27, 2019
Markus Wulfmeier, Abbas Abdolmaleki, Roland Hafner, Jost Tobias Springenberg, Michael Neunert, Tim Hertweck, Thomas Lampe, Noah Siegel, Nicolas Heess, Martin Riedmiller

Figure 1 for Regularized Hierarchical Policies for Compositional Transfer in Robotics
Figure 2 for Regularized Hierarchical Policies for Compositional Transfer in Robotics
Figure 3 for Regularized Hierarchical Policies for Compositional Transfer in Robotics
Figure 4 for Regularized Hierarchical Policies for Compositional Transfer in Robotics
Viaarxiv icon

Robust Reinforcement Learning for Continuous Control with Model Misspecification

Add code
Bookmark button
Alert button
Jun 18, 2019
Daniel J. Mankowitz, Nir Levine, Rae Jeong, Abbas Abdolmaleki, Jost Tobias Springenberg, Timothy Mann, Todd Hester, Martin Riedmiller

Figure 1 for Robust Reinforcement Learning for Continuous Control with Model Misspecification
Figure 2 for Robust Reinforcement Learning for Continuous Control with Model Misspecification
Figure 3 for Robust Reinforcement Learning for Continuous Control with Model Misspecification
Figure 4 for Robust Reinforcement Learning for Continuous Control with Model Misspecification
Viaarxiv icon

Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup

Add code
Bookmark button
Alert button
Feb 18, 2019
Devin Schwab, Tobias Springenberg, Murilo F. Martins, Thomas Lampe, Michael Neunert, Abbas Abdolmaleki, Tim Hertweck, Roland Hafner, Francesco Nori, Martin Riedmiller

Figure 1 for Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup
Figure 2 for Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup
Figure 3 for Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup
Figure 4 for Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup
Viaarxiv icon

Self-supervised Learning of Image Embedding for Continuous Control

Add code
Bookmark button
Alert button
Jan 03, 2019
Carlos Florensa, Jonas Degrave, Nicolas Heess, Jost Tobias Springenberg, Martin Riedmiller

Figure 1 for Self-supervised Learning of Image Embedding for Continuous Control
Figure 2 for Self-supervised Learning of Image Embedding for Continuous Control
Figure 3 for Self-supervised Learning of Image Embedding for Continuous Control
Figure 4 for Self-supervised Learning of Image Embedding for Continuous Control
Viaarxiv icon

Relative Entropy Regularized Policy Iteration

Add code
Bookmark button
Alert button
Dec 05, 2018
Abbas Abdolmaleki, Jost Tobias Springenberg, Jonas Degrave, Steven Bohez, Yuval Tassa, Dan Belov, Nicolas Heess, Martin Riedmiller

Figure 1 for Relative Entropy Regularized Policy Iteration
Figure 2 for Relative Entropy Regularized Policy Iteration
Figure 3 for Relative Entropy Regularized Policy Iteration
Figure 4 for Relative Entropy Regularized Policy Iteration
Viaarxiv icon

Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards

Add code
Bookmark button
Alert button
Oct 08, 2018
Mel Vecerik, Todd Hester, Jonathan Scholz, Fumin Wang, Olivier Pietquin, Bilal Piot, Nicolas Heess, Thomas Rothörl, Thomas Lampe, Martin Riedmiller

Figure 1 for Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards
Figure 2 for Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards
Figure 3 for Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards
Figure 4 for Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards
Viaarxiv icon