Martin Riedmiller

A Distributional View on Multi-Objective Policy Optimization

May 15, 2020
Via arXiv

Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning

Feb 23, 2020
Via arXiv

Continuous-Discrete Reinforcement Learning for Hybrid Control in Robotics

Jan 02, 2020
Via arXiv

Quinoa: a Q-function You Infer Normalized Over Actions

Nov 05, 2019
Via arXiv

Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models

Oct 09, 2019
Via arXiv

V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control

Sep 26, 2019
Via arXiv

Regularized Hierarchical Policies for Compositional Transfer in Robotics

Jun 27, 2019
Via arXiv

Robust Reinforcement Learning for Continuous Control with Model Misspecification

Jun 18, 2019
Via arXiv

Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup

Feb 18, 2019
Via arXiv

Self-supervised Learning of Image Embedding for Continuous Control

Jan 03, 2019
Via arXiv