
Abbas Abdolmaleki

V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control

Sep 26, 2019

Regularized Hierarchical Policies for Compositional Transfer in Robotics

Jun 27, 2019

Robust Reinforcement Learning for Continuous Control with Model Misspecification

Jun 18, 2019

Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup

Feb 18, 2019

Value constrained model-free continuous control

Feb 12, 2019

Relative Entropy Regularized Policy Iteration

Dec 05, 2018

Model-Free Trajectory-based Policy Optimization with Monotonic Improvement

Jul 02, 2018

Maximum a Posteriori Policy Optimisation

Jun 14, 2018

Guide Actor-Critic for Continuous Control

Feb 22, 2018

DeepMind Control Suite

Jan 02, 2018