Yinlam Chow

Dima

Piecewise-Stationary Off-Policy Optimization

Jun 15, 2020

Predictive Coding for Locally-Linear Control

Mar 02, 2020

BRPO: Batch Residual Policy Optimization

Feb 08, 2020

AlgaeDICE: Policy Gradient from Arbitrary Experience

Dec 04, 2019

CAQL: Continuous Action Q-Learning

Oct 09, 2019

Prediction, Consistency, Curvature: Representation Learning for Locally-Linear Control

Sep 04, 2019

DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections

Jun 10, 2019

Lyapunov-based Safe Policy Optimization for Continuous Control

Jan 28, 2019

A Block Coordinate Ascent Algorithm for Mean-Variance Optimization

Nov 01, 2018

Risk-Sensitive Generative Adversarial Imitation Learning

Aug 13, 2018