Picture for Yaqi Duan

Yaqi Duan

Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning

Add code
Mar 25, 2021
Figure 1 for Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning
Figure 2 for Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning
Viaarxiv icon

Bootstrapping Statistical Inference for Off-Policy Evaluation

Add code
Feb 09, 2021
Figure 1 for Bootstrapping Statistical Inference for Off-Policy Evaluation
Figure 2 for Bootstrapping Statistical Inference for Off-Policy Evaluation
Figure 3 for Bootstrapping Statistical Inference for Off-Policy Evaluation
Figure 4 for Bootstrapping Statistical Inference for Off-Policy Evaluation
Viaarxiv icon

Sparse Feature Selection Makes Batch Reinforcement Learning More Sample Efficient

Add code
Nov 08, 2020
Viaarxiv icon

Minimax-Optimal Off-Policy Evaluation with Linear Function Approximation

Add code
Feb 21, 2020
Viaarxiv icon

Learning low-dimensional state embeddings and metastable clusters from time series data

Add code
Jun 01, 2019
Figure 1 for Learning low-dimensional state embeddings and metastable clusters from time series data
Figure 2 for Learning low-dimensional state embeddings and metastable clusters from time series data
Figure 3 for Learning low-dimensional state embeddings and metastable clusters from time series data
Figure 4 for Learning low-dimensional state embeddings and metastable clusters from time series data
Viaarxiv icon

State Aggregation Learning from Markov Transition Data

Add code
Nov 06, 2018
Figure 1 for State Aggregation Learning from Markov Transition Data
Figure 2 for State Aggregation Learning from Markov Transition Data
Figure 3 for State Aggregation Learning from Markov Transition Data
Figure 4 for State Aggregation Learning from Markov Transition Data
Viaarxiv icon

Adaptive Low-Nonnegative-Rank Approximation for State Aggregation of Markov Chains

Add code
Oct 14, 2018
Figure 1 for Adaptive Low-Nonnegative-Rank Approximation for State Aggregation of Markov Chains
Figure 2 for Adaptive Low-Nonnegative-Rank Approximation for State Aggregation of Markov Chains
Figure 3 for Adaptive Low-Nonnegative-Rank Approximation for State Aggregation of Markov Chains
Figure 4 for Adaptive Low-Nonnegative-Rank Approximation for State Aggregation of Markov Chains
Viaarxiv icon