Picture for Yaqi Duan

Yaqi Duan

Taming "data-hungry" reinforcement learning? Stability in continuous state-action spaces

Add code
Jan 10, 2024
Viaarxiv icon

Policy evaluation from a single path: Multi-step methods, mixing and mis-specification

Add code
Nov 07, 2022
Figure 1 for Policy evaluation from a single path: Multi-step methods, mixing and mis-specification
Figure 2 for Policy evaluation from a single path: Multi-step methods, mixing and mis-specification
Figure 3 for Policy evaluation from a single path: Multi-step methods, mixing and mis-specification
Viaarxiv icon

Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism

Add code
Mar 11, 2022
Figure 1 for Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism
Viaarxiv icon

Adaptive and Robust Multi-task Learning

Add code
Feb 10, 2022
Figure 1 for Adaptive and Robust Multi-task Learning
Figure 2 for Adaptive and Robust Multi-task Learning
Figure 3 for Adaptive and Robust Multi-task Learning
Viaarxiv icon

Optimal policy evaluation using kernel-based temporal difference methods

Add code
Sep 24, 2021
Figure 1 for Optimal policy evaluation using kernel-based temporal difference methods
Figure 2 for Optimal policy evaluation using kernel-based temporal difference methods
Figure 3 for Optimal policy evaluation using kernel-based temporal difference methods
Figure 4 for Optimal policy evaluation using kernel-based temporal difference methods
Viaarxiv icon

PU-Flow: a Point Cloud Upsampling Networkwith Normalizing Flows

Add code
Jul 13, 2021
Figure 1 for PU-Flow: a Point Cloud Upsampling Networkwith Normalizing Flows
Figure 2 for PU-Flow: a Point Cloud Upsampling Networkwith Normalizing Flows
Figure 3 for PU-Flow: a Point Cloud Upsampling Networkwith Normalizing Flows
Figure 4 for PU-Flow: a Point Cloud Upsampling Networkwith Normalizing Flows
Viaarxiv icon

Learning Good State and Action Representations via Tensor Decomposition

Add code
May 03, 2021
Figure 1 for Learning Good State and Action Representations via Tensor Decomposition
Figure 2 for Learning Good State and Action Representations via Tensor Decomposition
Figure 3 for Learning Good State and Action Representations via Tensor Decomposition
Figure 4 for Learning Good State and Action Representations via Tensor Decomposition
Viaarxiv icon

Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning

Add code
Mar 25, 2021
Figure 1 for Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning
Figure 2 for Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning
Viaarxiv icon

Bootstrapping Statistical Inference for Off-Policy Evaluation

Add code
Feb 09, 2021
Figure 1 for Bootstrapping Statistical Inference for Off-Policy Evaluation
Figure 2 for Bootstrapping Statistical Inference for Off-Policy Evaluation
Figure 3 for Bootstrapping Statistical Inference for Off-Policy Evaluation
Figure 4 for Bootstrapping Statistical Inference for Off-Policy Evaluation
Viaarxiv icon

Sparse Feature Selection Makes Batch Reinforcement Learning More Sample Efficient

Add code
Nov 08, 2020
Viaarxiv icon