Lihong Li

Near-optimal Representation Learning for Linear Bandits and Linear RL

Feb 08, 2021
Via arXiv

CoinDICE: Off-Policy Confidence Interval Estimation

Oct 22, 2020
Via arXiv

Neural Thompson Sampling

Oct 02, 2020
Via arXiv

Efficient Reinforcement Learning in Factored MDPs with Application to Constrained RL

Sep 15, 2020
Via arXiv

Off-policy Evaluation in Infinite-Horizon Reinforcement Learning with Latent Confounders

Jul 27, 2020
Via arXiv

Off-Policy Evaluation via the Regularized Lagrangian

Jul 07, 2020
Via arXiv

Black-box Off-policy Estimation for Infinite-Horizon Reinforcement Learning

Mar 24, 2020
Via arXiv

Batch Stationary Distribution Estimation

Mar 02, 2020
Via arXiv

GenDICE: Generalized Offline Estimation of Stationary Values

Feb 21, 2020
Via arXiv

Data Efficient Training for Reinforcement Learning with Adaptive Behavior Policy Sharing

Feb 12, 2020
Via arXiv