Lihong Li

Efficient Reinforcement Learning in Factored MDPs with Application to Constrained RL

Aug 31, 2020
Xiaoyu Chen, Jiachen Hu, Lihong Li, Liwei Wang

Off-policy Evaluation in Infinite-Horizon Reinforcement Learning with Latent Confounders

Jul 27, 2020
Andrew Bennett, Nathan Kallus, Lihong Li, Ali Mousavi

Off-Policy Evaluation via the Regularized Lagrangian

Jul 07, 2020
Mengjiao Yang, Ofir Nachum, Bo Dai, Lihong Li, Dale Schuurmans

Black-box Off-policy Estimation for Infinite-Horizon Reinforcement Learning

Mar 24, 2020
Ali Mousavi, Lihong Li, Qiang Liu, Denny Zhou

Batch Stationary Distribution Estimation

Mar 02, 2020
Junfeng Wen, Bo Dai, Lihong Li, Dale Schuurmans

GenDICE: Generalized Offline Estimation of Stationary Values

Feb 21, 2020
Ruiyi Zhang, Bo Dai, Lihong Li, Dale Schuurmans

Data Efficient Training for Reinforcement Learning with Adaptive Behavior Policy Sharing

Feb 12, 2020
Ge Liu, Rui Wu, Heng-Tze Cheng, Jing Wang, Jayden Ooi, Lihong Li, Ang Li, Wai Lok Sibon Li, Craig Boutilier, Ed Chi

AlgaeDICE: Policy Gradient from Arbitrary Experience

Dec 04, 2019
Ofir Nachum, Bo Dai, Ilya Kostrikov, Yinlam Chow, Lihong Li, Dale Schuurmans

Neural Contextual Bandits with Upper Confidence Bound-Based Exploration

Nov 11, 2019
Dongruo Zhou, Lihong Li, Quanquan Gu

Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation

Oct 16, 2019
Ziyang Tang, Yihao Feng, Lihong Li, Dengyong Zhou, Qiang Liu
