Shikai Luo

Pessimistic Causal Reinforcement Learning with Mediators for Confounded Offline Data

Mar 18, 2024
Danyang Wang, Chengchun Shi, Shikai Luo, Will Wei Sun

Robust Offline Policy Evaluation and Optimization with Heavy-Tailed Rewards

Oct 28, 2023
Jin Zhu, Runzhe Wan, Zhengling Qi, Shikai Luo, Chengchun Shi

An Instrumental Variable Approach to Confounded Off-Policy Evaluation

Dec 29, 2022
Yang Xu, Jin Zhu, Chengchun Shi, Shikai Luo, Rui Song

Quantile Off-Policy Evaluation via Deep Conditional Generative Learning

Dec 29, 2022
Yang Xu, Chengchun Shi, Shikai Luo, Lan Wang, Rui Song

Conformal Off-Policy Prediction

Jun 14, 2022
Yingying Zhang, Chengchun Shi, Shikai Luo

Off-Policy Confidence Interval Estimation with Confounded Markov Decision Process

Mar 12, 2022
Chengchun Shi, Jin Zhu, Ye Shen, Shikai Luo, Hongtu Zhu, Rui Song

Statistically Efficient Advantage Learning for Offline Reinforcement Learning in Infinite Horizons

Feb 26, 2022
Chengchun Shi, Shikai Luo, Hongtu Zhu, Rui Song

Policy Evaluation for Temporal and/or Spatial Dependent Experiments in Ride-sourcing Platforms

Feb 22, 2022
Shikai Luo, Ying Yang, Chengchun Shi, Fang Yao, Jieping Ye, Hongtu Zhu

A Multi-Agent Reinforcement Learning Framework for Off-Policy Evaluation in Two-sided Markets

Feb 21, 2022
Chengchun Shi, Runzhe Wan, Ge Song, Shikai Luo, Rui Song, Hongtu Zhu
