Picture for Chengchun Shi

Chengchun Shi

Log-Sum-Exponential Estimator for Off-Policy Evaluation and Learning

Add code
Jun 07, 2025
Viaarxiv icon

Demystifying the Paradox of Importance Sampling with an Estimated History-Dependent Behavior Policy in Off-Policy Evaluation

Add code
May 28, 2025
Viaarxiv icon

Semi-pessimistic Reinforcement Learning

Add code
May 25, 2025
Viaarxiv icon

Deep Distributional Learning with Non-crossing Quantile Network

Add code
Apr 11, 2025
Viaarxiv icon

Statistical Inference in Reinforcement Learning: A Selective Survey

Add code
Feb 22, 2025
Viaarxiv icon

Counterfactually Fair Reinforcement Learning via Sequential Data Preprocessing

Add code
Jan 14, 2025
Viaarxiv icon

Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement Learning

Add code
Dec 08, 2024
Figure 1 for Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement Learning
Figure 2 for Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement Learning
Figure 3 for Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement Learning
Figure 4 for Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement Learning
Viaarxiv icon

Dual Active Learning for Reinforcement Learning from Human Feedback

Add code
Oct 03, 2024
Viaarxiv icon

Causal Deepsets for Off-policy Evaluation under Spatial or Spatio-temporal Interferences

Add code
Jul 25, 2024
Figure 1 for Causal Deepsets for Off-policy Evaluation under Spatial or Spatio-temporal Interferences
Figure 2 for Causal Deepsets for Off-policy Evaluation under Spatial or Spatio-temporal Interferences
Figure 3 for Causal Deepsets for Off-policy Evaluation under Spatial or Spatio-temporal Interferences
Figure 4 for Causal Deepsets for Off-policy Evaluation under Spatial or Spatio-temporal Interferences
Viaarxiv icon

Forward and Backward State Abstractions for Off-policy Evaluation

Add code
Jun 27, 2024
Viaarxiv icon