Picture for Chengchun Shi

Chengchun Shi

Log-Sum-Exponential Estimator for Off-Policy Evaluation and Learning

Add code
Jun 07, 2025
Viaarxiv icon

Demystifying the Paradox of Importance Sampling with an Estimated History-Dependent Behavior Policy in Off-Policy Evaluation

Add code
May 28, 2025
Viaarxiv icon

Semi-pessimistic Reinforcement Learning

Add code
May 25, 2025
Viaarxiv icon

Deep Distributional Learning with Non-crossing Quantile Network

Add code
Apr 11, 2025
Viaarxiv icon

Statistical Inference in Reinforcement Learning: A Selective Survey

Add code
Feb 22, 2025
Viaarxiv icon

Counterfactually Fair Reinforcement Learning via Sequential Data Preprocessing

Add code
Jan 14, 2025
Viaarxiv icon

Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement Learning

Add code
Dec 08, 2024
Viaarxiv icon

Dual Active Learning for Reinforcement Learning from Human Feedback

Add code
Oct 03, 2024
Viaarxiv icon

Causal Deepsets for Off-policy Evaluation under Spatial or Spatio-temporal Interferences

Add code
Jul 25, 2024
Figure 1 for Causal Deepsets for Off-policy Evaluation under Spatial or Spatio-temporal Interferences
Figure 2 for Causal Deepsets for Off-policy Evaluation under Spatial or Spatio-temporal Interferences
Figure 3 for Causal Deepsets for Off-policy Evaluation under Spatial or Spatio-temporal Interferences
Figure 4 for Causal Deepsets for Off-policy Evaluation under Spatial or Spatio-temporal Interferences
Viaarxiv icon

Forward and Backward State Abstractions for Off-policy Evaluation

Add code
Jun 27, 2024
Viaarxiv icon