Picture for Zhengling Qi

Zhengling Qi

Quantile-Optimal Policy Learning under Unmeasured Confounding

Add code
Jun 08, 2025
Viaarxiv icon

Reinforcement Learning with Continuous Actions Under Unmeasured Confounding

Add code
May 01, 2025
Viaarxiv icon

Offline Dynamic Inventory and Pricing Strategy: Addressing Censored and Dependent Demand

Add code
Apr 14, 2025
Viaarxiv icon

Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement Learning

Add code
Dec 08, 2024
Viaarxiv icon

A Tale of Two Cities: Pessimism and Opportunism in Offline Dynamic Pricing

Add code
Nov 12, 2024
Viaarxiv icon

Learning Robust Treatment Rules for Censored Data

Add code
Aug 17, 2024
Viaarxiv icon

Distributional Off-policy Evaluation with Bellman Residual Minimization

Add code
Feb 02, 2024
Viaarxiv icon

Robust Offline Policy Evaluation and Optimization with Heavy-Tailed Rewards

Add code
Oct 28, 2023
Viaarxiv icon

Off-policy Evaluation in Doubly Inhomogeneous Environments

Add code
Jun 14, 2023
Viaarxiv icon

A Policy Gradient Method for Confounded POMDPs

Add code
May 26, 2023
Figure 1 for A Policy Gradient Method for Confounded POMDPs
Figure 2 for A Policy Gradient Method for Confounded POMDPs
Figure 3 for A Policy Gradient Method for Confounded POMDPs
Figure 4 for A Policy Gradient Method for Confounded POMDPs
Viaarxiv icon