Alert button
Picture for Zhengling Qi

Zhengling Qi

Alert button

Distributional Off-policy Evaluation with Bellman Residual Minimization

Add code
Bookmark button
Alert button
Feb 02, 2024
Sungee Hong, Zhengling Qi, Raymond K. W. Wong

Viaarxiv icon

Robust Offline Policy Evaluation and Optimization with Heavy-Tailed Rewards

Add code
Bookmark button
Alert button
Oct 28, 2023
Jin Zhu, Runzhe Wan, Zhengling Qi, Shikai Luo, Chengchun Shi

Figure 1 for Robust Offline Policy Evaluation and Optimization with Heavy-Tailed Rewards
Figure 2 for Robust Offline Policy Evaluation and Optimization with Heavy-Tailed Rewards
Figure 3 for Robust Offline Policy Evaluation and Optimization with Heavy-Tailed Rewards
Figure 4 for Robust Offline Policy Evaluation and Optimization with Heavy-Tailed Rewards
Viaarxiv icon

Off-policy Evaluation in Doubly Inhomogeneous Environments

Add code
Bookmark button
Alert button
Jun 14, 2023
Zeyu Bian, Chengchun Shi, Zhengling Qi, Lan Wang

Figure 1 for Off-policy Evaluation in Doubly Inhomogeneous Environments
Figure 2 for Off-policy Evaluation in Doubly Inhomogeneous Environments
Figure 3 for Off-policy Evaluation in Doubly Inhomogeneous Environments
Figure 4 for Off-policy Evaluation in Doubly Inhomogeneous Environments
Viaarxiv icon

A Policy Gradient Method for Confounded POMDPs

Add code
Bookmark button
Alert button
May 26, 2023
Mao Hong, Zhengling Qi, Yanxun Xu

Figure 1 for A Policy Gradient Method for Confounded POMDPs
Viaarxiv icon

Sequential Knockoffs for Variable Selection in Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 24, 2023
Tao Ma, Hengrui Cai, Zhengling Qi, Chengchun Shi, Eric B. Laber

Figure 1 for Sequential Knockoffs for Variable Selection in Reinforcement Learning
Figure 2 for Sequential Knockoffs for Variable Selection in Reinforcement Learning
Figure 3 for Sequential Knockoffs for Variable Selection in Reinforcement Learning
Figure 4 for Sequential Knockoffs for Variable Selection in Reinforcement Learning
Viaarxiv icon

Personalized Pricing with Invalid Instrumental Variables: Identification, Estimation, and Policy Learning

Add code
Bookmark button
Alert button
Feb 24, 2023
Rui Miao, Zhengling Qi, Cong Shi, Lin Lin

Figure 1 for Personalized Pricing with Invalid Instrumental Variables: Identification, Estimation, and Policy Learning
Figure 2 for Personalized Pricing with Invalid Instrumental Variables: Identification, Estimation, and Policy Learning
Figure 3 for Personalized Pricing with Invalid Instrumental Variables: Identification, Estimation, and Policy Learning
Figure 4 for Personalized Pricing with Invalid Instrumental Variables: Identification, Estimation, and Policy Learning
Viaarxiv icon

PASTA: Pessimistic Assortment Optimization

Add code
Bookmark button
Alert button
Feb 08, 2023
Juncheng Dong, Weibin Mo, Zhengling Qi, Cong Shi, Ethan X. Fang, Vahid Tarokh

Figure 1 for PASTA: Pessimistic Assortment Optimization
Figure 2 for PASTA: Pessimistic Assortment Optimization
Figure 3 for PASTA: Pessimistic Assortment Optimization
Viaarxiv icon

STEEL: Singularity-aware Reinforcement Learning

Add code
Bookmark button
Alert button
Jan 31, 2023
Xiaohong Chen, Zhengling Qi, Runzhe Wan

Figure 1 for STEEL: Singularity-aware Reinforcement Learning
Figure 2 for STEEL: Singularity-aware Reinforcement Learning
Figure 3 for STEEL: Singularity-aware Reinforcement Learning
Figure 4 for STEEL: Singularity-aware Reinforcement Learning
Viaarxiv icon

Value Enhancement of Reinforcement Learning via Efficient and Robust Trust Region Optimization

Add code
Bookmark button
Alert button
Jan 05, 2023
Chengchun Shi, Zhengling Qi, Jianing Wang, Fan Zhou

Figure 1 for Value Enhancement of Reinforcement Learning via Efficient and Robust Trust Region Optimization
Figure 2 for Value Enhancement of Reinforcement Learning via Efficient and Robust Trust Region Optimization
Figure 3 for Value Enhancement of Reinforcement Learning via Efficient and Robust Trust Region Optimization
Figure 4 for Value Enhancement of Reinforcement Learning via Efficient and Robust Trust Region Optimization
Viaarxiv icon

Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information

Add code
Bookmark button
Alert button
Dec 23, 2022
Zuyue Fu, Zhengling Qi, Zhuoran Yang, Zhaoran Wang, Lan Wang

Figure 1 for Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
Figure 2 for Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
Figure 3 for Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
Figure 4 for Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
Viaarxiv icon