Ruohan Zhan

Proportional Response: Contextual Bandits for Simple and Cumulative Regret Minimization

Jul 05, 2023
Sanath Kumar Krishnamurthy, Ruohan Zhan, Susan Athey, Emma Brunskill

Post-Episodic Reinforcement Learning Inference

Feb 17, 2023
Vasilis Syrgkanis, Ruohan Zhan

Two-Stage Constrained Actor-Critic for Short Video Recommendation

Feb 06, 2023
Qingpeng Cai, Zhenghai Xue, Chi Zhang, Wanqi Xue, Shuchang Liu, Ruohan Zhan, Xueliang Wang, Tianyou Zuo, Wentao Xie, Dong Zheng, Peng Jiang, Kun Gai

Deconfounding Duration Bias in Watch-time Prediction for Video Recommendation

Jun 13, 2022
Ruohan Zhan, Changhua Pei, Qiang Su, Jianfeng Wen, Xueliang Wang, Guanyu Mu, Dong Zheng, Peng Jiang

ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor

Jun 01, 2022
Wanqi Xue, Qingpeng Cai, Ruohan Zhan, Dong Zheng, Peng Jiang, Bo An

Constrained Reinforcement Learning for Short Video Recommendation

May 26, 2022
Qingpeng Cai, Ruohan Zhan, Chi Zhang, Jie Zheng, Guangwei Ding, Pinghua Gong, Dong Zheng, Peng Jiang

Off-Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits

Jun 10, 2021
Ruohan Zhan, Vitor Hadad, David A. Hirshberg, Susan Athey

Towards Content Provider Aware Recommender Systems: A Simulation Study on the Interplay between User and Provider Utilities

May 06, 2021
Ruohan Zhan, Konstantina Christakopoulou, Ya Le, Jayden Ooi, Martin Mladenov, Alex Beutel, Craig Boutilier, Ed H. Chi, Minmin Chen

Policy Learning with Adaptively Collected Data

May 05, 2021
Ruohan Zhan, Zhimei Ren, Susan Athey, Zhengyuan Zhou
