Qiwei Di

Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback

Apr 16, 2024
Qiwei Di, Jiafan He, Quanquan Gu

Via arXiv

Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path

Feb 14, 2024
Qiwei Di, Jiafan He, Dongruo Zhou, Quanquan Gu

Via arXiv

Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning

Oct 02, 2023
Qiwei Di, Heyang Zhao, Jiafan He, Quanquan Gu

Via arXiv

Variance-Aware Regret Bounds for Stochastic Contextual Dueling Bandits

Oct 02, 2023
Qiwei Di, Tao Jin, Yue Wu, Heyang Zhao, Farzad Farnoud, Quanquan Gu

Via arXiv