
Haipeng Luo


Impossible Tuning Made Possible: A New Expert Algorithm and Its Applications

Feb 01, 2021
Liyu Chen, Haipeng Luo, Chen-Yu Wei

Via arXiv

Minimax Regret for Stochastic Shortest Path with Adversarial Costs and Known Transition

Dec 07, 2020
Liyu Chen, Haipeng Luo, Chen-Yu Wei

Via arXiv

Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation

Jul 23, 2020
Chen-Yu Wei, Mehdi Jafarnia-Jahromi, Haipeng Luo, Rahul Jain

Via arXiv

Comparator-adaptive Convex Bandits

Jul 16, 2020
Dirk van der Hoeven, Ashok Cutkosky, Haipeng Luo

Via arXiv

Active Online Domain Adaptation

Jun 25, 2020
Yining Chen, Haipeng Luo, Tengyu Ma, Chicheng Zhang

Via arXiv

Open Problem: Model Selection for Contextual Bandits

Jun 19, 2020
Dylan J. Foster, Akshay Krishnamurthy, Haipeng Luo

Via arXiv

Linear Last-iterate Convergence for Matrix Games and Stochastic Games

Jun 16, 2020
Chung-Wei Lee, Haipeng Luo, Chen-Yu Wei, Mengxiao Zhang

Via arXiv

Bias no more: high-probability data-dependent regret bounds for adversarial bandits and MDPs

Jun 14, 2020
Chung-Wei Lee, Haipeng Luo, Chen-Yu Wei, Mengxiao Zhang

Via arXiv

Simultaneously Learning Stochastic and Adversarial Episodic MDPs with Known Transition

Jun 10, 2020
Tiancheng Jin, Haipeng Luo

Via arXiv

A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret

Jun 08, 2020
Mehdi Jafarnia-Jahromi, Chen-Yu Wei, Rahul Jain, Haipeng Luo

Via arXiv