Alert button
Picture for Haipeng Luo

Haipeng Luo

Alert button

Adversarial Online Learning with Changing Action Sets: Efficient Algorithms with Approximate Regret Bounds

Add code
Bookmark button
Alert button
Mar 07, 2020
Ehsan Emamjomeh-Zadeh, Chen-Yu Wei, Haipeng Luo, David Kempe

Figure 1 for Adversarial Online Learning with Changing Action Sets: Efficient Algorithms with Approximate Regret Bounds
Viaarxiv icon

Taking a hint: How to leverage loss predictors in contextual bandits?

Add code
Bookmark button
Alert button
Mar 04, 2020
Chen-Yu Wei, Haipeng Luo, Alekh Agarwal

Figure 1 for Taking a hint: How to leverage loss predictors in contextual bandits?
Figure 2 for Taking a hint: How to leverage loss predictors in contextual bandits?
Viaarxiv icon

A Closer Look at Small-loss Bounds for Bandits with Graph Feedback

Add code
Bookmark button
Alert button
Feb 02, 2020
Chung-Wei Lee, Haipeng Luo, Mengxiao Zhang

Figure 1 for A Closer Look at Small-loss Bounds for Bandits with Graph Feedback
Figure 2 for A Closer Look at Small-loss Bounds for Bandits with Graph Feedback
Viaarxiv icon

Learning Adversarial MDPs with Bandit Feedback and Unknown Transition

Add code
Bookmark button
Alert button
Jan 07, 2020
Chi Jin, Tiancheng Jin, Haipeng Luo, Suvrit Sra, Tiancheng Yu

Viaarxiv icon

Fair Contextual Multi-Armed Bandits: Theory and Experiments

Add code
Bookmark button
Alert button
Dec 13, 2019
Yifang Chen, Alex Cuellar, Haipeng Luo, Jignesh Modi, Heramb Nemlekar, Stefanos Nikolaidis

Figure 1 for Fair Contextual Multi-Armed Bandits: Theory and Experiments
Figure 2 for Fair Contextual Multi-Armed Bandits: Theory and Experiments
Figure 3 for Fair Contextual Multi-Armed Bandits: Theory and Experiments
Figure 4 for Fair Contextual Multi-Armed Bandits: Theory and Experiments
Viaarxiv icon

Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes

Add code
Bookmark button
Alert button
Oct 15, 2019
Chen-Yu Wei, Mehdi Jafarnia-Jahromi, Haipeng Luo, Hiteshi Sharma, Rahul Jain

Figure 1 for Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Figure 2 for Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Viaarxiv icon

Model selection for contextual bandits

Add code
Bookmark button
Alert button
Jun 03, 2019
Dylan J. Foster, Akshay Krishnamurthy, Haipeng Luo

Viaarxiv icon

Equipping Experts/Bandits with Long-term Memory

Add code
Bookmark button
Alert button
May 30, 2019
Kai Zheng, Haipeng Luo, Ilias Diakonikolas, Liwei Wang

Viaarxiv icon