Haipeng Luo

Policy Optimization for Stochastic Shortest Path

Feb 07, 2022

Kernelized Multiplicative Weights for 0/1-Polyhedral Games: Bridging the Gap Between Learning in Extensive-Form and Normal-Form Games

Feb 01, 2022

Learning Infinite-Horizon Average-Reward Markov Decision Processes with Constraints

Jan 31, 2022

Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback

Jan 31, 2022

No-Regret Learning in Time-Varying Zero-Sum Games

Jan 30, 2022

Improved No-Regret Algorithms for Stochastic Shortest Path with Linear MDP

Dec 18, 2021

Policy Optimization in Adversarial MDPs: Improved Exploration via Dilated Bonuses

Jul 18, 2021

Last-iterate Convergence in Extensive-Form Games

Jun 27, 2021

Implicit Finite-Horizon Approximation and Efficient Optimal Algorithms for Stochastic Shortest Path

Jun 15, 2021

Online Learning for Stochastic Shortest Path Model via Posterior Sampling

Jun 09, 2021