Alert button
Picture for Haipeng Luo

Haipeng Luo

Alert button

Tractable Local Equilibria in Non-Concave Games

Add code
Bookmark button
Alert button
Mar 13, 2024
Yang Cai, Constantinos Daskalakis, Haipeng Luo, Chen-Yu Wei, Weiqiang Zheng

Figure 1 for Tractable Local Equilibria in Non-Concave Games
Figure 2 for Tractable Local Equilibria in Non-Concave Games
Figure 3 for Tractable Local Equilibria in Non-Concave Games
Figure 4 for Tractable Local Equilibria in Non-Concave Games
Viaarxiv icon

Contextual Multinomial Logit Bandits with General Value Functions

Add code
Bookmark button
Alert button
Feb 18, 2024
Mengxiao Zhang, Haipeng Luo

Viaarxiv icon

Efficient Contextual Bandits with Uninformed Feedback Graphs

Add code
Bookmark button
Alert button
Feb 12, 2024
Mengxiao Zhang, Yuheng Zhang, Haipeng Luo, Paul Mineiro

Viaarxiv icon

Near-Optimal Policy Optimization for Correlated Equilibrium in General-Sum Markov Games

Add code
Bookmark button
Alert button
Jan 26, 2024
Yang Cai, Haipeng Luo, Chen-Yu Wei, Weiqiang Zheng

Viaarxiv icon

Last-Iterate Convergence Properties of Regret-Matching Algorithms in Games

Add code
Bookmark button
Alert button
Nov 01, 2023
Yang Cai, Gabriele Farina, Julien Grand-Clément, Christian Kroer, Chung-Wei Lee, Haipeng Luo, Weiqiang Zheng

Viaarxiv icon

Online Learning in Contextual Second-Price Pay-Per-Click Auctions

Add code
Bookmark button
Alert button
Oct 08, 2023
Mengxiao Zhang, Haipeng Luo

Figure 1 for Online Learning in Contextual Second-Price Pay-Per-Click Auctions
Viaarxiv icon

WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct

Add code
Bookmark button
Alert button
Aug 18, 2023
Haipeng Luo, Qingfeng Sun, Can Xu, Pu Zhao, Jianguang Lou, Chongyang Tao, Xiubo Geng, Qingwei Lin, Shifeng Chen, Dongmei Zhang

Figure 1 for WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Figure 2 for WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Figure 3 for WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Figure 4 for WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Viaarxiv icon

No-Regret Online Reinforcement Learning with Adversarial Losses and Transitions

Add code
Bookmark button
Alert button
May 30, 2023
Tiancheng Jin, Junyan Liu, Chloé Rouyer, William Chang, Chen-Yu Wei, Haipeng Luo

Viaarxiv icon

Regret Matching+: (In)Stability and Fast Convergence in Games

Add code
Bookmark button
Alert button
May 24, 2023
Gabriele Farina, Julien Grand-Clément, Christian Kroer, Chung-Wei Lee, Haipeng Luo

Figure 1 for Regret Matching+: (In)Stability and Fast Convergence in Games
Figure 2 for Regret Matching+: (In)Stability and Fast Convergence in Games
Figure 3 for Regret Matching+: (In)Stability and Fast Convergence in Games
Figure 4 for Regret Matching+: (In)Stability and Fast Convergence in Games
Viaarxiv icon