Picture for Haipeng Luo

Haipeng Luo

Interaction-Grounded Learning for Contextual Markov Decision Processes with Personalized Feedback

Add code
Feb 09, 2026
Viaarxiv icon

Adversarial Learning in Games with Bandit Feedback: Logarithmic Pure-Strategy Maximin Regret

Add code
Feb 06, 2026
Viaarxiv icon

Online Learning for Uninformed Markov Games: Empirical Nash-Value Regret and Non-Stationarity Adaptation

Add code
Feb 06, 2026
Viaarxiv icon

Swap Regret Minimization Through Response-Based Approachability

Add code
Feb 05, 2026
Viaarxiv icon

AgentMath: Empowering Mathematical Reasoning for Large Language Models via Tool-Augmented Agent

Add code
Dec 23, 2025
Viaarxiv icon

Efficient Swap Multicalibration of Elicitable Properties

Add code
Nov 07, 2025
Figure 1 for Efficient Swap Multicalibration of Elicitable Properties
Viaarxiv icon

Reinforcement Learning from Adversarial Preferences in Tabular MDPs

Add code
Jul 15, 2025
Viaarxiv icon

Improved Bounds for Swap Multicalibration and Swap Omniprediction

Add code
May 28, 2025
Viaarxiv icon

Improved Regret and Contextual Linear Extension for Pandora's Box and Prophet Inequality

Add code
May 24, 2025
Viaarxiv icon

Comparator-Adaptive $Φ$-Regret: Improved Bounds, Simpler Algorithms, and Applications to Games

Add code
May 22, 2025
Viaarxiv icon