Picture for Haipeng Luo

Haipeng Luo

STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability

Add code
Jun 17, 2026
Viaarxiv icon

Toward Simultaneously Optimal Regret in U-Calibration

Add code
Jun 16, 2026
Viaarxiv icon

Accelerating Min-Max Optimization via Power-Law Stepsizes

Add code
Jun 01, 2026
Viaarxiv icon

Adaptive Calibration in Non-Stationary Environments

Add code
May 12, 2026
Viaarxiv icon

Near-Optimal Last-Iterate Convergence for Zero-Sum Games with Bandit Feedback and Opponent Actions

Add code
May 10, 2026
Viaarxiv icon

Calibeating Made Simple

Add code
Mar 23, 2026
Viaarxiv icon

A Short Note on a Variant of the Squint Algorithm

Add code
Mar 03, 2026
Viaarxiv icon

One Good Source is All You Need: Near-Optimal Regret for Bandits under Heterogeneous Noise

Add code
Feb 16, 2026
Viaarxiv icon

Scale-Invariant Fast Convergence in Games

Add code
Feb 12, 2026
Viaarxiv icon

Is Online Linear Optimization Sufficient for Strategic Robustness?

Add code
Feb 12, 2026
Viaarxiv icon