Picture for Mengxiao Zhang

Mengxiao Zhang

CRIStAL

Near-Optimal Stochastic Linear Bandits with Delay

Add code
Jun 15, 2026
Viaarxiv icon

Near-Optimal Last-Iterate Convergence for Zero-Sum Games with Bandit Feedback and Opponent Actions

Add code
May 10, 2026
Viaarxiv icon

Pricing Query Complexity of Multiplicative Revenue Approximation

Add code
Feb 11, 2026
Viaarxiv icon

Interaction-Grounded Learning for Contextual Markov Decision Processes with Personalized Feedback

Add code
Feb 09, 2026
Viaarxiv icon

Parameter-free Dynamic Regret: Time-varying Movement Costs, Delayed Feedback, and Memory

Add code
Feb 06, 2026
Viaarxiv icon

Near-Optimal Regret for Distributed Adversarial Bandits: A Black-Box Approach

Add code
Feb 06, 2026
Viaarxiv icon

Decentralized Online Convex Optimization with Unknown Feedback Delays

Add code
Jan 12, 2026
Viaarxiv icon

Exploiting Curvature in Online Convex Optimization with Delayed Feedback

Add code
Jun 09, 2025
Viaarxiv icon

Comparator-Adaptive $Φ$-Regret: Improved Bounds, Simpler Algorithms, and Applications to Games

Add code
May 22, 2025
Viaarxiv icon

Contextual Linear Bandits with Delay as Payoff

Add code
Feb 20, 2025
Figure 1 for Contextual Linear Bandits with Delay as Payoff
Viaarxiv icon