Picture for Mengxiao Zhang

Mengxiao Zhang

CRIStAL

Interaction-Grounded Learning for Contextual Markov Decision Processes with Personalized Feedback

Add code
Feb 09, 2026
Viaarxiv icon

Parameter-free Dynamic Regret: Time-varying Movement Costs, Delayed Feedback, and Memory

Add code
Feb 06, 2026
Viaarxiv icon

Near-Optimal Regret for Distributed Adversarial Bandits: A Black-Box Approach

Add code
Feb 06, 2026
Viaarxiv icon

Decentralized Online Convex Optimization with Unknown Feedback Delays

Add code
Jan 12, 2026
Viaarxiv icon

Exploiting Curvature in Online Convex Optimization with Delayed Feedback

Add code
Jun 09, 2025
Viaarxiv icon

Comparator-Adaptive $Φ$-Regret: Improved Bounds, Simpler Algorithms, and Applications to Games

Add code
May 22, 2025
Viaarxiv icon

Contextual Linear Bandits with Delay as Payoff

Add code
Feb 20, 2025
Figure 1 for Contextual Linear Bandits with Delay as Payoff
Viaarxiv icon

Alternating Regret for Online Convex Optimization

Add code
Feb 18, 2025
Viaarxiv icon

Data Pricing for Graph Neural Networks without Pre-purchased Inspection

Add code
Feb 12, 2025
Figure 1 for Data Pricing for Graph Neural Networks without Pre-purchased Inspection
Figure 2 for Data Pricing for Graph Neural Networks without Pre-purchased Inspection
Figure 3 for Data Pricing for Graph Neural Networks without Pre-purchased Inspection
Figure 4 for Data Pricing for Graph Neural Networks without Pre-purchased Inspection
Viaarxiv icon

No-Regret Learning for Fair Multi-Agent Social Welfare Optimization

Add code
May 31, 2024
Viaarxiv icon