Optimistic Mirror Descent


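Optimistic mirror descent is an online optimization algorithm that augments mirror descent with a prediction (hint) of the next gradient, typically the most recently observed one. When successive gradients vary slowly, as in repeated zero-sum games, this yields faster convergence rates than standard mirror descent.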
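As a rough illustration of the idea, the sketch below runs optimistic mirror descent with an entropic mirror map (often called optimistic Hedge) for both players of a small zero-sum matrix game. It is a minimal sketch under stated assumptions, not the method of any paper listed here: the function names `normalize` and `optimistic_hedge_game`, the step size `eta`, the payoff matrix, and the choice of the last observed loss as the hint are all illustrative.

```python
import numpy as np


def normalize(w):
    """Rescale a positive vector so it lies on the probability simplex."""
    return w / w.sum()


def optimistic_hedge_game(A, x0, y0, steps=5000, eta=0.1):
    """Hypothetical helper: optimistic Hedge for min_x max_y x^T A y.

    Each player keeps a secondary iterate z and plays an optimistic
    iterate obtained by stepping from z along a hint (assumed here to be
    the previously observed loss vector).
    """
    z_x, z_y = x0.copy(), y0.copy()   # secondary iterates
    hint_x = np.zeros(A.shape[0])     # predicted loss, row player
    hint_y = np.zeros(A.shape[1])     # predicted loss, column player
    avg_x = np.zeros_like(x0)
    avg_y = np.zeros_like(y0)

    for _ in range(steps):
        # Optimistic step: move from the secondary iterate using the hint.
        x = normalize(z_x * np.exp(-eta * hint_x))
        y = normalize(z_y * np.exp(-eta * hint_y))

        # Observe the true loss vectors at the played strategies.
        g_x = A @ y          # row player minimizes x^T A y
        g_y = -(A.T @ x)     # column player maximizes, so its loss is negated

        # Ordinary mirror descent step on the secondary iterate.
        z_x = normalize(z_x * np.exp(-eta * g_x))
        z_y = normalize(z_y * np.exp(-eta * g_y))

        # The next hint is the loss just observed.
        hint_x, hint_y = g_x, g_y

        avg_x += x
        avg_y += y

    avg_x /= steps
    avg_y /= steps
    # Duality gap of the averaged strategies (zero at a Nash equilibrium).
    gap = (avg_x @ A).max() - (A @ avg_y).min()
    return avg_x, avg_y, gap


# Usage: a matching-pennies style game started away from equilibrium.
A = np.array([[1.0, -1.0], [-1.0, 1.0]])
avg_x, avg_y, gap = optimistic_hedge_game(
    A, np.array([0.9, 0.1]), np.array([0.2, 0.8])
)
print(avg_x, avg_y, gap)
```

Setting both hints to zero at every round would reduce this to plain Hedge, so the hint is the only ingredient that makes the update optimistic.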

Scale-Invariant Regret Matching and Online Learning with Optimal Convergence: Bridging Theory and Practice in Zero-Sum Games

Oct 06, 2025

An Optimistic Algorithm for online CMDPS with Anytime Adversarial Constraints

May 28, 2025

Generalized Linear Bandits: Almost Optimal Regret with One-Pass Update

Jul 16, 2025

Improved Impossible Tuning and Lipschitz-Adaptive Universal Online Learning with Gradient Variations

May 27, 2025

Improving LLM General Preference Alignment via Optimistic Online Mirror Descent

Feb 24, 2025

Gradient-Variation Online Learning under Generalized Smoothness

Aug 17, 2024

Online Optimization for Learning to Communicate over Time-Correlated Channels

Sep 01, 2024

Minimizing Weighted Counterfactual Regret with Optimistic Online Mirror Descent

Apr 22, 2024

Online Saddle Point Problem and Online Convex-Concave Optimization

Dec 15, 2023

Optimistic Online Mirror Descent for Bridging Stochastic and Adversarial Online Convex Optimization

Feb 09, 2023