Optimistic Mirror Descent


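Optimistic mirror descent is an online optimization algorithm that augments mirror descent with an optimistic prediction (a hint) of the next gradient, typically the most recently observed gradient. When the predictions are accurate, for example when gradients change slowly between rounds or when all players in a smooth game run the method, it achieves smaller regret and faster convergence than standard mirror descent.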
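
As a rough illustration of the idea, the following Python sketch runs optimistic mirror descent with an entropic mirror map (an optimistic multiplicative-weights update) on the matching pennies game. The game, step size, starting points, and all function names are assumptions chosen for illustration; none of them come from the papers listed on this page. Each round uses the previous round's gradient as the hint.

```python
import math


def entropic_step(base, grad, eta):
    """Mirror step for the negative-entropy mirror map on the simplex.

    Solves argmin_x eta * <grad, x> + KL(x || base), which reduces to a
    multiplicative-weights update followed by normalization.
    """
    weights = [b * math.exp(-eta * g) for b, g in zip(base, grad)]
    total = sum(weights)
    return [w / total for w in weights]


def solve_matching_pennies(steps=3000, eta=0.1, optimistic=True):
    # Row player minimizes x^T A y, column player maximizes it.
    A = [[1.0, -1.0], [-1.0, 1.0]]
    # Secondary iterates, started away from the uniform equilibrium (assumed values).
    x_hat, y_hat = [0.8, 0.2], [0.3, 0.7]
    # Hints: predictions of the next gradient. Zero hints give the plain update.
    hint_x, hint_y = [0.0, 0.0], [0.0, 0.0]
    x, y = x_hat, y_hat
    for _ in range(steps):
        # 1) Play a point obtained by stepping along the hint.
        x = entropic_step(x_hat, hint_x, eta)
        y = entropic_step(y_hat, hint_y, eta)
        # 2) Observe the true gradients at the played points.
        grad_x = [sum(A[i][j] * y[j] for j in range(2)) for i in range(2)]   # A y
        grad_y = [-sum(A[i][j] * x[i] for i in range(2)) for j in range(2)]  # -A^T x
        # 3) Update the secondary iterates with the true gradients.
        x_hat = entropic_step(x_hat, grad_x, eta)
        y_hat = entropic_step(y_hat, grad_y, eta)
        # 4) Optimism: reuse the latest gradient as the hint for the next round.
        if optimistic:
            hint_x, hint_y = grad_x, grad_y
    return x, y


def distance_to_uniform(x, y):
    """Largest deviation of either player's strategy from (0.5, 0.5)."""
    return max(abs(p - 0.5) for p in x + y)


if __name__ == "__main__":
    x, y = solve_matching_pennies(optimistic=True)
    print("optimistic:", distance_to_uniform(x, y))
    x, y = solve_matching_pennies(optimistic=False)
    print("plain:", distance_to_uniform(x, y))
```

With `optimistic=False` the hints stay at zero and the loop reduces to plain multiplicative weights, whose last iterate is known not to converge in this game; with the hint enabled, the played strategies should approach the uniform equilibrium. The two-sequence form (a played iterate plus a secondary iterate) is one common way to write the method, not the only one.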

An Optimistic Algorithm for Online CMDPs with Anytime Adversarial Constraints

May 28, 2025
Via arXiv

Improved Impossible Tuning and Lipschitz-Adaptive Universal Online Learning with Gradient Variations

May 27, 2025
Via arXiv

Improving LLM General Preference Alignment via Optimistic Online Mirror Descent

Feb 24, 2025
Via arXiv

Online Optimization for Learning to Communicate over Time-Correlated Channels

Sep 01, 2024
Via arXiv

Gradient-Variation Online Learning under Generalized Smoothness

Aug 17, 2024
Via arXiv

Minimizing Weighted Counterfactual Regret with Optimistic Online Mirror Descent

Apr 22, 2024
Via arXiv

Online Saddle Point Problem and Online Convex-Concave Optimization

Dec 15, 2023
Via arXiv

Optimistic Online Mirror Descent for Bridging Stochastic and Adversarial Online Convex Optimization

Feb 09, 2023
Via arXiv

Breaking the Curse of Multiagency: Provably Efficient Decentralized Multi-Agent RL with Function Approximation

Mar 02, 2023
Via arXiv

The Power of Regularization in Solving Extensive-Form Games

Jun 19, 2022
Via arXiv