Picture for Patrick Rebeschini

Patrick Rebeschini

University of Oxford

On the necessity of adaptive regularisation:Optimal anytime online learning on $\boldsymbol{\ell_p}$-balls

Add code
Jun 24, 2025
Viaarxiv icon

Early-Stopped Mirror Descent for Linear Regression over Convex Bodies

Add code
Mar 05, 2025
Viaarxiv icon

Black-Box Uniform Stability for Non-Euclidean Empirical Risk Minimization

Add code
Dec 20, 2024
Figure 1 for Black-Box Uniform Stability for Non-Euclidean Empirical Risk Minimization
Viaarxiv icon

Learning Loss Landscapes in Preference Optimization

Add code
Nov 10, 2024
Figure 1 for Learning Loss Landscapes in Preference Optimization
Figure 2 for Learning Loss Landscapes in Preference Optimization
Figure 3 for Learning Loss Landscapes in Preference Optimization
Figure 4 for Learning Loss Landscapes in Preference Optimization
Viaarxiv icon

Robust Gradient Descent for Phase Retrieval

Add code
Oct 14, 2024
Figure 1 for Robust Gradient Descent for Phase Retrieval
Viaarxiv icon

Differentiable Cost-Parameterized Monge Map Estimators

Add code
Jun 12, 2024
Viaarxiv icon

Meta-learning the mirror map in policy mirror descent

Add code
Feb 07, 2024
Figure 1 for Meta-learning the mirror map in policy mirror descent
Figure 2 for Meta-learning the mirror map in policy mirror descent
Figure 3 for Meta-learning the mirror map in policy mirror descent
Figure 4 for Meta-learning the mirror map in policy mirror descent
Viaarxiv icon

Generalization Bounds for Label Noise Stochastic Gradient Descent

Add code
Nov 01, 2023
Figure 1 for Generalization Bounds for Label Noise Stochastic Gradient Descent
Viaarxiv icon

Sample-Efficiency in Multi-Batch Reinforcement Learning: The Need for Dimension-Dependent Adaptivity

Add code
Oct 02, 2023
Viaarxiv icon

Optimal Convergence Rate for Exact Policy Mirror Descent in Discounted Markov Decision Processes

Add code
Feb 22, 2023
Figure 1 for Optimal Convergence Rate for Exact Policy Mirror Descent in Discounted Markov Decision Processes
Figure 2 for Optimal Convergence Rate for Exact Policy Mirror Descent in Discounted Markov Decision Processes
Viaarxiv icon