Picture for Pierre Ablin

Pierre Ablin

Ecole normale supérieure, Paris, France

Optimization without retraction on the random generalized Stiefel manifold

Add code
May 02, 2024
Viaarxiv icon

Enhancing Hypergradients Estimation: A Study of Preconditioning and Reparameterization

Add code
Feb 26, 2024
Figure 1 for Enhancing Hypergradients Estimation: A Study of Preconditioning and Reparameterization
Figure 2 for Enhancing Hypergradients Estimation: A Study of Preconditioning and Reparameterization
Figure 3 for Enhancing Hypergradients Estimation: A Study of Preconditioning and Reparameterization
Figure 4 for Enhancing Hypergradients Estimation: A Study of Preconditioning and Reparameterization
Viaarxiv icon

Careful with that Scalpel: Improving Gradient Surgery with an EMA

Add code
Feb 05, 2024
Viaarxiv icon

Specialized Language Models with Cheap Inference from Limited Domain Data

Add code
Feb 02, 2024
Viaarxiv icon

Understanding the Regularity of Self-Attention with Optimal Transport

Add code
Dec 22, 2023
Figure 1 for Understanding the Regularity of Self-Attention with Optimal Transport
Figure 2 for Understanding the Regularity of Self-Attention with Optimal Transport
Viaarxiv icon

MultiView Independent Component Analysis with Delays

Add code
Dec 01, 2023
Figure 1 for MultiView Independent Component Analysis with Delays
Figure 2 for MultiView Independent Component Analysis with Delays
Figure 3 for MultiView Independent Component Analysis with Delays
Figure 4 for MultiView Independent Component Analysis with Delays
Viaarxiv icon

Adaptive Training Distributions with Scalable Online Bilevel Optimization

Add code
Nov 20, 2023
Figure 1 for Adaptive Training Distributions with Scalable Online Bilevel Optimization
Figure 2 for Adaptive Training Distributions with Scalable Online Bilevel Optimization
Figure 3 for Adaptive Training Distributions with Scalable Online Bilevel Optimization
Figure 4 for Adaptive Training Distributions with Scalable Online Bilevel Optimization
Viaarxiv icon

A Challenge in Reweighting Data with Bilevel Optimization

Add code
Oct 26, 2023
Figure 1 for A Challenge in Reweighting Data with Bilevel Optimization
Figure 2 for A Challenge in Reweighting Data with Bilevel Optimization
Figure 3 for A Challenge in Reweighting Data with Bilevel Optimization
Figure 4 for A Challenge in Reweighting Data with Bilevel Optimization
Viaarxiv icon

How to Scale Your EMA

Add code
Jul 27, 2023
Figure 1 for How to Scale Your EMA
Figure 2 for How to Scale Your EMA
Figure 3 for How to Scale Your EMA
Figure 4 for How to Scale Your EMA
Viaarxiv icon

Learning Costs for Structured Monge Displacements

Add code
Jun 20, 2023
Figure 1 for Learning Costs for Structured Monge Displacements
Figure 2 for Learning Costs for Structured Monge Displacements
Figure 3 for Learning Costs for Structured Monge Displacements
Figure 4 for Learning Costs for Structured Monge Displacements
Viaarxiv icon