Pierre Ablin

Optimization without retraction on the random generalized Stiefel manifold

May 02, 2024
Simon Vary, Pierre Ablin, Bin Gao, P.-A. Absil

Via arXiv

Enhancing Hypergradients Estimation: A Study of Preconditioning and Reparameterization

Feb 26, 2024
Zhenzhang Ye, Gabriel Peyré, Daniel Cremers, Pierre Ablin

Via arXiv

Careful with that Scalpel: Improving Gradient Surgery with an EMA

Feb 05, 2024
Yu-Guan Hsieh, James Thornton, Eugene Ndiaye, Michal Klein, Marco Cuturi, Pierre Ablin

Via arXiv

Specialized Language Models with Cheap Inference from Limited Domain Data

Feb 02, 2024
David Grangier, Angelos Katharopoulos, Pierre Ablin, Awni Hannun

Via arXiv

Understanding the Regularity of Self-Attention with Optimal Transport

Dec 22, 2023
Valérie Castin, Pierre Ablin, Gabriel Peyré

Via arXiv

MultiView Independent Component Analysis with Delays

Dec 01, 2023
Ambroise Heurtebise, Pierre Ablin, Alexandre Gramfort

Via arXiv

Adaptive Training Distributions with Scalable Online Bilevel Optimization

Nov 20, 2023
David Grangier, Pierre Ablin, Awni Hannun

Via arXiv

A Challenge in Reweighting Data with Bilevel Optimization

Oct 26, 2023
Anastasia Ivanova, Pierre Ablin

Via arXiv

How to Scale Your EMA

Jul 27, 2023
Dan Busbridge, Jason Ramapuram, Pierre Ablin, Tatiana Likhomanenko, Eeshan Gunesh Dhekane, Xavier Suau, Russ Webb

Via arXiv

Learning Costs for Structured Monge Displacements

Jun 20, 2023
Michal Klein, Aram-Alexandre Pooladian, Pierre Ablin, Eugène Ndiaye, Jonathan Niles-Weed, Marco Cuturi

Via arXiv