Alert button
Picture for Michael E. Sander

Michael E. Sander

Alert button

How do Transformers perform In-Context Autoregressive Learning?

Add code
Bookmark button
Alert button
Feb 08, 2024
Michael E. Sander, Raja Giryes, Taiji Suzuki, Mathieu Blondel, Gabriel Peyré

Viaarxiv icon

Implicit regularization of deep residual networks towards neural ODEs

Add code
Bookmark button
Alert button
Sep 03, 2023
Pierre Marion, Yu-Han Wu, Michael E. Sander, Gérard Biau

Viaarxiv icon

Fast, Differentiable and Sparse Top-k: a Convex Analysis Perspective

Add code
Bookmark button
Alert button
Feb 06, 2023
Michael E. Sander, Joan Puigcerver, Josip Djolonga, Gabriel Peyré, Mathieu Blondel

Figure 1 for Fast, Differentiable and Sparse Top-k: a Convex Analysis Perspective
Figure 2 for Fast, Differentiable and Sparse Top-k: a Convex Analysis Perspective
Figure 3 for Fast, Differentiable and Sparse Top-k: a Convex Analysis Perspective
Figure 4 for Fast, Differentiable and Sparse Top-k: a Convex Analysis Perspective
Viaarxiv icon

Vision Transformers provably learn spatial structure

Add code
Bookmark button
Alert button
Oct 13, 2022
Samy Jelassi, Michael E. Sander, Yuanzhi Li

Figure 1 for Vision Transformers provably learn spatial structure
Figure 2 for Vision Transformers provably learn spatial structure
Figure 3 for Vision Transformers provably learn spatial structure
Figure 4 for Vision Transformers provably learn spatial structure
Viaarxiv icon

Do Residual Neural Networks discretize Neural Ordinary Differential Equations?

Add code
Bookmark button
Alert button
May 29, 2022
Michael E. Sander, Pierre Ablin, Gabriel Peyré

Figure 1 for Do Residual Neural Networks discretize Neural Ordinary Differential Equations?
Figure 2 for Do Residual Neural Networks discretize Neural Ordinary Differential Equations?
Figure 3 for Do Residual Neural Networks discretize Neural Ordinary Differential Equations?
Figure 4 for Do Residual Neural Networks discretize Neural Ordinary Differential Equations?
Viaarxiv icon

Sinkformers: Transformers with Doubly Stochastic Attention

Add code
Bookmark button
Alert button
Oct 22, 2021
Michael E. Sander, Pierre Ablin, Mathieu Blondel, Gabriel Peyré

Figure 1 for Sinkformers: Transformers with Doubly Stochastic Attention
Figure 2 for Sinkformers: Transformers with Doubly Stochastic Attention
Figure 3 for Sinkformers: Transformers with Doubly Stochastic Attention
Figure 4 for Sinkformers: Transformers with Doubly Stochastic Attention
Viaarxiv icon

Momentum Residual Neural Networks

Add code
Bookmark button
Alert button
Feb 15, 2021
Michael E. Sander, Pierre Ablin, Mathieu Blondel, Gabriel Peyré

Figure 1 for Momentum Residual Neural Networks
Figure 2 for Momentum Residual Neural Networks
Figure 3 for Momentum Residual Neural Networks
Figure 4 for Momentum Residual Neural Networks
Viaarxiv icon