Alert button
Picture for Alberto Bietti

Alberto Bietti

Alert button

Level Set Teleportation: An Optimization Perspective

Add code
Bookmark button
Alert button
Mar 05, 2024
Aaron Mishkin, Alberto Bietti, Robert M. Gower

Figure 1 for Level Set Teleportation: An Optimization Perspective
Figure 2 for Level Set Teleportation: An Optimization Perspective
Figure 3 for Level Set Teleportation: An Optimization Perspective
Figure 4 for Level Set Teleportation: An Optimization Perspective
Viaarxiv icon

Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent on Language Models

Add code
Bookmark button
Alert button
Feb 29, 2024
Frederik Kunstner, Robin Yadav, Alan Milligan, Mark Schmidt, Alberto Bietti

Viaarxiv icon

Learning Associative Memories with Gradient Descent

Add code
Bookmark button
Alert button
Feb 28, 2024
Vivien Cabannes, Berfin Simsek, Alberto Bietti

Viaarxiv icon

On Learning Gaussian Multi-index Models with Gradient Flow

Add code
Bookmark button
Alert button
Nov 02, 2023
Alberto Bietti, Joan Bruna, Loucas Pillaud-Vivien

Viaarxiv icon

AstroCLIP: Cross-Modal Pre-Training for Astronomical Foundation Models

Add code
Bookmark button
Alert button
Oct 04, 2023
Francois Lanusse, Liam Parker, Siavash Golkar, Miles Cranmer, Alberto Bietti, Michael Eickenberg, Geraud Krawezik, Michael McCabe, Ruben Ohana, Mariel Pettee, Bruno Regaldo-Saint Blancard, Tiberiu Tesileanu, Kyunghyun Cho, Shirley Ho

Figure 1 for AstroCLIP: Cross-Modal Pre-Training for Astronomical Foundation Models
Figure 2 for AstroCLIP: Cross-Modal Pre-Training for Astronomical Foundation Models
Figure 3 for AstroCLIP: Cross-Modal Pre-Training for Astronomical Foundation Models
Figure 4 for AstroCLIP: Cross-Modal Pre-Training for Astronomical Foundation Models
Viaarxiv icon

Multiple Physics Pretraining for Physical Surrogate Models

Add code
Bookmark button
Alert button
Oct 04, 2023
Michael McCabe, Bruno Régaldo-Saint Blancard, Liam Holden Parker, Ruben Ohana, Miles Cranmer, Alberto Bietti, Michael Eickenberg, Siavash Golkar, Geraud Krawezik, Francois Lanusse, Mariel Pettee, Tiberiu Tesileanu, Kyunghyun Cho, Shirley Ho

Figure 1 for Multiple Physics Pretraining for Physical Surrogate Models
Figure 2 for Multiple Physics Pretraining for Physical Surrogate Models
Figure 3 for Multiple Physics Pretraining for Physical Surrogate Models
Figure 4 for Multiple Physics Pretraining for Physical Surrogate Models
Viaarxiv icon

xVal: A Continuous Number Encoding for Large Language Models

Add code
Bookmark button
Alert button
Oct 04, 2023
Siavash Golkar, Mariel Pettee, Michael Eickenberg, Alberto Bietti, Miles Cranmer, Geraud Krawezik, Francois Lanusse, Michael McCabe, Ruben Ohana, Liam Parker, Bruno Régaldo-Saint Blancard, Tiberiu Tesileanu, Kyunghyun Cho, Shirley Ho

Figure 1 for xVal: A Continuous Number Encoding for Large Language Models
Figure 2 for xVal: A Continuous Number Encoding for Large Language Models
Figure 3 for xVal: A Continuous Number Encoding for Large Language Models
Figure 4 for xVal: A Continuous Number Encoding for Large Language Models
Viaarxiv icon

Scaling Laws for Associative Memories

Add code
Bookmark button
Alert button
Oct 04, 2023
Vivien Cabannes, Elvis Dohmatob, Alberto Bietti

Viaarxiv icon

Birth of a Transformer: A Memory Viewpoint

Add code
Bookmark button
Alert button
Jun 01, 2023
Alberto Bietti, Vivien Cabannes, Diane Bouchacourt, Herve Jegou, Leon Bottou

Figure 1 for Birth of a Transformer: A Memory Viewpoint
Figure 2 for Birth of a Transformer: A Memory Viewpoint
Figure 3 for Birth of a Transformer: A Memory Viewpoint
Figure 4 for Birth of a Transformer: A Memory Viewpoint
Viaarxiv icon

The SSL Interplay: Augmentations, Inductive Bias, and Generalization

Add code
Bookmark button
Alert button
Feb 06, 2023
Vivien Cabannes, Bobak T. Kiani, Randall Balestriero, Yann LeCun, Alberto Bietti

Figure 1 for The SSL Interplay: Augmentations, Inductive Bias, and Generalization
Figure 2 for The SSL Interplay: Augmentations, Inductive Bias, and Generalization
Figure 3 for The SSL Interplay: Augmentations, Inductive Bias, and Generalization
Figure 4 for The SSL Interplay: Augmentations, Inductive Bias, and Generalization
Viaarxiv icon