Alert button
Picture for Giulio Biroli

Giulio Biroli

Alert button

LPENS

From Zero to Hero: How local curvature at artless initial conditions leads away from bad minima

Add code
Bookmark button
Alert button
Mar 04, 2024
Tony Bonnaire, Giulio Biroli, Chiara Cammarota

Figure 1 for From Zero to Hero: How local curvature at artless initial conditions leads away from bad minima
Figure 2 for From Zero to Hero: How local curvature at artless initial conditions leads away from bad minima
Figure 3 for From Zero to Hero: How local curvature at artless initial conditions leads away from bad minima
Figure 4 for From Zero to Hero: How local curvature at artless initial conditions leads away from bad minima
Viaarxiv icon

Dynamical Regimes of Diffusion Models

Add code
Bookmark button
Alert button
Feb 28, 2024
Giulio Biroli, Tony Bonnaire, Valentin de Bortoli, Marc Mézard

Viaarxiv icon

On the Impact of Overparameterization on the Training of a Shallow Neural Network in High Dimensions

Add code
Bookmark button
Alert button
Nov 07, 2023
Simon Martin, Francis Bach, Giulio Biroli

Viaarxiv icon

Wavelet Conditional Renormalization Group

Add code
Bookmark button
Alert button
Jul 11, 2022
Tanguy Marchand, Misaki Ozawa, Giulio Biroli, Stéphane Mallat

Figure 1 for Wavelet Conditional Renormalization Group
Figure 2 for Wavelet Conditional Renormalization Group
Figure 3 for Wavelet Conditional Renormalization Group
Figure 4 for Wavelet Conditional Renormalization Group
Viaarxiv icon

Optimal learning rate schedules in high-dimensional non-convex optimization problems

Add code
Bookmark button
Alert button
Feb 09, 2022
Stéphane d'Ascoli, Maria Refinetti, Giulio Biroli

Figure 1 for Optimal learning rate schedules in high-dimensional non-convex optimization problems
Figure 2 for Optimal learning rate schedules in high-dimensional non-convex optimization problems
Figure 3 for Optimal learning rate schedules in high-dimensional non-convex optimization problems
Figure 4 for Optimal learning rate schedules in high-dimensional non-convex optimization problems
Viaarxiv icon

Transformed CNNs: recasting pre-trained convolutional layers with self-attention

Add code
Bookmark button
Alert button
Jun 10, 2021
Stéphane d'Ascoli, Levent Sagun, Giulio Biroli, Ari Morcos

Figure 1 for Transformed CNNs: recasting pre-trained convolutional layers with self-attention
Figure 2 for Transformed CNNs: recasting pre-trained convolutional layers with self-attention
Figure 3 for Transformed CNNs: recasting pre-trained convolutional layers with self-attention
Figure 4 for Transformed CNNs: recasting pre-trained convolutional layers with self-attention
Viaarxiv icon

Sifting out the features by pruning: Are convolutional networks the winning lottery ticket of fully connected ones?

Add code
Bookmark button
Alert button
May 14, 2021
Franco Pellegrini, Giulio Biroli

Figure 1 for Sifting out the features by pruning: Are convolutional networks the winning lottery ticket of fully connected ones?
Figure 2 for Sifting out the features by pruning: Are convolutional networks the winning lottery ticket of fully connected ones?
Figure 3 for Sifting out the features by pruning: Are convolutional networks the winning lottery ticket of fully connected ones?
Figure 4 for Sifting out the features by pruning: Are convolutional networks the winning lottery ticket of fully connected ones?
Viaarxiv icon

ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases

Add code
Bookmark button
Alert button
Mar 19, 2021
Stéphane d'Ascoli, Hugo Touvron, Matthew Leavitt, Ari Morcos, Giulio Biroli, Levent Sagun

Figure 1 for ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases
Figure 2 for ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases
Figure 3 for ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases
Figure 4 for ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases
Viaarxiv icon

More data or more parameters? Investigating the effect of data structure on generalization

Add code
Bookmark button
Alert button
Mar 09, 2021
Stéphane d'Ascoli, Marylou Gabrié, Levent Sagun, Giulio Biroli

Figure 1 for More data or more parameters? Investigating the effect of data structure on generalization
Figure 2 for More data or more parameters? Investigating the effect of data structure on generalization
Figure 3 for More data or more parameters? Investigating the effect of data structure on generalization
Figure 4 for More data or more parameters? Investigating the effect of data structure on generalization
Viaarxiv icon