
Enric Boix-Adsera

Towards a theory of model distillation

Mar 14, 2024
Enric Boix-Adsera

PROPANE: Prompt design as an inverse problem

Nov 13, 2023
Rimon Melamed, Lucas H. McCabe, Tanay Wakhare, Yejin Kim, H. Howie Huang, Enric Boix-Adsera

When can transformers reason with abstract symbols?

Oct 15, 2023
Enric Boix-Adsera, Omid Saremi, Emmanuel Abbe, Samy Bengio, Etai Littwin, Joshua Susskind

Transformers learn through gradual rank increase

Jun 12, 2023
Enric Boix-Adsera, Etai Littwin, Emmanuel Abbe, Samy Bengio, Joshua Susskind

The NTK approximation is valid for longer than you think

May 22, 2023
Enric Boix-Adsera, Etai Littwin

SGD learning on neural networks: leap complexity and saddle-to-saddle dynamics

Feb 21, 2023
Emmanuel Abbe, Enric Boix-Adsera, Theodor Misiakiewicz

GULP: a prediction-based metric between representations

Oct 12, 2022
Enric Boix-Adsera, Hannah Lawrence, George Stepaniants, Philippe Rigollet

On the non-universality of deep learning: quantifying the cost of symmetry

Aug 05, 2022
Emmanuel Abbe, Enric Boix-Adsera

The merged-staircase property: a necessary and nearly sufficient condition for SGD learning of sparse functions on two-layer neural networks

Feb 17, 2022
Emmanuel Abbe, Enric Boix-Adsera, Theodor Misiakiewicz

The staircase property: How hierarchical structure can guide deep learning

Aug 24, 2021
Emmanuel Abbe, Enric Boix-Adsera, Matthew Brennan, Guy Bresler, Dheeraj Nagaraj
