Picture for Emmanuel Abbe

Emmanuel Abbe

How Far Can Transformers Reason? The Locality Barrier and Inductive Scratchpad

Add code
Jun 10, 2024
Viaarxiv icon

On the Minimal Degree Bias in Generalization on the Unseen for non-Boolean Functions

Add code
Jun 10, 2024
Figure 1 for On the Minimal Degree Bias in Generalization on the Unseen for non-Boolean Functions
Figure 2 for On the Minimal Degree Bias in Generalization on the Unseen for non-Boolean Functions
Figure 3 for On the Minimal Degree Bias in Generalization on the Unseen for non-Boolean Functions
Figure 4 for On the Minimal Degree Bias in Generalization on the Unseen for non-Boolean Functions
Viaarxiv icon

When can transformers reason with abstract symbols?

Add code
Oct 15, 2023
Figure 1 for When can transformers reason with abstract symbols?
Figure 2 for When can transformers reason with abstract symbols?
Figure 3 for When can transformers reason with abstract symbols?
Figure 4 for When can transformers reason with abstract symbols?
Viaarxiv icon

Provable Advantage of Curriculum Learning on Parity Targets with Mixed Inputs

Add code
Jun 29, 2023
Figure 1 for Provable Advantage of Curriculum Learning on Parity Targets with Mixed Inputs
Figure 2 for Provable Advantage of Curriculum Learning on Parity Targets with Mixed Inputs
Figure 3 for Provable Advantage of Curriculum Learning on Parity Targets with Mixed Inputs
Figure 4 for Provable Advantage of Curriculum Learning on Parity Targets with Mixed Inputs
Viaarxiv icon

Transformers learn through gradual rank increase

Add code
Jun 12, 2023
Figure 1 for Transformers learn through gradual rank increase
Figure 2 for Transformers learn through gradual rank increase
Figure 3 for Transformers learn through gradual rank increase
Figure 4 for Transformers learn through gradual rank increase
Viaarxiv icon

SGD learning on neural networks: leap complexity and saddle-to-saddle dynamics

Add code
Feb 21, 2023
Figure 1 for SGD learning on neural networks: leap complexity and saddle-to-saddle dynamics
Figure 2 for SGD learning on neural networks: leap complexity and saddle-to-saddle dynamics
Figure 3 for SGD learning on neural networks: leap complexity and saddle-to-saddle dynamics
Figure 4 for SGD learning on neural networks: leap complexity and saddle-to-saddle dynamics
Viaarxiv icon

Generalization on the Unseen, Logic Reasoning and Degree Curriculum

Add code
Jan 30, 2023
Figure 1 for Generalization on the Unseen, Logic Reasoning and Degree Curriculum
Figure 2 for Generalization on the Unseen, Logic Reasoning and Degree Curriculum
Figure 3 for Generalization on the Unseen, Logic Reasoning and Degree Curriculum
Figure 4 for Generalization on the Unseen, Logic Reasoning and Degree Curriculum
Viaarxiv icon

On the non-universality of deep learning: quantifying the cost of symmetry

Add code
Aug 05, 2022
Figure 1 for On the non-universality of deep learning: quantifying the cost of symmetry
Figure 2 for On the non-universality of deep learning: quantifying the cost of symmetry
Figure 3 for On the non-universality of deep learning: quantifying the cost of symmetry
Viaarxiv icon

Learning to Reason with Neural Networks: Generalization, Unseen Data and Boolean Measures

Add code
May 26, 2022
Figure 1 for Learning to Reason with Neural Networks: Generalization, Unseen Data and Boolean Measures
Figure 2 for Learning to Reason with Neural Networks: Generalization, Unseen Data and Boolean Measures
Figure 3 for Learning to Reason with Neural Networks: Generalization, Unseen Data and Boolean Measures
Figure 4 for Learning to Reason with Neural Networks: Generalization, Unseen Data and Boolean Measures
Viaarxiv icon

An initial alignment between neural network and target is needed for gradient descent to learn

Add code
Feb 25, 2022
Figure 1 for An initial alignment between neural network and target is needed for gradient descent to learn
Figure 2 for An initial alignment between neural network and target is needed for gradient descent to learn
Viaarxiv icon