Gal Vardi

Temperature is All You Need for Generalization in Langevin Dynamics and other Markov Processes

May 25, 2025

A Theory of Learning with Autoregressive Chain of Thought

Mar 11, 2025

Flavors of Margin: Implicit Bias of Steepest Descent in Homogeneous Neural Networks

Oct 29, 2024

Provable Tempered Overfitting of Minimal Nets and Typical Nets

Oct 24, 2024

Benign Overfitting in Single-Head Attention

Oct 10, 2024

Provable Privacy Attacks on Trained Shallow Neural Networks

Oct 10, 2024

Trained Transformer Classifiers Generalize and Exhibit Benign Overfitting In-Context

Oct 02, 2024

Overfitting Behaviour of Gaussian Kernel Ridgeless Regression: Varying Bandwidth or Dimensionality

Sep 05, 2024

Approaching Deep Learning through the Spectral Dynamics of Weights

Aug 21, 2024

Reconstructing Training Data From Real World Models Trained with Transfer Learning

Jul 22, 2024