Picture for James B. Simon

James B. Simon

Shammie

Alternating Gradient Flows: A Theory of Feature Learning in Two-layer Neural Networks

Add code
Jun 06, 2025
Viaarxiv icon

Saddle-To-Saddle Dynamics in Deep ReLU Networks: Low-Rank Bias in the First Saddle Escape

Add code
May 27, 2025
Viaarxiv icon

The Optimization Landscape of SGD Across the Feature Learning Strength

Add code
Oct 06, 2024
Viaarxiv icon

More is Better in Modern Machine Learning: when Infinite Overparameterization is Optimal and Overfitting is Obligatory

Add code
Nov 27, 2023
Figure 1 for More is Better in Modern Machine Learning: when Infinite Overparameterization is Optimal and Overfitting is Obligatory
Figure 2 for More is Better in Modern Machine Learning: when Infinite Overparameterization is Optimal and Overfitting is Obligatory
Figure 3 for More is Better in Modern Machine Learning: when Infinite Overparameterization is Optimal and Overfitting is Obligatory
Figure 4 for More is Better in Modern Machine Learning: when Infinite Overparameterization is Optimal and Overfitting is Obligatory
Viaarxiv icon

A Spectral Condition for Feature Learning

Add code
Oct 26, 2023
Figure 1 for A Spectral Condition for Feature Learning
Figure 2 for A Spectral Condition for Feature Learning
Figure 3 for A Spectral Condition for Feature Learning
Figure 4 for A Spectral Condition for Feature Learning
Viaarxiv icon

Les Houches Lectures on Deep Learning at Large & Infinite Width

Add code
Sep 08, 2023
Viaarxiv icon

An Agnostic View on the Cost of Overfitting in Ridge Regression

Add code
Jun 22, 2023
Viaarxiv icon

Tune As You Scale: Hyperparameter Optimization For Compute Efficient Training

Add code
Jun 13, 2023
Viaarxiv icon

On the stepwise nature of self-supervised learning

Add code
Mar 27, 2023
Figure 1 for On the stepwise nature of self-supervised learning
Figure 2 for On the stepwise nature of self-supervised learning
Figure 3 for On the stepwise nature of self-supervised learning
Figure 4 for On the stepwise nature of self-supervised learning
Viaarxiv icon

Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds

Add code
Oct 24, 2022
Figure 1 for Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds
Figure 2 for Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds
Figure 3 for Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds
Figure 4 for Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds
Viaarxiv icon