Peter L. Bartlett

Scaling Laws in Linear Regression: Compute, Parameters, and Data

Jun 12, 2024
Via arXiv

Large Stepsize Gradient Descent for Non-Homogeneous Two-Layer Networks: Margin Improvement and Fast Optimization

Jun 12, 2024
Via arXiv

Large Stepsize Gradient Descent for Logistic Loss: Non-Monotonicity of the Loss Improves Optimization Efficiency

Feb 24, 2024
Via arXiv

A Statistical Analysis of Wasserstein Autoencoders for Intrinsically Low-dimensional Data

Feb 24, 2024
Via arXiv

In-Context Learning of a Linear Transformer Block: Benefits of the MLP Component and One-Step GD Initialization

Feb 22, 2024
Via arXiv

On the Statistical Properties of Generative Adversarial Models for Low Intrinsic Data Dimension

Jan 28, 2024
Via arXiv

How Many Pretraining Tasks Are Needed for In-Context Learning of Linear Regression?

Oct 12, 2023
Via arXiv

Sharpness-Aware Minimization and the Edge of Stability

Sep 29, 2023
Via arXiv

Trained Transformers Learn Linear Models In-Context

Jun 16, 2023
Via arXiv

Prediction, Learning, Uniform Convergence, and Scale-sensitive Dimensions

Apr 24, 2023
Via arXiv