Jingfeng Wu

Scaling Laws in Linear Regression: Compute, Parameters, and Data

Jun 12, 2024
Via arXiv

Large Stepsize Gradient Descent for Non-Homogeneous Two-Layer Networks: Margin Improvement and Fast Optimization

Jun 12, 2024
Via arXiv

Large Stepsize Gradient Descent for Logistic Loss: Non-Monotonicity of the Loss Improves Optimization Efficiency

Feb 24, 2024
Via arXiv

In-Context Learning of a Linear Transformer Block: Benefits of the MLP Component and One-Step GD Initialization

Feb 22, 2024
Via arXiv

Risk Bounds of Accelerated SGD for Overparameterized Linear Regression

Nov 23, 2023
Via arXiv

How Many Pretraining Tasks Are Needed for In-Context Learning of Linear Regression?

Oct 12, 2023
Via arXiv

Private Federated Frequency Estimation: Adapting to the Hardness of the Instance

Jun 15, 2023
Via arXiv

Implicit Bias of Gradient Descent for Logistic Regression at the Edge of Stability

May 19, 2023
Via arXiv

Fixed Design Analysis of Regularization-Based Continual Learning

Mar 17, 2023
Via arXiv

Learning High-Dimensional Single-Neuron ReLU Networks with Finite Samples

Mar 03, 2023
Via arXiv