Picture for Nikhil Vyas

Nikhil Vyas

Deconstructing What Makes a Good Optimizer for Language Models

Add code
Jul 10, 2024
Viaarxiv icon

A New Perspective on Shampoo's Preconditioner

Add code
Jun 25, 2024
Viaarxiv icon

Distinguishing the Knowable from the Unknowable with Language Models

Add code
Feb 05, 2024
Viaarxiv icon

On Privileged and Convergent Bases in Neural Network Representations

Add code
Jul 24, 2023
Viaarxiv icon

Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning

Add code
Jun 14, 2023
Figure 1 for Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning
Figure 2 for Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning
Figure 3 for Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning
Figure 4 for Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning
Viaarxiv icon

Feature-Learning Networks Are Consistent Across Widths At Realistic Scales

Add code
May 28, 2023
Figure 1 for Feature-Learning Networks Are Consistent Across Widths At Realistic Scales
Figure 2 for Feature-Learning Networks Are Consistent Across Widths At Realistic Scales
Figure 3 for Feature-Learning Networks Are Consistent Across Widths At Realistic Scales
Figure 4 for Feature-Learning Networks Are Consistent Across Widths At Realistic Scales
Viaarxiv icon

Provable Copyright Protection for Generative Models

Add code
Feb 21, 2023
Figure 1 for Provable Copyright Protection for Generative Models
Figure 2 for Provable Copyright Protection for Generative Models
Figure 3 for Provable Copyright Protection for Generative Models
Figure 4 for Provable Copyright Protection for Generative Models
Viaarxiv icon

Limitations of the NTK for Understanding Generalization in Deep Learning

Add code
Jun 20, 2022
Figure 1 for Limitations of the NTK for Understanding Generalization in Deep Learning
Figure 2 for Limitations of the NTK for Understanding Generalization in Deep Learning
Figure 3 for Limitations of the NTK for Understanding Generalization in Deep Learning
Figure 4 for Limitations of the NTK for Understanding Generalization in Deep Learning
Viaarxiv icon

Thwarting Adversarial Examples: An $L_0$-RobustSparse Fourier Transform

Add code
Dec 12, 2018
Figure 1 for Thwarting Adversarial Examples: An $L_0$-RobustSparse Fourier Transform
Figure 2 for Thwarting Adversarial Examples: An $L_0$-RobustSparse Fourier Transform
Figure 3 for Thwarting Adversarial Examples: An $L_0$-RobustSparse Fourier Transform
Figure 4 for Thwarting Adversarial Examples: An $L_0$-RobustSparse Fourier Transform
Viaarxiv icon