Nikhil Ghosh

LoRA+: Efficient Low Rank Adaptation of Large Models

Feb 19, 2024
Soufiane Hayou, Nikhil Ghosh, Bin Yu

Via arXiv

More is Better in Modern Machine Learning: when Infinite Overparameterization is Optimal and Overfitting is Obligatory

Nov 27, 2023
James B. Simon, Dhruva Karkada, Nikhil Ghosh, Mikhail Belkin

Via arXiv

The Effect of SGD Batch Size on Autoencoder Learning: Sparsity, Sharpness, and Feature Learning

Aug 06, 2023
Nikhil Ghosh, Spencer Frei, Wooseok Ha, Bin Yu

Via arXiv

The Power of External Memory in Increasing Predictive Model Capacity

Jan 31, 2023
Cenk Baykal, Dylan J Cutler, Nishanth Dikkala, Nikhil Ghosh, Rina Panigrahy, Xin Wang

Via arXiv

Alternating Updates for Efficient Transformers

Jan 30, 2023
Cenk Baykal, Dylan Cutler, Nishanth Dikkala, Nikhil Ghosh, Rina Panigrahy, Xin Wang

Via arXiv

A Universal Trade-off Between the Model Size, Test Loss, and Training Loss of Linear Predictors

Jul 23, 2022
Nikhil Ghosh, Mikhail Belkin

Via arXiv

Deconstructing Distributions: A Pointwise Framework of Learning

Feb 20, 2022
Gal Kaplun, Nikhil Ghosh, Saurabh Garg, Boaz Barak, Preetum Nakkiran

Via arXiv

The Three Stages of Learning Dynamics in High-Dimensional Kernel Methods

Nov 13, 2021
Nikhil Ghosh, Song Mei, Bin Yu

Via arXiv

Landmark Ordinal Embedding

Oct 27, 2019
Nikhil Ghosh, Yuxin Chen, Yisong Yue

Via arXiv