Picture for Nikhil Ghosh

Nikhil Ghosh

There Will Be a Scientific Theory of Deep Learning

Add code
Apr 23, 2026
Viaarxiv icon

Does a Global Perspective Help Prune Sparse MoEs Elegantly?

Add code
Apr 08, 2026
Viaarxiv icon

Understanding the Mechanisms of Fast Hyperparameter Transfer

Add code
Dec 28, 2025
Viaarxiv icon

GSM-Agent: Understanding Agentic Reasoning Using Controllable Environments

Add code
Sep 26, 2025
Figure 1 for GSM-Agent: Understanding Agentic Reasoning Using Controllable Environments
Figure 2 for GSM-Agent: Understanding Agentic Reasoning Using Controllable Environments
Figure 3 for GSM-Agent: Understanding Agentic Reasoning Using Controllable Environments
Figure 4 for GSM-Agent: Understanding Agentic Reasoning Using Controllable Environments
Viaarxiv icon

PLoP: Precise LoRA Placement for Efficient Finetuning of Large Models

Add code
Jun 25, 2025
Figure 1 for PLoP: Precise LoRA Placement for Efficient Finetuning of Large Models
Figure 2 for PLoP: Precise LoRA Placement for Efficient Finetuning of Large Models
Figure 3 for PLoP: Precise LoRA Placement for Efficient Finetuning of Large Models
Figure 4 for PLoP: Precise LoRA Placement for Efficient Finetuning of Large Models
Viaarxiv icon

The Impact of Initialization on LoRA Finetuning Dynamics

Add code
Jun 12, 2024
Figure 1 for The Impact of Initialization on LoRA Finetuning Dynamics
Figure 2 for The Impact of Initialization on LoRA Finetuning Dynamics
Figure 3 for The Impact of Initialization on LoRA Finetuning Dynamics
Figure 4 for The Impact of Initialization on LoRA Finetuning Dynamics
Viaarxiv icon

LoRA+: Efficient Low Rank Adaptation of Large Models

Add code
Feb 19, 2024
Figure 1 for LoRA+: Efficient Low Rank Adaptation of Large Models
Figure 2 for LoRA+: Efficient Low Rank Adaptation of Large Models
Figure 3 for LoRA+: Efficient Low Rank Adaptation of Large Models
Figure 4 for LoRA+: Efficient Low Rank Adaptation of Large Models
Viaarxiv icon

More is Better in Modern Machine Learning: when Infinite Overparameterization is Optimal and Overfitting is Obligatory

Add code
Nov 27, 2023
Figure 1 for More is Better in Modern Machine Learning: when Infinite Overparameterization is Optimal and Overfitting is Obligatory
Figure 2 for More is Better in Modern Machine Learning: when Infinite Overparameterization is Optimal and Overfitting is Obligatory
Figure 3 for More is Better in Modern Machine Learning: when Infinite Overparameterization is Optimal and Overfitting is Obligatory
Figure 4 for More is Better in Modern Machine Learning: when Infinite Overparameterization is Optimal and Overfitting is Obligatory
Viaarxiv icon

The Effect of SGD Batch Size on Autoencoder Learning: Sparsity, Sharpness, and Feature Learning

Add code
Aug 06, 2023
Figure 1 for The Effect of SGD Batch Size on Autoencoder Learning: Sparsity, Sharpness, and Feature Learning
Figure 2 for The Effect of SGD Batch Size on Autoencoder Learning: Sparsity, Sharpness, and Feature Learning
Viaarxiv icon

The Power of External Memory in Increasing Predictive Model Capacity

Add code
Jan 31, 2023
Figure 1 for The Power of External Memory in Increasing Predictive Model Capacity
Figure 2 for The Power of External Memory in Increasing Predictive Model Capacity
Figure 3 for The Power of External Memory in Increasing Predictive Model Capacity
Figure 4 for The Power of External Memory in Increasing Predictive Model Capacity
Viaarxiv icon