Alert button
Picture for Nikunj Saunshi

Nikunj Saunshi

Alert button

Efficient Stagewise Pretraining via Progressive Subnetworks

Add code
Bookmark button
Alert button
Feb 08, 2024
Abhishek Panigrahi, Nikunj Saunshi, Kaifeng Lyu, Sobhan Miryoosefi, Sashank Reddi, Satyen Kale, Sanjiv Kumar

Viaarxiv icon

Reasoning in Large Language Models Through Symbolic Math Word Problems

Add code
Bookmark button
Alert button
Aug 03, 2023
Vedant Gaur, Nikunj Saunshi

Viaarxiv icon

Task-Specific Skill Localization in Fine-tuned Language Models

Add code
Bookmark button
Alert button
Feb 13, 2023
Abhishek Panigrahi, Nikunj Saunshi, Haoyu Zhao, Sanjeev Arora

Figure 1 for Task-Specific Skill Localization in Fine-tuned Language Models
Figure 2 for Task-Specific Skill Localization in Fine-tuned Language Models
Figure 3 for Task-Specific Skill Localization in Fine-tuned Language Models
Figure 4 for Task-Specific Skill Localization in Fine-tuned Language Models
Viaarxiv icon

New Definitions and Evaluations for Saliency Methods: Staying Intrinsic, Complete and Sound

Add code
Bookmark button
Alert button
Nov 05, 2022
Arushi Gupta, Nikunj Saunshi, Dingli Yu, Kaifeng Lyu, Sanjeev Arora

Figure 1 for New Definitions and Evaluations for Saliency Methods: Staying Intrinsic, Complete and Sound
Figure 2 for New Definitions and Evaluations for Saliency Methods: Staying Intrinsic, Complete and Sound
Figure 3 for New Definitions and Evaluations for Saliency Methods: Staying Intrinsic, Complete and Sound
Figure 4 for New Definitions and Evaluations for Saliency Methods: Staying Intrinsic, Complete and Sound
Viaarxiv icon

Understanding Influence Functions and Datamodels via Harmonic Analysis

Add code
Bookmark button
Alert button
Oct 03, 2022
Nikunj Saunshi, Arushi Gupta, Mark Braverman, Sanjeev Arora

Figure 1 for Understanding Influence Functions and Datamodels via Harmonic Analysis
Figure 2 for Understanding Influence Functions and Datamodels via Harmonic Analysis
Figure 3 for Understanding Influence Functions and Datamodels via Harmonic Analysis
Figure 4 for Understanding Influence Functions and Datamodels via Harmonic Analysis
Viaarxiv icon

Understanding Contrastive Learning Requires Incorporating Inductive Biases

Add code
Bookmark button
Alert button
Feb 28, 2022
Nikunj Saunshi, Jordan Ash, Surbhi Goel, Dipendra Misra, Cyril Zhang, Sanjeev Arora, Sham Kakade, Akshay Krishnamurthy

Figure 1 for Understanding Contrastive Learning Requires Incorporating Inductive Biases
Figure 2 for Understanding Contrastive Learning Requires Incorporating Inductive Biases
Figure 3 for Understanding Contrastive Learning Requires Incorporating Inductive Biases
Figure 4 for Understanding Contrastive Learning Requires Incorporating Inductive Biases
Viaarxiv icon

On Predicting Generalization using GANs

Add code
Bookmark button
Alert button
Nov 28, 2021
Yi Zhang, Arushi Gupta, Nikunj Saunshi, Sanjeev Arora

Figure 1 for On Predicting Generalization using GANs
Figure 2 for On Predicting Generalization using GANs
Figure 3 for On Predicting Generalization using GANs
Figure 4 for On Predicting Generalization using GANs
Viaarxiv icon

A Representation Learning Perspective on the Importance of Train-Validation Splitting in Meta-Learning

Add code
Bookmark button
Alert button
Jun 29, 2021
Nikunj Saunshi, Arushi Gupta, Wei Hu

Figure 1 for A Representation Learning Perspective on the Importance of Train-Validation Splitting in Meta-Learning
Figure 2 for A Representation Learning Perspective on the Importance of Train-Validation Splitting in Meta-Learning
Figure 3 for A Representation Learning Perspective on the Importance of Train-Validation Splitting in Meta-Learning
Figure 4 for A Representation Learning Perspective on the Importance of Train-Validation Splitting in Meta-Learning
Viaarxiv icon

A Mathematical Exploration of Why Language Models Help Solve Downstream Tasks

Add code
Bookmark button
Alert button
Oct 07, 2020
Nikunj Saunshi, Sadhika Malladi, Sanjeev Arora

Figure 1 for A Mathematical Exploration of Why Language Models Help Solve Downstream Tasks
Figure 2 for A Mathematical Exploration of Why Language Models Help Solve Downstream Tasks
Figure 3 for A Mathematical Exploration of Why Language Models Help Solve Downstream Tasks
Figure 4 for A Mathematical Exploration of Why Language Models Help Solve Downstream Tasks
Viaarxiv icon

Predicting What You Already Know Helps: Provable Self-Supervised Learning

Add code
Bookmark button
Alert button
Aug 03, 2020
Jason D. Lee, Qi Lei, Nikunj Saunshi, Jiacheng Zhuo

Figure 1 for Predicting What You Already Know Helps: Provable Self-Supervised Learning
Figure 2 for Predicting What You Already Know Helps: Provable Self-Supervised Learning
Figure 3 for Predicting What You Already Know Helps: Provable Self-Supervised Learning
Viaarxiv icon