Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Do Transformers Parse while Predicting the Masked Word?


Mar 14, 2023
Haoyu Zhao, Abhishek Panigrahi, Rong Ge, Sanjeev Arora

Add code


   Access Paper or Ask Questions

Why (and When) does Local SGD Generalize Better than SGD?


Mar 09, 2023
Xinran Gu, Kaifeng Lyu, Longbo Huang, Sanjeev Arora

Add code

* Published as a conference paper at ICLR 2023 

   Access Paper or Ask Questions

Task-Specific Skill Localization in Fine-tuned Language Models


Feb 13, 2023
Abhishek Panigrahi, Nikunj Saunshi, Haoyu Zhao, Sanjeev Arora

Add code


   Access Paper or Ask Questions

New Definitions and Evaluations for Saliency Methods: Staying Intrinsic, Complete and Sound


Nov 05, 2022
Arushi Gupta, Nikunj Saunshi, Dingli Yu, Kaifeng Lyu, Sanjeev Arora

Add code

* NeurIPS 2022 (Oral) 

   Access Paper or Ask Questions

A Kernel-Based View of Language Model Fine-Tuning


Oct 11, 2022
Sadhika Malladi, Alexander Wettig, Dingli Yu, Danqi Chen, Sanjeev Arora

Add code

* Code and pre-computed kernels are publicly available at https://github.com/princeton-nlp/LM-Kernel-FT 

   Access Paper or Ask Questions

Understanding Influence Functions and Datamodels via Harmonic Analysis


Oct 03, 2022
Nikunj Saunshi, Arushi Gupta, Mark Braverman, Sanjeev Arora

Add code


   Access Paper or Ask Questions

Implicit Bias of Gradient Descent on Reparametrized Models: On Equivalence to Mirror Descent


Jul 08, 2022
Zhiyuan Li, Tianhao Wang, JasonD. Lee, Sanjeev Arora

Add code

* 37 pages 

   Access Paper or Ask Questions

Understanding the Generalization Benefit of Normalization Layers: Sharpness Reduction


Jun 14, 2022
Kaifeng Lyu, Zhiyuan Li, Sanjeev Arora

Add code

* 68 pages, many figures 

   Access Paper or Ask Questions

On the SDEs and Scaling Rules for Adaptive Gradient Algorithms


May 20, 2022
Sadhika Malladi, Kaifeng Lyu, Abhishek Panigrahi, Sanjeev Arora

Add code


   Access Paper or Ask Questions

1
2
3
4
5
6
7
>>