Alert button
Picture for Vaishnavh Nagarajan

Vaishnavh Nagarajan

Alert button

The pitfalls of next-token prediction

Add code
Bookmark button
Alert button
Mar 11, 2024
Gregor Bachmann, Vaishnavh Nagarajan

Figure 1 for The pitfalls of next-token prediction
Figure 2 for The pitfalls of next-token prediction
Figure 3 for The pitfalls of next-token prediction
Figure 4 for The pitfalls of next-token prediction
Viaarxiv icon

What do larger image classifiers memorise?

Add code
Bookmark button
Alert button
Oct 09, 2023
Michal Lukasik, Vaishnavh Nagarajan, Ankit Singh Rawat, Aditya Krishna Menon, Sanjiv Kumar

Figure 1 for What do larger image classifiers memorise?
Figure 2 for What do larger image classifiers memorise?
Figure 3 for What do larger image classifiers memorise?
Figure 4 for What do larger image classifiers memorise?
Viaarxiv icon

The Cost of Down-Scaling Language Models: Fact Recall Deteriorates before In-Context Learning

Add code
Bookmark button
Alert button
Oct 07, 2023
Tian Jin, Nolan Clement, Xin Dong, Vaishnavh Nagarajan, Michael Carbin, Jonathan Ragan-Kelley, Gintare Karolina Dziugaite

Figure 1 for The Cost of Down-Scaling Language Models: Fact Recall Deteriorates before In-Context Learning
Figure 2 for The Cost of Down-Scaling Language Models: Fact Recall Deteriorates before In-Context Learning
Figure 3 for The Cost of Down-Scaling Language Models: Fact Recall Deteriorates before In-Context Learning
Figure 4 for The Cost of Down-Scaling Language Models: Fact Recall Deteriorates before In-Context Learning
Viaarxiv icon

Think before you speak: Training Language Models With Pause Tokens

Add code
Bookmark button
Alert button
Oct 03, 2023
Sachin Goyal, Ziwei Ji, Ankit Singh Rawat, Aditya Krishna Menon, Sanjiv Kumar, Vaishnavh Nagarajan

Figure 1 for Think before you speak: Training Language Models With Pause Tokens
Figure 2 for Think before you speak: Training Language Models With Pause Tokens
Figure 3 for Think before you speak: Training Language Models With Pause Tokens
Figure 4 for Think before you speak: Training Language Models With Pause Tokens
Viaarxiv icon

ResMem: Learn what you can and memorize the rest

Add code
Bookmark button
Alert button
Feb 03, 2023
Zitong Yang, Michal Lukasik, Vaishnavh Nagarajan, Zonglin Li, Ankit Singh Rawat, Manzil Zaheer, Aditya Krishna Menon, Sanjiv Kumar

Figure 1 for ResMem: Learn what you can and memorize the rest
Figure 2 for ResMem: Learn what you can and memorize the rest
Figure 3 for ResMem: Learn what you can and memorize the rest
Figure 4 for ResMem: Learn what you can and memorize the rest
Viaarxiv icon

On student-teacher deviations in distillation: does it pay to disobey?

Add code
Bookmark button
Alert button
Jan 30, 2023
Vaishnavh Nagarajan, Aditya Krishna Menon, Srinadh Bhojanapalli, Hossein Mobahi, Sanjiv Kumar

Figure 1 for On student-teacher deviations in distillation: does it pay to disobey?
Figure 2 for On student-teacher deviations in distillation: does it pay to disobey?
Figure 3 for On student-teacher deviations in distillation: does it pay to disobey?
Figure 4 for On student-teacher deviations in distillation: does it pay to disobey?
Viaarxiv icon

Explaining generalization in deep learning: progress and fundamental limits

Add code
Bookmark button
Alert button
Oct 17, 2021
Vaishnavh Nagarajan

Figure 1 for Explaining generalization in deep learning: progress and fundamental limits
Figure 2 for Explaining generalization in deep learning: progress and fundamental limits
Figure 3 for Explaining generalization in deep learning: progress and fundamental limits
Figure 4 for Explaining generalization in deep learning: progress and fundamental limits
Viaarxiv icon

Assessing Generalization of SGD via Disagreement

Add code
Bookmark button
Alert button
Jun 25, 2021
Yiding Jiang, Vaishnavh Nagarajan, Christina Baek, J. Zico Kolter

Figure 1 for Assessing Generalization of SGD via Disagreement
Figure 2 for Assessing Generalization of SGD via Disagreement
Figure 3 for Assessing Generalization of SGD via Disagreement
Figure 4 for Assessing Generalization of SGD via Disagreement
Viaarxiv icon

A Learning Theoretic Perspective on Local Explainability

Add code
Bookmark button
Alert button
Nov 02, 2020
Jeffrey Li, Vaishnavh Nagarajan, Gregory Plumb, Ameet Talwalkar

Figure 1 for A Learning Theoretic Perspective on Local Explainability
Figure 2 for A Learning Theoretic Perspective on Local Explainability
Viaarxiv icon

Understanding the Failure Modes of Out-of-Distribution Generalization

Add code
Bookmark button
Alert button
Oct 29, 2020
Vaishnavh Nagarajan, Anders Andreassen, Behnam Neyshabur

Figure 1 for Understanding the Failure Modes of Out-of-Distribution Generalization
Figure 2 for Understanding the Failure Modes of Out-of-Distribution Generalization
Figure 3 for Understanding the Failure Modes of Out-of-Distribution Generalization
Figure 4 for Understanding the Failure Modes of Out-of-Distribution Generalization
Viaarxiv icon