Alert button
Picture for Srinadh Bhojanapalli

Srinadh Bhojanapalli

Alert button

Semantic Label Smoothing for Sequence to Sequence Problems

Add code
Bookmark button
Alert button
Oct 15, 2020
Michal Lukasik, Himanshu Jain, Aditya Krishna Menon, Seungyeon Kim, Srinadh Bhojanapalli, Felix Yu, Sanjiv Kumar

Figure 1 for Semantic Label Smoothing for Sequence to Sequence Problems
Figure 2 for Semantic Label Smoothing for Sequence to Sequence Problems
Figure 3 for Semantic Label Smoothing for Sequence to Sequence Problems
Figure 4 for Semantic Label Smoothing for Sequence to Sequence Problems
Viaarxiv icon

$O(n)$ Connections are Expressive Enough: Universal Approximability of Sparse Transformers

Add code
Bookmark button
Alert button
Jun 08, 2020
Chulhee Yun, Yin-Wen Chang, Srinadh Bhojanapalli, Ankit Singh Rawat, Sashank J. Reddi, Sanjiv Kumar

Figure 1 for $O(n)$ Connections are Expressive Enough: Universal Approximability of Sparse Transformers
Figure 2 for $O(n)$ Connections are Expressive Enough: Universal Approximability of Sparse Transformers
Viaarxiv icon

Does label smoothing mitigate label noise?

Add code
Bookmark button
Alert button
Mar 05, 2020
Michal Lukasik, Srinadh Bhojanapalli, Aditya Krishna Menon, Sanjiv Kumar

Figure 1 for Does label smoothing mitigate label noise?
Figure 2 for Does label smoothing mitigate label noise?
Figure 3 for Does label smoothing mitigate label noise?
Figure 4 for Does label smoothing mitigate label noise?
Viaarxiv icon

Low-Rank Bottleneck in Multi-head Attention Models

Add code
Bookmark button
Alert button
Feb 17, 2020
Srinadh Bhojanapalli, Chulhee Yun, Ankit Singh Rawat, Sashank J. Reddi, Sanjiv Kumar

Figure 1 for Low-Rank Bottleneck in Multi-head Attention Models
Figure 2 for Low-Rank Bottleneck in Multi-head Attention Models
Figure 3 for Low-Rank Bottleneck in Multi-head Attention Models
Figure 4 for Low-Rank Bottleneck in Multi-head Attention Models
Viaarxiv icon

Are Transformers universal approximators of sequence-to-sequence functions?

Add code
Bookmark button
Alert button
Dec 20, 2019
Chulhee Yun, Srinadh Bhojanapalli, Ankit Singh Rawat, Sashank J. Reddi, Sanjiv Kumar

Figure 1 for Are Transformers universal approximators of sequence-to-sequence functions?
Figure 2 for Are Transformers universal approximators of sequence-to-sequence functions?
Viaarxiv icon

Stabilizing GAN Training with Multiple Random Projections

Add code
Bookmark button
Alert button
Jun 23, 2018
Behnam Neyshabur, Srinadh Bhojanapalli, Ayan Chakrabarti

Figure 1 for Stabilizing GAN Training with Multiple Random Projections
Figure 2 for Stabilizing GAN Training with Multiple Random Projections
Figure 3 for Stabilizing GAN Training with Multiple Random Projections
Figure 4 for Stabilizing GAN Training with Multiple Random Projections
Viaarxiv icon

Towards Understanding the Role of Over-Parametrization in Generalization of Neural Networks

Add code
Bookmark button
Alert button
May 30, 2018
Behnam Neyshabur, Zhiyuan Li, Srinadh Bhojanapalli, Yann LeCun, Nathan Srebro

Figure 1 for Towards Understanding the Role of Over-Parametrization in Generalization of Neural Networks
Figure 2 for Towards Understanding the Role of Over-Parametrization in Generalization of Neural Networks
Figure 3 for Towards Understanding the Role of Over-Parametrization in Generalization of Neural Networks
Figure 4 for Towards Understanding the Role of Over-Parametrization in Generalization of Neural Networks
Viaarxiv icon

Smoothed analysis for low-rank solutions to semidefinite programs in quadratic penalty form

Add code
Bookmark button
Alert button
Mar 01, 2018
Srinadh Bhojanapalli, Nicolas Boumal, Prateek Jain, Praneeth Netrapalli

Viaarxiv icon

A PAC-Bayesian Approach to Spectrally-Normalized Margin Bounds for Neural Networks

Add code
Bookmark button
Alert button
Feb 23, 2018
Behnam Neyshabur, Srinadh Bhojanapalli, Nathan Srebro

Viaarxiv icon

Exploring Generalization in Deep Learning

Add code
Bookmark button
Alert button
Jul 06, 2017
Behnam Neyshabur, Srinadh Bhojanapalli, David McAllester, Nathan Srebro

Figure 1 for Exploring Generalization in Deep Learning
Figure 2 for Exploring Generalization in Deep Learning
Figure 3 for Exploring Generalization in Deep Learning
Figure 4 for Exploring Generalization in Deep Learning
Viaarxiv icon