Picture for Srinadh Bhojanapalli

Srinadh Bhojanapalli

Dj

An efficient nonconvex reformulation of stagewise convex optimization problems

Add code
Oct 27, 2020
Figure 1 for An efficient nonconvex reformulation of stagewise convex optimization problems
Figure 2 for An efficient nonconvex reformulation of stagewise convex optimization problems
Figure 3 for An efficient nonconvex reformulation of stagewise convex optimization problems
Figure 4 for An efficient nonconvex reformulation of stagewise convex optimization problems
Viaarxiv icon

Coping with Label Shift via Distributionally Robust Optimisation

Add code
Oct 23, 2020
Figure 1 for Coping with Label Shift via Distributionally Robust Optimisation
Figure 2 for Coping with Label Shift via Distributionally Robust Optimisation
Figure 3 for Coping with Label Shift via Distributionally Robust Optimisation
Figure 4 for Coping with Label Shift via Distributionally Robust Optimisation
Viaarxiv icon

Semantic Label Smoothing for Sequence to Sequence Problems

Add code
Oct 15, 2020
Figure 1 for Semantic Label Smoothing for Sequence to Sequence Problems
Figure 2 for Semantic Label Smoothing for Sequence to Sequence Problems
Figure 3 for Semantic Label Smoothing for Sequence to Sequence Problems
Figure 4 for Semantic Label Smoothing for Sequence to Sequence Problems
Viaarxiv icon

$O(n)$ Connections are Expressive Enough: Universal Approximability of Sparse Transformers

Add code
Jun 08, 2020
Figure 1 for $O(n)$ Connections are Expressive Enough: Universal Approximability of Sparse Transformers
Figure 2 for $O(n)$ Connections are Expressive Enough: Universal Approximability of Sparse Transformers
Viaarxiv icon

Does label smoothing mitigate label noise?

Add code
Mar 05, 2020
Figure 1 for Does label smoothing mitigate label noise?
Figure 2 for Does label smoothing mitigate label noise?
Figure 3 for Does label smoothing mitigate label noise?
Figure 4 for Does label smoothing mitigate label noise?
Viaarxiv icon

Low-Rank Bottleneck in Multi-head Attention Models

Add code
Feb 17, 2020
Figure 1 for Low-Rank Bottleneck in Multi-head Attention Models
Figure 2 for Low-Rank Bottleneck in Multi-head Attention Models
Figure 3 for Low-Rank Bottleneck in Multi-head Attention Models
Figure 4 for Low-Rank Bottleneck in Multi-head Attention Models
Viaarxiv icon

Are Transformers universal approximators of sequence-to-sequence functions?

Add code
Dec 20, 2019
Figure 1 for Are Transformers universal approximators of sequence-to-sequence functions?
Figure 2 for Are Transformers universal approximators of sequence-to-sequence functions?
Viaarxiv icon

Stabilizing GAN Training with Multiple Random Projections

Add code
Jun 23, 2018
Figure 1 for Stabilizing GAN Training with Multiple Random Projections
Figure 2 for Stabilizing GAN Training with Multiple Random Projections
Figure 3 for Stabilizing GAN Training with Multiple Random Projections
Figure 4 for Stabilizing GAN Training with Multiple Random Projections
Viaarxiv icon

Towards Understanding the Role of Over-Parametrization in Generalization of Neural Networks

Add code
May 30, 2018
Figure 1 for Towards Understanding the Role of Over-Parametrization in Generalization of Neural Networks
Figure 2 for Towards Understanding the Role of Over-Parametrization in Generalization of Neural Networks
Figure 3 for Towards Understanding the Role of Over-Parametrization in Generalization of Neural Networks
Figure 4 for Towards Understanding the Role of Over-Parametrization in Generalization of Neural Networks
Viaarxiv icon

Smoothed analysis for low-rank solutions to semidefinite programs in quadratic penalty form

Add code
Mar 01, 2018
Viaarxiv icon