Alert button
Picture for Sashank J. Reddi

Sashank J. Reddi

Alert button

Why distillation helps: a statistical perspective

Add code
Bookmark button
Alert button
May 21, 2020
Aditya Krishna Menon, Ankit Singh Rawat, Sashank J. Reddi, Seungyeon Kim, Sanjiv Kumar

Figure 1 for Why distillation helps: a statistical perspective
Figure 2 for Why distillation helps: a statistical perspective
Figure 3 for Why distillation helps: a statistical perspective
Figure 4 for Why distillation helps: a statistical perspective
Viaarxiv icon

Doubly-stochastic mining for heterogeneous retrieval

Add code
Bookmark button
Alert button
Apr 23, 2020
Ankit Singh Rawat, Aditya Krishna Menon, Andreas Veit, Felix Yu, Sashank J. Reddi, Sanjiv Kumar

Figure 1 for Doubly-stochastic mining for heterogeneous retrieval
Figure 2 for Doubly-stochastic mining for heterogeneous retrieval
Figure 3 for Doubly-stochastic mining for heterogeneous retrieval
Figure 4 for Doubly-stochastic mining for heterogeneous retrieval
Viaarxiv icon

Low-Rank Bottleneck in Multi-head Attention Models

Add code
Bookmark button
Alert button
Feb 17, 2020
Srinadh Bhojanapalli, Chulhee Yun, Ankit Singh Rawat, Sashank J. Reddi, Sanjiv Kumar

Figure 1 for Low-Rank Bottleneck in Multi-head Attention Models
Figure 2 for Low-Rank Bottleneck in Multi-head Attention Models
Figure 3 for Low-Rank Bottleneck in Multi-head Attention Models
Figure 4 for Low-Rank Bottleneck in Multi-head Attention Models
Viaarxiv icon

Are Transformers universal approximators of sequence-to-sequence functions?

Add code
Bookmark button
Alert button
Dec 20, 2019
Chulhee Yun, Srinadh Bhojanapalli, Ankit Singh Rawat, Sashank J. Reddi, Sanjiv Kumar

Figure 1 for Are Transformers universal approximators of sequence-to-sequence functions?
Figure 2 for Are Transformers universal approximators of sequence-to-sequence functions?
Viaarxiv icon

SCAFFOLD: Stochastic Controlled Averaging for On-Device Federated Learning

Add code
Bookmark button
Alert button
Oct 14, 2019
Sai Praneeth Karimireddy, Satyen Kale, Mehryar Mohri, Sashank J. Reddi, Sebastian U. Stich, Ananda Theertha Suresh

Figure 1 for SCAFFOLD: Stochastic Controlled Averaging for On-Device Federated Learning
Figure 2 for SCAFFOLD: Stochastic Controlled Averaging for On-Device Federated Learning
Figure 3 for SCAFFOLD: Stochastic Controlled Averaging for On-Device Federated Learning
Figure 4 for SCAFFOLD: Stochastic Controlled Averaging for On-Device Federated Learning
Viaarxiv icon

AdaCliP: Adaptive Clipping for Private SGD

Add code
Bookmark button
Alert button
Aug 20, 2019
Venkatadheeraj Pichapati, Ananda Theertha Suresh, Felix X. Yu, Sashank J. Reddi, Sanjiv Kumar

Figure 1 for AdaCliP: Adaptive Clipping for Private SGD
Figure 2 for AdaCliP: Adaptive Clipping for Private SGD
Figure 3 for AdaCliP: Adaptive Clipping for Private SGD
Figure 4 for AdaCliP: Adaptive Clipping for Private SGD
Viaarxiv icon

On the Convergence of Adam and Beyond

Add code
Bookmark button
Alert button
Apr 19, 2019
Sashank J. Reddi, Satyen Kale, Sanjiv Kumar

Figure 1 for On the Convergence of Adam and Beyond
Figure 2 for On the Convergence of Adam and Beyond
Viaarxiv icon

Escaping Saddle Points with Adaptive Gradient Methods

Add code
Bookmark button
Alert button
Jan 26, 2019
Matthew Staib, Sashank J. Reddi, Satyen Kale, Sanjiv Kumar, Suvrit Sra

Figure 1 for Escaping Saddle Points with Adaptive Gradient Methods
Figure 2 for Escaping Saddle Points with Adaptive Gradient Methods
Viaarxiv icon

Stochastic Negative Mining for Learning with Large Output Spaces

Add code
Bookmark button
Alert button
Oct 16, 2018
Sashank J. Reddi, Satyen Kale, Felix Yu, Dan Holtmann-Rice, Jiecao Chen, Sanjiv Kumar

Figure 1 for Stochastic Negative Mining for Learning with Large Output Spaces
Figure 2 for Stochastic Negative Mining for Learning with Large Output Spaces
Figure 3 for Stochastic Negative Mining for Learning with Large Output Spaces
Viaarxiv icon

Riemannian SVRG: Fast Stochastic Optimization on Riemannian Manifolds

Add code
Bookmark button
Alert button
Apr 07, 2017
Hongyi Zhang, Sashank J. Reddi, Suvrit Sra

Figure 1 for Riemannian SVRG: Fast Stochastic Optimization on Riemannian Manifolds
Figure 2 for Riemannian SVRG: Fast Stochastic Optimization on Riemannian Manifolds
Figure 3 for Riemannian SVRG: Fast Stochastic Optimization on Riemannian Manifolds
Viaarxiv icon