Sashank J. Reddi

Doubly-stochastic mining for heterogeneous retrieval

Apr 23, 2020

Low-Rank Bottleneck in Multi-head Attention Models

Feb 17, 2020

Are Transformers universal approximators of sequence-to-sequence functions?

Dec 20, 2019

SCAFFOLD: Stochastic Controlled Averaging for On-Device Federated Learning

Oct 14, 2019

AdaCliP: Adaptive Clipping for Private SGD

Aug 20, 2019

On the Convergence of Adam and Beyond

Apr 19, 2019

Escaping Saddle Points with Adaptive Gradient Methods

Jan 26, 2019

Stochastic Negative Mining for Learning with Large Output Spaces

Oct 16, 2018

Riemannian SVRG: Fast Stochastic Optimization on Riemannian Manifolds

Apr 07, 2017

AIDE: Fast and Communication Efficient Distributed Optimization

Aug 24, 2016