Picture for Sanjiv Kumar

Sanjiv Kumar

Google Research

Are Transformers universal approximators of sequence-to-sequence functions?

Add code
Dec 20, 2019
Figure 1 for Are Transformers universal approximators of sequence-to-sequence functions?
Figure 2 for Are Transformers universal approximators of sequence-to-sequence functions?
Viaarxiv icon

Why ADAM Beats SGD for Attention Models

Add code
Dec 06, 2019
Figure 1 for Why ADAM Beats SGD for Attention Models
Figure 2 for Why ADAM Beats SGD for Attention Models
Figure 3 for Why ADAM Beats SGD for Attention Models
Figure 4 for Why ADAM Beats SGD for Attention Models
Viaarxiv icon

Learning to Learn by Zeroth-Order Oracle

Add code
Oct 21, 2019
Figure 1 for Learning to Learn by Zeroth-Order Oracle
Figure 2 for Learning to Learn by Zeroth-Order Oracle
Figure 3 for Learning to Learn by Zeroth-Order Oracle
Figure 4 for Learning to Learn by Zeroth-Order Oracle
Viaarxiv icon

Online Hierarchical Clustering Approximations

Add code
Sep 20, 2019
Figure 1 for Online Hierarchical Clustering Approximations
Figure 2 for Online Hierarchical Clustering Approximations
Figure 3 for Online Hierarchical Clustering Approximations
Figure 4 for Online Hierarchical Clustering Approximations
Viaarxiv icon

New Loss Functions for Fast Maximum Inner Product Search

Add code
Sep 11, 2019
Figure 1 for New Loss Functions for Fast Maximum Inner Product Search
Figure 2 for New Loss Functions for Fast Maximum Inner Product Search
Viaarxiv icon

AdaCliP: Adaptive Clipping for Private SGD

Add code
Aug 20, 2019
Figure 1 for AdaCliP: Adaptive Clipping for Private SGD
Figure 2 for AdaCliP: Adaptive Clipping for Private SGD
Figure 3 for AdaCliP: Adaptive Clipping for Private SGD
Figure 4 for AdaCliP: Adaptive Clipping for Private SGD
Viaarxiv icon

Sampled Softmax with Random Fourier Features

Add code
Jul 24, 2019
Figure 1 for Sampled Softmax with Random Fourier Features
Figure 2 for Sampled Softmax with Random Fourier Features
Figure 3 for Sampled Softmax with Random Fourier Features
Figure 4 for Sampled Softmax with Random Fourier Features
Viaarxiv icon

Neural SDE: Stabilizing Neural ODE Networks with Stochastic Noise

Add code
Jun 05, 2019
Figure 1 for Neural SDE: Stabilizing Neural ODE Networks with Stochastic Noise
Figure 2 for Neural SDE: Stabilizing Neural ODE Networks with Stochastic Noise
Figure 3 for Neural SDE: Stabilizing Neural ODE Networks with Stochastic Noise
Figure 4 for Neural SDE: Stabilizing Neural ODE Networks with Stochastic Noise
Viaarxiv icon

On the Convergence of Adam and Beyond

Add code
Apr 19, 2019
Figure 1 for On the Convergence of Adam and Beyond
Figure 2 for On the Convergence of Adam and Beyond
Viaarxiv icon

Local Orthogonal Decomposition for Maximum Inner Product Search

Add code
Mar 25, 2019
Figure 1 for Local Orthogonal Decomposition for Maximum Inner Product Search
Figure 2 for Local Orthogonal Decomposition for Maximum Inner Product Search
Figure 3 for Local Orthogonal Decomposition for Maximum Inner Product Search
Figure 4 for Local Orthogonal Decomposition for Maximum Inner Product Search
Viaarxiv icon