Picture for Ashish Vaswani

Ashish Vaswani

Attention Augmented Convolutional Networks

Add code
Apr 22, 2019
Figure 1 for Attention Augmented Convolutional Networks
Figure 2 for Attention Augmented Convolutional Networks
Figure 3 for Attention Augmented Convolutional Networks
Figure 4 for Attention Augmented Convolutional Networks
Viaarxiv icon

Mesh-TensorFlow: Deep Learning for Supercomputers

Add code
Nov 05, 2018
Figure 1 for Mesh-TensorFlow: Deep Learning for Supercomputers
Figure 2 for Mesh-TensorFlow: Deep Learning for Supercomputers
Figure 3 for Mesh-TensorFlow: Deep Learning for Supercomputers
Viaarxiv icon

Relational inductive biases, deep learning, and graph networks

Add code
Oct 17, 2018
Figure 1 for Relational inductive biases, deep learning, and graph networks
Figure 2 for Relational inductive biases, deep learning, and graph networks
Figure 3 for Relational inductive biases, deep learning, and graph networks
Figure 4 for Relational inductive biases, deep learning, and graph networks
Viaarxiv icon

Music Transformer

Add code
Oct 10, 2018
Figure 1 for Music Transformer
Figure 2 for Music Transformer
Figure 3 for Music Transformer
Figure 4 for Music Transformer
Viaarxiv icon

Theory and Experiments on Vector Quantized Autoencoders

Add code
Jul 20, 2018
Figure 1 for Theory and Experiments on Vector Quantized Autoencoders
Figure 2 for Theory and Experiments on Vector Quantized Autoencoders
Figure 3 for Theory and Experiments on Vector Quantized Autoencoders
Figure 4 for Theory and Experiments on Vector Quantized Autoencoders
Viaarxiv icon

Image Transformer

Add code
Jun 15, 2018
Figure 1 for Image Transformer
Figure 2 for Image Transformer
Figure 3 for Image Transformer
Figure 4 for Image Transformer
Viaarxiv icon

Fast Decoding in Sequence Models using Discrete Latent Variables

Add code
Jun 07, 2018
Figure 1 for Fast Decoding in Sequence Models using Discrete Latent Variables
Figure 2 for Fast Decoding in Sequence Models using Discrete Latent Variables
Figure 3 for Fast Decoding in Sequence Models using Discrete Latent Variables
Figure 4 for Fast Decoding in Sequence Models using Discrete Latent Variables
Viaarxiv icon

Self-Attention with Relative Position Representations

Add code
Apr 12, 2018
Figure 1 for Self-Attention with Relative Position Representations
Figure 2 for Self-Attention with Relative Position Representations
Figure 3 for Self-Attention with Relative Position Representations
Figure 4 for Self-Attention with Relative Position Representations
Viaarxiv icon

Tensor2Tensor for Neural Machine Translation

Add code
Mar 16, 2018
Figure 1 for Tensor2Tensor for Neural Machine Translation
Viaarxiv icon

Attention Is All You Need

Add code
Dec 06, 2017
Figure 1 for Attention Is All You Need
Figure 2 for Attention Is All You Need
Figure 3 for Attention Is All You Need
Figure 4 for Attention Is All You Need
Viaarxiv icon