Ashish Vaswani

Mesh-TensorFlow: Deep Learning for Supercomputers

Nov 05, 2018
Noam Shazeer, Youlong Cheng, Niki Parmar, Dustin Tran, Ashish Vaswani, Penporn Koanantakool, Peter Hawkins, HyoukJoong Lee, Mingsheng Hong, Cliff Young, Ryan Sepassi, Blake Hechtman

Relational inductive biases, deep learning, and graph networks

Oct 17, 2018
Peter W. Battaglia, Jessica B. Hamrick, Victor Bapst, Alvaro Sanchez-Gonzalez, Vinicius Zambaldi, Mateusz Malinowski, Andrea Tacchetti, David Raposo, Adam Santoro, Ryan Faulkner, Caglar Gulcehre, Francis Song, Andrew Ballard, Justin Gilmer, George Dahl, Ashish Vaswani, Kelsey Allen, Charles Nash, Victoria Langston, Chris Dyer, Nicolas Heess, Daan Wierstra, Pushmeet Kohli, Matt Botvinick, Oriol Vinyals, Yujia Li, Razvan Pascanu

Music Transformer

Oct 10, 2018
Cheng-Zhi Anna Huang, Ashish Vaswani, Jakob Uszkoreit, Noam Shazeer, Ian Simon, Curtis Hawthorne, Andrew M. Dai, Matthew D. Hoffman, Monica Dinculescu, Douglas Eck

Theory and Experiments on Vector Quantized Autoencoders

Jul 20, 2018
Aurko Roy, Ashish Vaswani, Arvind Neelakantan, Niki Parmar

Image Transformer

Jun 15, 2018
Niki Parmar, Ashish Vaswani, Jakob Uszkoreit, Łukasz Kaiser, Noam Shazeer, Alexander Ku, Dustin Tran

Fast Decoding in Sequence Models using Discrete Latent Variables

Jun 07, 2018
Łukasz Kaiser, Aurko Roy, Ashish Vaswani, Niki Parmar, Samy Bengio, Jakob Uszkoreit, Noam Shazeer

Self-Attention with Relative Position Representations

Apr 12, 2018
Peter Shaw, Jakob Uszkoreit, Ashish Vaswani

Tensor2Tensor for Neural Machine Translation

Mar 16, 2018
Ashish Vaswani, Samy Bengio, Eugene Brevdo, Francois Chollet, Aidan N. Gomez, Stephan Gouws, Llion Jones, Łukasz Kaiser, Nal Kalchbrenner, Niki Parmar, Ryan Sepassi, Noam Shazeer, Jakob Uszkoreit

Attention Is All You Need

Dec 06, 2017
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, Illia Polosukhin

One Model To Learn Them All

Jun 16, 2017
Łukasz Kaiser, Aidan N. Gomez, Noam Shazeer, Ashish Vaswani, Niki Parmar, Llion Jones, Jakob Uszkoreit
