Lukasz Kaiser

Attention Is All You Need

Dec 06, 2017
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin

One Model To Learn Them All

Jun 16, 2017
Lukasz Kaiser, Aidan N. Gomez, Noam Shazeer, Ashish Vaswani, Niki Parmar, Llion Jones, Jakob Uszkoreit

Depthwise Separable Convolutions for Neural Machine Translation

Jun 16, 2017
Lukasz Kaiser, Aidan N. Gomez, Francois Chollet

TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems

Mar 16, 2016
Martín Abadi, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Greg S. Corrado, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Ian Goodfellow, Andrew Harp, Geoffrey Irving, Michael Isard, Yangqing Jia, Rafal Jozefowicz, Lukasz Kaiser, Manjunath Kudlur, Josh Levenberg, Dan Mane, Rajat Monga, Sherry Moore, Derek Murray, Chris Olah, Mike Schuster, Jonathon Shlens, Benoit Steiner, Ilya Sutskever, Kunal Talwar, Paul Tucker, Vincent Vanhoucke, Vijay Vasudevan, Fernanda Viegas, Oriol Vinyals, Pete Warden, Martin Wattenberg, Martin Wicke, Yuan Yu, Xiaoqiang Zheng

Multi-task Sequence to Sequence Learning

Mar 01, 2016
Minh-Thang Luong, Quoc V. Le, Ilya Sutskever, Oriol Vinyals, Lukasz Kaiser

Adding Gradient Noise Improves Learning for Very Deep Networks

Nov 21, 2015
Arvind Neelakantan, Luke Vilnis, Quoc V. Le, Ilya Sutskever, Lukasz Kaiser, Karol Kurach, James Martens

Grammar as a Foreign Language

Jun 09, 2015
Oriol Vinyals, Lukasz Kaiser, Terry Koo, Slav Petrov, Ilya Sutskever, Geoffrey Hinton
