Picture for Lukasz Kaiser

Lukasz Kaiser

Tony

Generating Wikipedia by Summarizing Long Sequences

Add code
Jan 30, 2018
Figure 1 for Generating Wikipedia by Summarizing Long Sequences
Figure 2 for Generating Wikipedia by Summarizing Long Sequences
Figure 3 for Generating Wikipedia by Summarizing Long Sequences
Figure 4 for Generating Wikipedia by Summarizing Long Sequences
Viaarxiv icon

Unsupervised Cipher Cracking Using Discrete GANs

Add code
Jan 15, 2018
Figure 1 for Unsupervised Cipher Cracking Using Discrete GANs
Figure 2 for Unsupervised Cipher Cracking Using Discrete GANs
Figure 3 for Unsupervised Cipher Cracking Using Discrete GANs
Figure 4 for Unsupervised Cipher Cracking Using Discrete GANs
Viaarxiv icon

Attention Is All You Need

Add code
Dec 06, 2017
Figure 1 for Attention Is All You Need
Figure 2 for Attention Is All You Need
Figure 3 for Attention Is All You Need
Figure 4 for Attention Is All You Need
Viaarxiv icon

One Model To Learn Them All

Add code
Jun 16, 2017
Figure 1 for One Model To Learn Them All
Figure 2 for One Model To Learn Them All
Figure 3 for One Model To Learn Them All
Figure 4 for One Model To Learn Them All
Viaarxiv icon

Depthwise Separable Convolutions for Neural Machine Translation

Add code
Jun 16, 2017
Figure 1 for Depthwise Separable Convolutions for Neural Machine Translation
Figure 2 for Depthwise Separable Convolutions for Neural Machine Translation
Figure 3 for Depthwise Separable Convolutions for Neural Machine Translation
Figure 4 for Depthwise Separable Convolutions for Neural Machine Translation
Viaarxiv icon

TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems

Add code
Mar 16, 2016
Figure 1 for TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
Figure 2 for TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
Figure 3 for TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
Figure 4 for TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
Viaarxiv icon

Multi-task Sequence to Sequence Learning

Add code
Mar 01, 2016
Figure 1 for Multi-task Sequence to Sequence Learning
Figure 2 for Multi-task Sequence to Sequence Learning
Figure 3 for Multi-task Sequence to Sequence Learning
Figure 4 for Multi-task Sequence to Sequence Learning
Viaarxiv icon

Adding Gradient Noise Improves Learning for Very Deep Networks

Add code
Nov 21, 2015
Figure 1 for Adding Gradient Noise Improves Learning for Very Deep Networks
Figure 2 for Adding Gradient Noise Improves Learning for Very Deep Networks
Figure 3 for Adding Gradient Noise Improves Learning for Very Deep Networks
Figure 4 for Adding Gradient Noise Improves Learning for Very Deep Networks
Viaarxiv icon

Grammar as a Foreign Language

Add code
Jun 09, 2015
Figure 1 for Grammar as a Foreign Language
Figure 2 for Grammar as a Foreign Language
Figure 3 for Grammar as a Foreign Language
Figure 4 for Grammar as a Foreign Language
Viaarxiv icon