Lukasz Kaiser

Training Verifiers to Solve Math Word Problems

Nov 18, 2021
Via arXiv

Evaluating Large Language Models Trained on Code

Jul 14, 2021
Via arXiv

Rethinking Attention with Performers

Sep 30, 2020
Via arXiv

Parallel Scheduled Sampling

Jun 11, 2019
Via arXiv

Sample Efficient Text Summarization Using a Single Pre-Trained Transformer

May 21, 2019
Via arXiv

Model-Based Reinforcement Learning for Atari

Mar 05, 2019
Via arXiv

Area Attention

Oct 30, 2018
Via arXiv

Generating Wikipedia by Summarizing Long Sequences

Jan 30, 2018
Via arXiv

Unsupervised Cipher Cracking Using Discrete GANs

Jan 15, 2018
Via arXiv

Attention Is All You Need

Dec 06, 2017
Via arXiv