Rico Sennrich

Root Mean Square Layer Normalization

Oct 16, 2019

Context-Aware Monolingual Repair for Neural Machine Translation

Oct 15, 2019

The Bottom-up Evolution of Representations in the Transformer: A Study with Machine Translation and Language Modeling Objectives

Sep 03, 2019

Encoders Help You Disambiguate Word Senses in Neural Machine Translation

Aug 30, 2019

Improving Deep Transformer with Depth-Scaled Initialization and Merged Attention

Aug 29, 2019

Understanding Neural Machine Translation by Simplification: The Case of Encoder-free Models

Jul 18, 2019

Widening the Representation Bottleneck in Neural Machine Translation with Lexical Shortcuts

Jun 28, 2019

Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, the Rest Can Be Pruned

Jun 07, 2019

A Lightweight Recurrent Network for Sequence Modeling

May 30, 2019

Revisiting Low-Resource Neural Machine Translation: A Case Study

May 28, 2019