Shuming Ma

On the Off-Target Problem of Zero-Shot Multilingual Neural Machine Translation

May 18, 2023
Via arXiv

Discourse Centric Evaluation of Machine Translation with a Densely Annotated Parallel Corpus

May 18, 2023
Via arXiv

On the Pareto Front of Multilingual Neural Machine Translation

Apr 07, 2023
Via arXiv

Language Is Not All You Need: Aligning Perception with Language Models

Mar 01, 2023
Via arXiv

Are More Layers Beneficial to Graph Transformers?

Mar 01, 2023
Via arXiv

HanoiT: Enhancing Context-aware Translation via Selective Context

Jan 17, 2023
Via arXiv

GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator

Dec 20, 2022
Via arXiv

A Length-Extrapolatable Transformer

Dec 20, 2022
Via arXiv

TRIP: Triangular Document-level Pre-training for Multilingual Language Models

Dec 15, 2022
Via arXiv

TorchScale: Transformers at Scale

Nov 23, 2022
Via arXiv