Picture for Shuming Ma

Shuming Ma

A Bilingual Parallel Corpus with Discourse Annotations

Add code
Oct 26, 2022
Figure 1 for A Bilingual Parallel Corpus with Discourse Annotations
Figure 2 for A Bilingual Parallel Corpus with Discourse Annotations
Figure 3 for A Bilingual Parallel Corpus with Discourse Annotations
Figure 4 for A Bilingual Parallel Corpus with Discourse Annotations
Viaarxiv icon

Foundation Transformers

Add code
Oct 19, 2022
Figure 1 for Foundation Transformers
Figure 2 for Foundation Transformers
Figure 3 for Foundation Transformers
Figure 4 for Foundation Transformers
Viaarxiv icon

CROP: Zero-shot Cross-lingual Named Entity Recognition with Multilingual Labeled Sequence Translation

Add code
Oct 13, 2022
Figure 1 for CROP: Zero-shot Cross-lingual Named Entity Recognition with Multilingual Labeled Sequence Translation
Figure 2 for CROP: Zero-shot Cross-lingual Named Entity Recognition with Multilingual Labeled Sequence Translation
Figure 3 for CROP: Zero-shot Cross-lingual Named Entity Recognition with Multilingual Labeled Sequence Translation
Figure 4 for CROP: Zero-shot Cross-lingual Named Entity Recognition with Multilingual Labeled Sequence Translation
Viaarxiv icon

Multilingual Transitivity and Bidirectional Multilingual Agreement for Multilingual Document-level Machine Translation

Add code
Oct 05, 2022
Figure 1 for Multilingual Transitivity and Bidirectional Multilingual Agreement for Multilingual Document-level Machine Translation
Figure 2 for Multilingual Transitivity and Bidirectional Multilingual Agreement for Multilingual Document-level Machine Translation
Figure 3 for Multilingual Transitivity and Bidirectional Multilingual Agreement for Multilingual Document-level Machine Translation
Figure 4 for Multilingual Transitivity and Bidirectional Multilingual Agreement for Multilingual Document-level Machine Translation
Viaarxiv icon

GTrans: Grouping and Fusing Transformer Layers for Neural Machine Translation

Add code
Jul 29, 2022
Figure 1 for GTrans: Grouping and Fusing Transformer Layers for Neural Machine Translation
Figure 2 for GTrans: Grouping and Fusing Transformer Layers for Neural Machine Translation
Figure 3 for GTrans: Grouping and Fusing Transformer Layers for Neural Machine Translation
Figure 4 for GTrans: Grouping and Fusing Transformer Layers for Neural Machine Translation
Viaarxiv icon

HLT-MT: High-resource Language-specific Training for Multilingual Neural Machine Translation

Add code
Jul 15, 2022
Figure 1 for HLT-MT: High-resource Language-specific Training for Multilingual Neural Machine Translation
Figure 2 for HLT-MT: High-resource Language-specific Training for Multilingual Neural Machine Translation
Figure 3 for HLT-MT: High-resource Language-specific Training for Multilingual Neural Machine Translation
Figure 4 for HLT-MT: High-resource Language-specific Training for Multilingual Neural Machine Translation
Viaarxiv icon

UM4: Unified Multilingual Multiple Teacher-Student Model for Zero-Resource Neural Machine Translation

Add code
Jul 11, 2022
Figure 1 for UM4: Unified Multilingual Multiple Teacher-Student Model for Zero-Resource Neural Machine Translation
Figure 2 for UM4: Unified Multilingual Multiple Teacher-Student Model for Zero-Resource Neural Machine Translation
Figure 3 for UM4: Unified Multilingual Multiple Teacher-Student Model for Zero-Resource Neural Machine Translation
Figure 4 for UM4: Unified Multilingual Multiple Teacher-Student Model for Zero-Resource Neural Machine Translation
Viaarxiv icon

Language Models are General-Purpose Interfaces

Add code
Jun 13, 2022
Figure 1 for Language Models are General-Purpose Interfaces
Figure 2 for Language Models are General-Purpose Interfaces
Figure 3 for Language Models are General-Purpose Interfaces
Figure 4 for Language Models are General-Purpose Interfaces
Viaarxiv icon

On the Representation Collapse of Sparse Mixture of Experts

Add code
Apr 20, 2022
Figure 1 for On the Representation Collapse of Sparse Mixture of Experts
Figure 2 for On the Representation Collapse of Sparse Mixture of Experts
Figure 3 for On the Representation Collapse of Sparse Mixture of Experts
Figure 4 for On the Representation Collapse of Sparse Mixture of Experts
Viaarxiv icon

StableMoE: Stable Routing Strategy for Mixture of Experts

Add code
Apr 18, 2022
Figure 1 for StableMoE: Stable Routing Strategy for Mixture of Experts
Figure 2 for StableMoE: Stable Routing Strategy for Mixture of Experts
Figure 3 for StableMoE: Stable Routing Strategy for Mixture of Experts
Figure 4 for StableMoE: Stable Routing Strategy for Mixture of Experts
Viaarxiv icon