Maxim Krikun

GSPMD: General and Scalable Parallelization for ML Computation Graphs

May 10, 2021

GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding

Jun 30, 2020

Massively Multilingual Neural Machine Translation in the Wild: Findings and Challenges

Jul 11, 2019

Reinforcement Learning based Curriculum Optimization for Neural Machine Translation

Feb 28, 2019

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling

Feb 21, 2019

Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation

Aug 21, 2017

Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation

Oct 08, 2016