Alert button
Picture for Orhan Firat

Orhan Firat

Alert button

GLaM: Efficient Scaling of Language Models with Mixture-of-Experts

Add code
Bookmark button
Alert button
Dec 13, 2021
Nan Du, Yanping Huang, Andrew M. Dai, Simon Tong, Dmitry Lepikhin, Yuanzhong Xu, Maxim Krikun, Yanqi Zhou, Adams Wei Yu, Orhan Firat, Barret Zoph, Liam Fedus, Maarten Bosma, Zongwei Zhou, Tao Wang, Yu Emma Wang, Kellie Webster, Marie Pellat, Kevin Robinson, Kathy Meier-Hellstern, Toju Duke, Lucas Dixon, Kun Zhang, Quoc V Le, Yonghui Wu, Zhifeng Chen, Claire Cui

Figure 1 for GLaM: Efficient Scaling of Language Models with Mixture-of-Experts
Figure 2 for GLaM: Efficient Scaling of Language Models with Mixture-of-Experts
Figure 3 for GLaM: Efficient Scaling of Language Models with Mixture-of-Experts
Figure 4 for GLaM: Efficient Scaling of Language Models with Mixture-of-Experts
Viaarxiv icon

A Loss Curvature Perspective on Training Instability in Deep Learning

Add code
Bookmark button
Alert button
Oct 08, 2021
Justin Gilmer, Behrooz Ghorbani, Ankush Garg, Sneha Kudugunta, Behnam Neyshabur, David Cardoze, George Dahl, Zachary Nado, Orhan Firat

Figure 1 for A Loss Curvature Perspective on Training Instability in Deep Learning
Figure 2 for A Loss Curvature Perspective on Training Instability in Deep Learning
Figure 3 for A Loss Curvature Perspective on Training Instability in Deep Learning
Figure 4 for A Loss Curvature Perspective on Training Instability in Deep Learning
Viaarxiv icon

Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference

Add code
Bookmark button
Alert button
Sep 24, 2021
Sneha Kudugunta, Yanping Huang, Ankur Bapna, Maxim Krikun, Dmitry Lepikhin, Minh-Thang Luong, Orhan Firat

Figure 1 for Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference
Figure 2 for Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference
Figure 3 for Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference
Figure 4 for Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference
Viaarxiv icon

Multilingual Document-Level Translation Enables Zero-Shot Transfer From Sentences to Documents

Add code
Bookmark button
Alert button
Sep 21, 2021
Biao Zhang, Ankur Bapna, Melvin Johnson, Ali Dabirmoghaddam, Naveen Arivazhagan, Orhan Firat

Figure 1 for Multilingual Document-Level Translation Enables Zero-Shot Transfer From Sentences to Documents
Figure 2 for Multilingual Document-Level Translation Enables Zero-Shot Transfer From Sentences to Documents
Figure 3 for Multilingual Document-Level Translation Enables Zero-Shot Transfer From Sentences to Documents
Figure 4 for Multilingual Document-Level Translation Enables Zero-Shot Transfer From Sentences to Documents
Viaarxiv icon

Towards Zero-Label Language Learning

Add code
Bookmark button
Alert button
Sep 19, 2021
Zirui Wang, Adams Wei Yu, Orhan Firat, Yuan Cao

Figure 1 for Towards Zero-Label Language Learning
Figure 2 for Towards Zero-Label Language Learning
Figure 3 for Towards Zero-Label Language Learning
Figure 4 for Towards Zero-Label Language Learning
Viaarxiv icon

Scaling Laws for Neural Machine Translation

Add code
Bookmark button
Alert button
Sep 16, 2021
Behrooz Ghorbani, Orhan Firat, Markus Freitag, Ankur Bapna, Maxim Krikun, Xavier Garcia, Ciprian Chelba, Colin Cherry

Figure 1 for Scaling Laws for Neural Machine Translation
Figure 2 for Scaling Laws for Neural Machine Translation
Figure 3 for Scaling Laws for Neural Machine Translation
Figure 4 for Scaling Laws for Neural Machine Translation
Viaarxiv icon

Evaluating Multiway Multilingual NMT in the Turkic Languages

Add code
Bookmark button
Alert button
Sep 13, 2021
Jamshidbek Mirzakhalov, Anoop Babu, Aigiz Kunafin, Ahsan Wahab, Behzod Moydinboyev, Sardana Ivanova, Mokhiyakhon Uzokova, Shaxnoza Pulatova, Duygu Ataman, Julia Kreutzer, Francis Tyers, Orhan Firat, John Licato, Sriram Chellappan

Figure 1 for Evaluating Multiway Multilingual NMT in the Turkic Languages
Figure 2 for Evaluating Multiway Multilingual NMT in the Turkic Languages
Figure 3 for Evaluating Multiway Multilingual NMT in the Turkic Languages
Figure 4 for Evaluating Multiway Multilingual NMT in the Turkic Languages
Viaarxiv icon

A Large-Scale Study of Machine Translation in the Turkic Languages

Add code
Bookmark button
Alert button
Sep 09, 2021
Jamshidbek Mirzakhalov, Anoop Babu, Duygu Ataman, Sherzod Kariev, Francis Tyers, Otabek Abduraufov, Mammad Hajili, Sardana Ivanova, Abror Khaytbaev, Antonio Laverghetta Jr., Behzodbek Moydinboyev, Esra Onal, Shaxnoza Pulatova, Ahsan Wahab, Orhan Firat, Sriram Chellappan

Figure 1 for A Large-Scale Study of Machine Translation in the Turkic Languages
Figure 2 for A Large-Scale Study of Machine Translation in the Turkic Languages
Figure 3 for A Large-Scale Study of Machine Translation in the Turkic Languages
Figure 4 for A Large-Scale Study of Machine Translation in the Turkic Languages
Viaarxiv icon

Towards Universality in Multilingual Text Rewriting

Add code
Bookmark button
Alert button
Jul 30, 2021
Xavier Garcia, Noah Constant, Mandy Guo, Orhan Firat

Figure 1 for Towards Universality in Multilingual Text Rewriting
Figure 2 for Towards Universality in Multilingual Text Rewriting
Figure 3 for Towards Universality in Multilingual Text Rewriting
Figure 4 for Towards Universality in Multilingual Text Rewriting
Viaarxiv icon

XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation

Add code
Bookmark button
Alert button
Apr 15, 2021
Sebastian Ruder, Noah Constant, Jan Botha, Aditya Siddhant, Orhan Firat, Jinlan Fu, Pengfei Liu, Junjie Hu, Graham Neubig, Melvin Johnson

Figure 1 for XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation
Figure 2 for XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation
Figure 3 for XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation
Figure 4 for XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation
Viaarxiv icon