Naman Goyal

On the Role of Bidirectionality in Language Model Pre-Training

May 24, 2022
Mikel Artetxe, Jingfei Du, Naman Goyal, Luke Zettlemoyer, Ves Stoyanov

Lifting the Curse of Multilinguality by Pre-training Modular Transformers

May 12, 2022
Jonas Pfeiffer, Naman Goyal, Xi Victoria Lin, Xian Li, James Cross, Sebastian Riedel, Mikel Artetxe

OPT: Open Pre-trained Transformer Language Models

May 05, 2022
Susan Zhang, Stephen Roller, Naman Goyal, Mikel Artetxe, Moya Chen, Shuohui Chen, Christopher Dewan, Mona Diab, Xian Li, Xi Victoria Lin, Todor Mihaylov, Myle Ott, Sam Shleifer, Kurt Shuster, Daniel Simig, Punit Singh Koura, Anjali Sridhar, Tianlu Wang, Luke Zettlemoyer

How Robust is Neural Machine Translation to Language Imbalance in Multilingual Tokenizer Training?

Apr 29, 2022
Shiyue Zhang, Vishrav Chaudhary, Naman Goyal, James Cross, Guillaume Wenzek, Mohit Bansal, Francisco Guzman

Graph Neural Networks for Image Classification and Reinforcement Learning using Graph representations

Mar 08, 2022
Naman Goyal, David Steiner

CM3: A Causal Masked Multimodal Model of the Internet

Jan 19, 2022
Armen Aghajanyan, Bernie Huang, Candace Ross, Vladimir Karpukhin, Hu Xu, Naman Goyal, Dmytro Okhonko, Mandar Joshi, Gargi Ghosh, Mike Lewis, Luke Zettlemoyer

Efficient Large Scale Language Modeling with Mixtures of Experts

Dec 20, 2021
Mikel Artetxe, Shruti Bhosale, Naman Goyal, Todor Mihaylov, Myle Ott, Sam Shleifer, Xi Victoria Lin, Jingfei Du, Srinivasan Iyer, Ramakanth Pasunuru, Giri Anantharaman, Xian Li, Shuohui Chen, Halil Akin, Mandeep Baines, Louis Martin, Xing Zhou, Punit Singh Koura, Brian O'Horo, Jeff Wang, Luke Zettlemoyer, Mona Diab, Zornitsa Kozareva, Ves Stoyanov

Few-shot Learning with Multilingual Language Models

Dec 20, 2021
Xi Victoria Lin, Todor Mihaylov, Mikel Artetxe, Tianlu Wang, Shuohui Chen, Daniel Simig, Myle Ott, Naman Goyal, Shruti Bhosale, Jingfei Du, Ramakanth Pasunuru, Sam Shleifer, Punit Singh Koura, Vishrav Chaudhary, Brian O'Horo, Jeff Wang, Luke Zettlemoyer, Zornitsa Kozareva, Mona Diab, Veselin Stoyanov, Xian Li

XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale

Nov 19, 2021
Arun Babu, Changhan Wang, Andros Tjandra, Kushal Lakhotia, Qiantong Xu, Naman Goyal, Kritika Singh, Patrick von Platen, Yatharth Saraf, Juan Pino, Alexei Baevski, Alexis Conneau, Michael Auli
