Alert button
Picture for Sergey Edunov

Sergey Edunov

Alert button

NLLB Team

Effective Long-Context Scaling of Foundation Models

Add code
Bookmark button
Alert button
Sep 27, 2023
Wenhan Xiong, Jingyu Liu, Igor Molybog, Hejia Zhang, Prajjwal Bhargava, Rui Hou, Louis Martin, Rashi Rungta, Karthik Abinav Sankararaman, Barlas Oguz, Madian Khabsa, Han Fang, Yashar Mehdad, Sharan Narang, Kshitiz Malik, Angela Fan, Shruti Bhosale, Sergey Edunov, Mike Lewis, Sinong Wang, Hao Ma

Figure 1 for Effective Long-Context Scaling of Foundation Models
Figure 2 for Effective Long-Context Scaling of Foundation Models
Figure 3 for Effective Long-Context Scaling of Foundation Models
Figure 4 for Effective Long-Context Scaling of Foundation Models
Viaarxiv icon

Llama 2: Open Foundation and Fine-Tuned Chat Models

Add code
Bookmark button
Alert button
Jul 19, 2023
Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra, Prajjwal Bhargava, Shruti Bhosale, Dan Bikel, Lukas Blecher, Cristian Canton Ferrer, Moya Chen, Guillem Cucurull, David Esiobu, Jude Fernandes, Jeremy Fu, Wenyin Fu, Brian Fuller, Cynthia Gao, Vedanuj Goswami, Naman Goyal, Anthony Hartshorn, Saghar Hosseini, Rui Hou, Hakan Inan, Marcin Kardas, Viktor Kerkez, Madian Khabsa, Isabel Kloumann, Artem Korenev, Punit Singh Koura, Marie-Anne Lachaux, Thibaut Lavril, Jenya Lee, Diana Liskovich, Yinghai Lu, Yuning Mao, Xavier Martinet, Todor Mihaylov, Pushkar Mishra, Igor Molybog, Yixin Nie, Andrew Poulton, Jeremy Reizenstein, Rashi Rungta, Kalyan Saladi, Alan Schelten, Ruan Silva, Eric Michael Smith, Ranjan Subramanian, Xiaoqing Ellen Tan, Binh Tang, Ross Taylor, Adina Williams, Jian Xiang Kuan, Puxin Xu, Zheng Yan, Iliyan Zarov, Yuchen Zhang, Angela Fan, Melanie Kambadur, Sharan Narang, Aurelien Rodriguez, Robert Stojnic, Sergey Edunov, Thomas Scialom

Figure 1 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Figure 2 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Figure 3 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Figure 4 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Viaarxiv icon

No Language Left Behind: Scaling Human-Centered Machine Translation

Add code
Bookmark button
Alert button
Jul 11, 2022
NLLB team, Marta R. Costa-jussà, James Cross, Onur Çelebi, Maha Elbayad, Kenneth Heafield, Kevin Heffernan, Elahe Kalbassi, Janice Lam, Daniel Licht, Jean Maillard, Anna Sun, Skyler Wang, Guillaume Wenzek, Al Youngblood, Bapi Akula, Loic Barrault, Gabriel Mejia Gonzalez, Prangthip Hansanti, John Hoffman, Semarley Jarrett, Kaushik Ram Sadagopan, Dirk Rowe, Shannon Spruit, Chau Tran, Pierre Andrews, Necip Fazil Ayan, Shruti Bhosale, Sergey Edunov, Angela Fan, Cynthia Gao, Vedanuj Goswami, Francisco Guzmán, Philipp Koehn, Alexandre Mourachko, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Jeff Wang

Figure 1 for No Language Left Behind: Scaling Human-Centered Machine Translation
Figure 2 for No Language Left Behind: Scaling Human-Centered Machine Translation
Figure 3 for No Language Left Behind: Scaling Human-Centered Machine Translation
Figure 4 for No Language Left Behind: Scaling Human-Centered Machine Translation
Viaarxiv icon

LegoNN: Building Modular Encoder-Decoder Models

Add code
Bookmark button
Alert button
Jun 07, 2022
Siddharth Dalmia, Dmytro Okhonko, Mike Lewis, Sergey Edunov, Shinji Watanabe, Florian Metze, Luke Zettlemoyer, Abdelrahman Mohamed

Figure 1 for LegoNN: Building Modular Encoder-Decoder Models
Figure 2 for LegoNN: Building Modular Encoder-Decoder Models
Figure 3 for LegoNN: Building Modular Encoder-Decoder Models
Figure 4 for LegoNN: Building Modular Encoder-Decoder Models
Viaarxiv icon

Facebook AI WMT21 News Translation Task Submission

Add code
Bookmark button
Alert button
Aug 06, 2021
Chau Tran, Shruti Bhosale, James Cross, Philipp Koehn, Sergey Edunov, Angela Fan

Figure 1 for Facebook AI WMT21 News Translation Task Submission
Figure 2 for Facebook AI WMT21 News Translation Task Submission
Figure 3 for Facebook AI WMT21 News Translation Task Submission
Figure 4 for Facebook AI WMT21 News Translation Task Submission
Viaarxiv icon

A Comparison of Approaches to Document-level Machine Translation

Add code
Bookmark button
Alert button
Jan 26, 2021
Zhiyi Ma, Sergey Edunov, Michael Auli

Figure 1 for A Comparison of Approaches to Document-level Machine Translation
Figure 2 for A Comparison of Approaches to Document-level Machine Translation
Figure 3 for A Comparison of Approaches to Document-level Machine Translation
Figure 4 for A Comparison of Approaches to Document-level Machine Translation
Viaarxiv icon

Language Models not just for Pre-training: Fast Online Neural Noisy Channel Modeling

Add code
Bookmark button
Alert button
Nov 13, 2020
Shruti Bhosale, Kyra Yee, Sergey Edunov, Michael Auli

Figure 1 for Language Models not just for Pre-training: Fast Online Neural Noisy Channel Modeling
Figure 2 for Language Models not just for Pre-training: Fast Online Neural Noisy Channel Modeling
Figure 3 for Language Models not just for Pre-training: Fast Online Neural Noisy Channel Modeling
Figure 4 for Language Models not just for Pre-training: Fast Online Neural Noisy Channel Modeling
Viaarxiv icon

Beyond English-Centric Multilingual Machine Translation

Add code
Bookmark button
Alert button
Oct 21, 2020
Angela Fan, Shruti Bhosale, Holger Schwenk, Zhiyi Ma, Ahmed El-Kishky, Siddharth Goyal, Mandeep Baines, Onur Celebi, Guillaume Wenzek, Vishrav Chaudhary, Naman Goyal, Tom Birch, Vitaliy Liptchinsky, Sergey Edunov, Edouard Grave, Michael Auli, Armand Joulin

Figure 1 for Beyond English-Centric Multilingual Machine Translation
Figure 2 for Beyond English-Centric Multilingual Machine Translation
Figure 3 for Beyond English-Centric Multilingual Machine Translation
Figure 4 for Beyond English-Centric Multilingual Machine Translation
Viaarxiv icon

Large scale weakly and semi-supervised learning for low-resource video ASR

Add code
Bookmark button
Alert button
May 16, 2020
Kritika Singh, Vimal Manohar, Alex Xiao, Sergey Edunov, Ross Girshick, Vitaliy Liptchinsky, Christian Fuegen, Yatharth Saraf, Geoffrey Zweig, Abdelrahman Mohamed

Figure 1 for Large scale weakly and semi-supervised learning for low-resource video ASR
Figure 2 for Large scale weakly and semi-supervised learning for low-resource video ASR
Figure 3 for Large scale weakly and semi-supervised learning for low-resource video ASR
Viaarxiv icon

Dense Passage Retrieval for Open-Domain Question Answering

Add code
Bookmark button
Alert button
May 02, 2020
Vladimir Karpukhin, Barlas Oğuz, Sewon Min, Patrick Lewis, Ledell Wu, Sergey Edunov, Danqi Chen, Wen-tau Yih

Figure 1 for Dense Passage Retrieval for Open-Domain Question Answering
Figure 2 for Dense Passage Retrieval for Open-Domain Question Answering
Figure 3 for Dense Passage Retrieval for Open-Domain Question Answering
Figure 4 for Dense Passage Retrieval for Open-Domain Question Answering
Viaarxiv icon