Alert button
Picture for Holger Schwenk

Holger Schwenk

Alert button

Seamless: Multilingual Expressive and Streaming Speech Translation

Dec 08, 2023
Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Mark Duppenthaler, Paul-Ambroise Duquenne, Brian Ellis, Hady Elsahar, Justin Haaheim, John Hoffman, Min-Jae Hwang, Hirofumi Inaguma, Christopher Klaiber, Ilia Kulikov, Pengwei Li, Daniel Licht, Jean Maillard, Ruslan Mavlyutov, Alice Rakotoarison, Kaushik Ram Sadagopan, Abinesh Ramakrishnan, Tuan Tran, Guillaume Wenzek, Yilin Yang, Ethan Ye, Ivan Evtimov, Pierre Fernandez, Cynthia Gao, Prangthip Hansanti, Elahe Kalbassi, Amanda Kallet, Artyom Kozhevnikov, Gabriel Mejia Gonzalez, Robin San Roman, Christophe Touret, Corinne Wong, Carleigh Wood, Bokai Yu, Pierre Andrews, Can Balioglu, Peng-Jen Chen, Marta R. Costa-jussà, Maha Elbayad, Hongyu Gong, Francisco Guzmán, Kevin Heffernan, Somya Jain, Justine Kao, Ann Lee, Xutai Ma, Alex Mourachko, Benjamin Peloquin, Juan Pino, Sravya Popuri, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Anna Sun, Paden Tomasello, Changhan Wang, Jeff Wang, Skyler Wang, Mary Williamson

Figure 1 for Seamless: Multilingual Expressive and Streaming Speech Translation
Figure 2 for Seamless: Multilingual Expressive and Streaming Speech Translation
Figure 3 for Seamless: Multilingual Expressive and Streaming Speech Translation
Figure 4 for Seamless: Multilingual Expressive and Streaming Speech Translation
Viaarxiv icon

Modular Speech-to-Text Translation for Zero-Shot Cross-Modal Transfer

Oct 05, 2023
Paul-Ambroise Duquenne, Holger Schwenk, Benoît Sagot

Figure 1 for Modular Speech-to-Text Translation for Zero-Shot Cross-Modal Transfer
Figure 2 for Modular Speech-to-Text Translation for Zero-Shot Cross-Modal Transfer
Figure 3 for Modular Speech-to-Text Translation for Zero-Shot Cross-Modal Transfer
Figure 4 for Modular Speech-to-Text Translation for Zero-Shot Cross-Modal Transfer
Viaarxiv icon

SeamlessM4T-Massively Multilingual & Multimodal Machine Translation

Aug 23, 2023
Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Cora Meglioli, David Dale, Ning Dong, Paul-Ambroise Duquenne, Hady Elsahar, Hongyu Gong, Kevin Heffernan, John Hoffman, Christopher Klaiber, Pengwei Li, Daniel Licht, Jean Maillard, Alice Rakotoarison, Kaushik Ram Sadagopan, Guillaume Wenzek, Ethan Ye, Bapi Akula, Peng-Jen Chen, Naji El Hachem, Brian Ellis, Gabriel Mejia Gonzalez, Justin Haaheim, Prangthip Hansanti, Russ Howes, Bernie Huang, Min-Jae Hwang, Hirofumi Inaguma, Somya Jain, Elahe Kalbassi, Amanda Kallet, Ilia Kulikov, Janice Lam, Daniel Li, Xutai Ma, Ruslan Mavlyutov, Benjamin Peloquin, Mohamed Ramadan, Abinesh Ramakrishnan, Anna Sun, Kevin Tran, Tuan Tran, Igor Tufanov, Vish Vogeti, Carleigh Wood, Yilin Yang, Bokai Yu, Pierre Andrews, Can Balioglu, Marta R. Costa-jussà, Onur Celebi, Maha Elbayad, Cynthia Gao, Francisco Guzmán, Justine Kao, Ann Lee, Alexandre Mourachko, Juan Pino, Sravya Popuri, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Paden Tomasello, Changhan Wang, Jeff Wang, Skyler Wang

Figure 1 for SeamlessM4T-Massively Multilingual & Multimodal Machine Translation
Figure 2 for SeamlessM4T-Massively Multilingual & Multimodal Machine Translation
Figure 3 for SeamlessM4T-Massively Multilingual & Multimodal Machine Translation
Figure 4 for SeamlessM4T-Massively Multilingual & Multimodal Machine Translation
Viaarxiv icon

SONAR: Sentence-Level Multimodal and Language-Agnostic Representations

Aug 23, 2023
Paul-Ambroise Duquenne, Holger Schwenk, Benoît Sagot

Figure 1 for SONAR: Sentence-Level Multimodal and Language-Agnostic Representations
Figure 2 for SONAR: Sentence-Level Multimodal and Language-Agnostic Representations
Figure 3 for SONAR: Sentence-Level Multimodal and Language-Agnostic Representations
Figure 4 for SONAR: Sentence-Level Multimodal and Language-Agnostic Representations
Viaarxiv icon

xSIM++: An Improved Proxy to Bitext Mining Performance for Low-Resource Languages

Jun 22, 2023
Mingda Chen, Kevin Heffernan, Onur Çelebi, Alex Mourachko, Holger Schwenk

Figure 1 for xSIM++: An Improved Proxy to Bitext Mining Performance for Low-Resource Languages
Figure 2 for xSIM++: An Improved Proxy to Bitext Mining Performance for Low-Resource Languages
Figure 3 for xSIM++: An Improved Proxy to Bitext Mining Performance for Low-Resource Languages
Figure 4 for xSIM++: An Improved Proxy to Bitext Mining Performance for Low-Resource Languages
Viaarxiv icon

BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric

Dec 16, 2022
Mingda Chen, Paul-Ambroise Duquenne, Pierre Andrews, Justine Kao, Alexandre Mourachko, Holger Schwenk, Marta R. Costa-jussà

Figure 1 for BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric
Figure 2 for BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric
Figure 3 for BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric
Figure 4 for BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric
Viaarxiv icon

Speech-to-Speech Translation For A Real-world Unwritten Language

Nov 11, 2022
Peng-Jen Chen, Kevin Tran, Yilin Yang, Jingfei Du, Justine Kao, Yu-An Chung, Paden Tomasello, Paul-Ambroise Duquenne, Holger Schwenk, Hongyu Gong, Hirofumi Inaguma, Sravya Popuri, Changhan Wang, Juan Pino, Wei-Ning Hsu, Ann Lee

Figure 1 for Speech-to-Speech Translation For A Real-world Unwritten Language
Figure 2 for Speech-to-Speech Translation For A Real-world Unwritten Language
Figure 3 for Speech-to-Speech Translation For A Real-world Unwritten Language
Figure 4 for Speech-to-Speech Translation For A Real-world Unwritten Language
Viaarxiv icon

SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations

Nov 08, 2022
Paul-Ambroise Duquenne, Hongyu Gong, Ning Dong, Jingfei Du, Ann Lee, Vedanuj Goswani, Changhan Wang, Juan Pino, Benoît Sagot, Holger Schwenk

Figure 1 for SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations
Figure 2 for SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations
Figure 3 for SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations
Figure 4 for SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations
Viaarxiv icon

DiffEdit: Diffusion-based semantic image editing with mask guidance

Oct 20, 2022
Guillaume Couairon, Jakob Verbeek, Holger Schwenk, Matthieu Cord

Figure 1 for DiffEdit: Diffusion-based semantic image editing with mask guidance
Figure 2 for DiffEdit: Diffusion-based semantic image editing with mask guidance
Figure 3 for DiffEdit: Diffusion-based semantic image editing with mask guidance
Figure 4 for DiffEdit: Diffusion-based semantic image editing with mask guidance
Viaarxiv icon

Multilingual Representation Distillation with Contrastive Learning

Oct 10, 2022
Weiting Tan, Kevin Heffernan, Holger Schwenk, Philipp Koehn

Figure 1 for Multilingual Representation Distillation with Contrastive Learning
Figure 2 for Multilingual Representation Distillation with Contrastive Learning
Figure 3 for Multilingual Representation Distillation with Contrastive Learning
Figure 4 for Multilingual Representation Distillation with Contrastive Learning
Viaarxiv icon