Juan Pino

MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation

Mar 07, 2023
Mohamed Anwar, Bowen Shi, Vedanuj Goswami, Wei-Ning Hsu, Juan Pino, Changhan Wang

Pre-training for Speech Translation: CTC Meets Optimal Transport

Jan 27, 2023
Phuong-Hang Le, Hongyu Gong, Changhan Wang, Juan Pino, Benjamin Lecouteux, Didier Schwab

UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units

Dec 15, 2022
Hirofumi Inaguma, Sravya Popuri, Ilia Kulikov, Peng-Jen Chen, Changhan Wang, Yu-An Chung, Yun Tang, Ann Lee, Shinji Watanabe, Juan Pino

Speech-to-Speech Translation For A Real-world Unwritten Language

Nov 11, 2022
Peng-Jen Chen, Kevin Tran, Yilin Yang, Jingfei Du, Justine Kao, Yu-An Chung, Paden Tomasello, Paul-Ambroise Duquenne, Holger Schwenk, Hongyu Gong, Hirofumi Inaguma, Sravya Popuri, Changhan Wang, Juan Pino, Wei-Ning Hsu, Ann Lee

SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations

Nov 08, 2022
Paul-Ambroise Duquenne, Hongyu Gong, Ning Dong, Jingfei Du, Ann Lee, Vedanuj Goswami, Changhan Wang, Juan Pino, Benoît Sagot, Holger Schwenk

Simple and Effective Unsupervised Speech Translation

Oct 18, 2022
Changhan Wang, Hirofumi Inaguma, Peng-Jen Chen, Ilia Kulikov, Yun Tang, Wei-Ning Hsu, Michael Auli, Juan Pino

Unified Speech-Text Pre-training for Speech Translation and Recognition

Apr 11, 2022
Yun Tang, Hongyu Gong, Ning Dong, Changhan Wang, Wei-Ning Hsu, Jiatao Gu, Alexei Baevski, Xian Li, Abdelrahman Mohamed, Michael Auli, Juan Pino

Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation

Apr 06, 2022
Sravya Popuri, Peng-Jen Chen, Changhan Wang, Juan Pino, Yossi Adi, Jiatao Gu, Wei-Ning Hsu, Ann Lee

Textless Speech-to-Speech Translation on Real Data

Dec 15, 2021
Ann Lee, Hongyu Gong, Paul-Ambroise Duquenne, Holger Schwenk, Peng-Jen Chen, Changhan Wang, Sravya Popuri, Juan Pino, Jiatao Gu, Wei-Ning Hsu
