Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Enhancing Speech-to-Speech Translation with Multiple TTS Targets


Apr 10, 2023
Jiatong Shi, Yun Tang, Ann Lee, Hirofumi Inaguma, Changhan Wang, Juan Pino, Shinji Watanabe

Add code


   Access Paper or Ask Questions

MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation


Mar 07, 2023
Mohamed Anwar, Bowen Shi, Vedanuj Goswami, Wei-Ning Hsu, Juan Pino, Changhan Wang

Add code


   Access Paper or Ask Questions

Pre-training for Speech Translation: CTC Meets Optimal Transport


Jan 27, 2023
Phuong-Hang Le, Hongyu Gong, Changhan Wang, Juan Pino, Benjamin Lecouteux, Didier Schwab

Add code


   Access Paper or Ask Questions

A Holistic Cascade System, benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation


Jan 25, 2023
Wen-Chin Huang, Benjamin Peloquin, Justine Kao, Changhan Wang, Hongyu Gong, Elizabeth Salesky, Yossi Adi, Ann Lee, Peng-Jen Chen

Add code

* This is the full version of our submission to ICASSP 2023 

   Access Paper or Ask Questions

UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units


Dec 15, 2022
Hirofumi Inaguma, Sravya Popuri, Ilia Kulikov, Peng-Jen Chen, Changhan Wang, Yu-An Chung, Yun Tang, Ann Lee, Shinji Watanabe, Juan Pino

Add code

* Early draft. Work in progress 

   Access Paper or Ask Questions

Introducing Semantics into Speech Encoders


Nov 15, 2022
Derek Xu, Shuyan Dong, Changhan Wang, Suyoun Kim, Zhaojiang Lin, Akshat Shrivastava, Shang-Wen Li, Liang-Hsuan Tseng, Alexei Baevski, Guan-Ting Lin, Hung-yi Lee, Yizhou Sun, Wei Wang

Add code

* 11 pages, 3 figures 

   Access Paper or Ask Questions

Speech-to-Speech Translation For A Real-world Unwritten Language


Nov 11, 2022
Peng-Jen Chen, Kevin Tran, Yilin Yang, Jingfei Du, Justine Kao, Yu-An Chung, Paden Tomasello, Paul-Ambroise Duquenne, Holger Schwenk, Hongyu Gong, Hirofumi Inaguma, Sravya Popuri, Changhan Wang, Juan Pino, Wei-Ning Hsu, Ann Lee

Add code


   Access Paper or Ask Questions

SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations


Nov 08, 2022
Paul-Ambroise Duquenne, Hongyu Gong, Ning Dong, Jingfei Du, Ann Lee, Vedanuj Goswani, Changhan Wang, Juan Pino, Benoît Sagot, Holger Schwenk

Add code

* 18 pages 

   Access Paper or Ask Questions

Improving Speech-to-Speech Translation Through Unlabeled Text


Oct 26, 2022
Xuan-Phi Nguyen, Sravya Popuri, Changhan Wang, Yun Tang, Ilia Kulikov, Hongyu Gong

Add code


   Access Paper or Ask Questions

1
2
3
4
5
>>