ESPnet-ONNX: Bridging a Gap Between Research and Production


Sep 20, 2022
Masao Someki, Yosuke Higuchi, Tomoki Hayashi, Shinji Watanabe

* Accepted to APSIPA ASC 2022 

Deep Speech Synthesis from Articulatory Representations


Sep 13, 2022
Peter Wu, Shinji Watanabe, Louis Goldstein, Alan W Black, Gopala K. Anumanchipalli


TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation


Sep 08, 2022
Zhong-Qiu Wang, Samuele Cornell, Shukjae Choi, Younglo Lee, Byeong-Yeol Kim, Shinji Watanabe

* in submission 

ASR2K: Speech Recognition for Around 2000 Languages without Audio


Sep 06, 2022
Xinjian Li, Florian Metze, David R Mortensen, Alan W Black, Shinji Watanabe

* INTERSPEECH 2022 

VQ-T: RNN Transducers using Vector-Quantized Prediction Network States


Aug 03, 2022
Jiatong Shi, George Saon, David Haws, Shinji Watanabe, Brian Kingsbury

* Interspeech 2022 accepted paper 

When Is TTS Augmentation Through a Pivot Language Useful?


Jul 20, 2022
Nathaniel Robinson, Perez Ogayo, Swetha Gangu, David R. Mortensen, Shinji Watanabe


ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding


Jul 19, 2022
Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang, Samuele Cornell, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler, Zhong-Qiu Wang, Yu Tsao, Yanmin Qian, Shinji Watanabe

* To appear in Interspeech 2022 

Two-Pass Low Latency End-to-End Spoken Language Understanding


Jul 14, 2022
Siddhant Arora, Siddharth Dalmia, Xuankai Chang, Brian Yan, Alan Black, Shinji Watanabe

* INTERSPEECH 2022 

Improving Speech Enhancement through Fine-Grained Speech Characteristics


Jul 11, 2022
Muqiao Yang, Joseph Konan, David Bick, Anurag Kumar, Shinji Watanabe, Bhiksha Raj

* Accepted at InterSpeech 2022 

Online Continual Learning of End-to-End Speech Recognition Models


Jul 11, 2022
Muqiao Yang, Ian Lane, Shinji Watanabe

* Accepted at InterSpeech 2022 
