Alert button
Picture for Shinji Watanabe

Shinji Watanabe

Alert button

UNSSOR: Unsupervised Neural Speech Separation by Leveraging Over-determined Training Mixtures

Add code
Bookmark button
Alert button
May 31, 2023
Zhong-Qiu Wang, Shinji Watanabe

Figure 1 for UNSSOR: Unsupervised Neural Speech Separation by Leveraging Over-determined Training Mixtures
Figure 2 for UNSSOR: Unsupervised Neural Speech Separation by Leveraging Over-determined Training Mixtures
Figure 3 for UNSSOR: Unsupervised Neural Speech Separation by Leveraging Over-determined Training Mixtures
Figure 4 for UNSSOR: Unsupervised Neural Speech Separation by Leveraging Over-determined Training Mixtures
Viaarxiv icon

Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning

Add code
Bookmark button
Alert button
May 29, 2023
Xuankai Chang, Brian Yan, Yuya Fujita, Takashi Maekaku, Shinji Watanabe

Figure 1 for Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning
Figure 2 for Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning
Figure 3 for Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning
Figure 4 for Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning
Viaarxiv icon

DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models

Add code
Bookmark button
Alert button
May 28, 2023
Yifan Peng, Yui Sudo, Shakeel Muhammad, Shinji Watanabe

Figure 1 for DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models
Figure 2 for DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models
Figure 3 for DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models
Viaarxiv icon

A New Benchmark of Aphasia Speech Recognition and Detection Based on E-Branchformer and Multi-task Learning

Add code
Bookmark button
Alert button
May 19, 2023
Jiyang Tang, William Chen, Xuankai Chang, Shinji Watanabe, Brian MacWhinney

Figure 1 for A New Benchmark of Aphasia Speech Recognition and Detection Based on E-Branchformer and Multi-task Learning
Figure 2 for A New Benchmark of Aphasia Speech Recognition and Detection Based on E-Branchformer and Multi-task Learning
Figure 3 for A New Benchmark of Aphasia Speech Recognition and Detection Based on E-Branchformer and Multi-task Learning
Figure 4 for A New Benchmark of Aphasia Speech Recognition and Detection Based on E-Branchformer and Multi-task Learning
Viaarxiv icon

Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization

Add code
Bookmark button
Alert button
May 18, 2023
Puyuan Peng, Brian Yan, Shinji Watanabe, David Harwath

Figure 1 for Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization
Figure 2 for Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization
Figure 3 for Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization
Figure 4 for Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization
Viaarxiv icon

A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks

Add code
Bookmark button
Alert button
May 18, 2023
Yifan Peng, Kwangyoun Kim, Felix Wu, Brian Yan, Siddhant Arora, William Chen, Jiyang Tang, Suwon Shon, Prashant Sridhar, Shinji Watanabe

Figure 1 for A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks
Figure 2 for A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks
Figure 3 for A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks
Figure 4 for A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks
Viaarxiv icon

ML-SUPERB: Multilingual Speech Universal PERformance Benchmark

Add code
Bookmark button
Alert button
May 18, 2023
Jiatong Shi, Dan Berrebbi, William Chen, Ho-Lam Chung, En-Pei Hu, Wei Ping Huang, Xuankai Chang, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Shinji Watanabe

Figure 1 for ML-SUPERB: Multilingual Speech Universal PERformance Benchmark
Figure 2 for ML-SUPERB: Multilingual Speech Universal PERformance Benchmark
Figure 3 for ML-SUPERB: Multilingual Speech Universal PERformance Benchmark
Viaarxiv icon

Improving Cascaded Unsupervised Speech Translation with Denoising Back-translation

Add code
Bookmark button
Alert button
May 12, 2023
Yu-Kuan Fu, Liang-Hsuan Tseng, Jiatong Shi, Chen-An Li, Tsu-Yuan Hsu, Shinji Watanabe, Hung-yi Lee

Figure 1 for Improving Cascaded Unsupervised Speech Translation with Denoising Back-translation
Figure 2 for Improving Cascaded Unsupervised Speech Translation with Denoising Back-translation
Figure 3 for Improving Cascaded Unsupervised Speech Translation with Denoising Back-translation
Figure 4 for Improving Cascaded Unsupervised Speech Translation with Denoising Back-translation
Viaarxiv icon

The Pipeline System of ASR and NLU with MLM-based Data Augmentation toward STOP Low-resource Challenge

Add code
Bookmark button
Alert button
May 11, 2023
Hayato Futami, Jessica Huynh, Siddhant Arora, Shih-Lun Wu, Yosuke Kashiwagi, Yifan Peng, Brian Yan, Emiru Tsunoo, Shinji Watanabe

Figure 1 for The Pipeline System of ASR and NLU with MLM-based Data Augmentation toward STOP Low-resource Challenge
Figure 2 for The Pipeline System of ASR and NLU with MLM-based Data Augmentation toward STOP Low-resource Challenge
Figure 3 for The Pipeline System of ASR and NLU with MLM-based Data Augmentation toward STOP Low-resource Challenge
Figure 4 for The Pipeline System of ASR and NLU with MLM-based Data Augmentation toward STOP Low-resource Challenge
Viaarxiv icon

A Study on the Integration of Pipeline and E2E SLU systems for Spoken Semantic Parsing toward STOP Quality Challenge

Add code
Bookmark button
Alert button
May 06, 2023
Siddhant Arora, Hayato Futami, Shih-Lun Wu, Jessica Huynh, Yifan Peng, Yosuke Kashiwagi, Emiru Tsunoo, Brian Yan, Shinji Watanabe

Figure 1 for A Study on the Integration of Pipeline and E2E SLU systems for Spoken Semantic Parsing toward STOP Quality Challenge
Figure 2 for A Study on the Integration of Pipeline and E2E SLU systems for Spoken Semantic Parsing toward STOP Quality Challenge
Figure 3 for A Study on the Integration of Pipeline and E2E SLU systems for Spoken Semantic Parsing toward STOP Quality Challenge
Viaarxiv icon