Alert button

"speech": models, code, and papers
Alert button

Meta-Transfer Learning for Code-Switched Speech Recognition

Add code
Bookmark button
Alert button
Apr 29, 2020
Genta Indra Winata, Samuel Cahyawijaya, Zhaojiang Lin, Zihan Liu, Peng Xu, Pascale Fung

Figure 1 for Meta-Transfer Learning for Code-Switched Speech Recognition
Figure 2 for Meta-Transfer Learning for Code-Switched Speech Recognition
Figure 3 for Meta-Transfer Learning for Code-Switched Speech Recognition
Figure 4 for Meta-Transfer Learning for Code-Switched Speech Recognition
Viaarxiv icon

A Unified Framework for Speech Separation

Dec 17, 2019
Fahimeh Bahmaninezhad, Shi-Xiong Zhang, Yong Xu, Meng Yu, John H. L. Hansen, Dong Yu

Figure 1 for A Unified Framework for Speech Separation
Figure 2 for A Unified Framework for Speech Separation
Figure 3 for A Unified Framework for Speech Separation
Figure 4 for A Unified Framework for Speech Separation
Viaarxiv icon

Beam-Guided TasNet: An Iterative Speech Separation Framework with Multi-Channel Output

Add code
Bookmark button
Alert button
Feb 20, 2021
Hangting Chen, Pengyuan Zhang

Figure 1 for Beam-Guided TasNet: An Iterative Speech Separation Framework with Multi-Channel Output
Figure 2 for Beam-Guided TasNet: An Iterative Speech Separation Framework with Multi-Channel Output
Figure 3 for Beam-Guided TasNet: An Iterative Speech Separation Framework with Multi-Channel Output
Figure 4 for Beam-Guided TasNet: An Iterative Speech Separation Framework with Multi-Channel Output
Viaarxiv icon

Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation

Add code
Bookmark button
Alert button
Apr 13, 2021
Hirofumi Inaguma, Tatsuya Kawahara, Shinji Watanabe

Figure 1 for Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation
Figure 2 for Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation
Figure 3 for Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation
Figure 4 for Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation
Viaarxiv icon

Dereverberation using joint estimation of dry speech signal and acoustic system

Add code
Bookmark button
Alert button
Jul 24, 2020
Sanna Wager, Keunwoo Choi, Simon Durand

Figure 1 for Dereverberation using joint estimation of dry speech signal and acoustic system
Viaarxiv icon

Improving the Naturalness of Simulated Conversations for End-to-End Neural Diarization

Add code
Bookmark button
Alert button
Apr 24, 2022
Natsuo Yamashita, Shota Horiguchi, Takeshi Homma

Figure 1 for Improving the Naturalness of Simulated Conversations for End-to-End Neural Diarization
Figure 2 for Improving the Naturalness of Simulated Conversations for End-to-End Neural Diarization
Figure 3 for Improving the Naturalness of Simulated Conversations for End-to-End Neural Diarization
Figure 4 for Improving the Naturalness of Simulated Conversations for End-to-End Neural Diarization
Viaarxiv icon

Improving Reverberant Speech Separation with Multi-stage Training and Curriculum Learning

Add code
Bookmark button
Alert button
Jul 19, 2021
Rohith Aralikatti, Anton Ratnarajah, Zhenyu Tang, Dinesh Manocha

Figure 1 for Improving Reverberant Speech Separation with Multi-stage Training and Curriculum Learning
Figure 2 for Improving Reverberant Speech Separation with Multi-stage Training and Curriculum Learning
Figure 3 for Improving Reverberant Speech Separation with Multi-stage Training and Curriculum Learning
Figure 4 for Improving Reverberant Speech Separation with Multi-stage Training and Curriculum Learning
Viaarxiv icon

Speech-to-Singing Conversion based on Boundary Equilibrium GAN

Add code
Bookmark button
Alert button
May 28, 2020
Da-Yi Wu, Yi-Hsuan Yang

Figure 1 for Speech-to-Singing Conversion based on Boundary Equilibrium GAN
Figure 2 for Speech-to-Singing Conversion based on Boundary Equilibrium GAN
Figure 3 for Speech-to-Singing Conversion based on Boundary Equilibrium GAN
Figure 4 for Speech-to-Singing Conversion based on Boundary Equilibrium GAN
Viaarxiv icon

Interpretable Dysarthric Speaker Adaptation based on Optimal-Transport

Mar 14, 2022
Rosanna Turrisi, Leonardo Badino

Figure 1 for Interpretable Dysarthric Speaker Adaptation based on Optimal-Transport
Figure 2 for Interpretable Dysarthric Speaker Adaptation based on Optimal-Transport
Figure 3 for Interpretable Dysarthric Speaker Adaptation based on Optimal-Transport
Viaarxiv icon

A Light-weight contextual spelling correction model for customizing transducer-based speech recognition systems

Aug 17, 2021
Xiaoqiang Wang, Yanqing Liu, Sheng Zhao, Jinyu Li

Figure 1 for A Light-weight contextual spelling correction model for customizing transducer-based speech recognition systems
Figure 2 for A Light-weight contextual spelling correction model for customizing transducer-based speech recognition systems
Figure 3 for A Light-weight contextual spelling correction model for customizing transducer-based speech recognition systems
Figure 4 for A Light-weight contextual spelling correction model for customizing transducer-based speech recognition systems
Viaarxiv icon