Alert button

"speech recognition": models, code, and papers
Alert button

Disentangleing Content and Fine-grained Prosody Information via Hybrid ASR Bottleneck Features for Voice Conversion

Add code
Bookmark button
Alert button
Mar 24, 2022
Xintao Zhao, Feng Liu, Changhe Song, Zhiyong Wu, Shiyin Kang, Deyi Tuo, Helen Meng

Figure 1 for Disentangleing Content and Fine-grained Prosody Information via Hybrid ASR Bottleneck Features for Voice Conversion
Figure 2 for Disentangleing Content and Fine-grained Prosody Information via Hybrid ASR Bottleneck Features for Voice Conversion
Figure 3 for Disentangleing Content and Fine-grained Prosody Information via Hybrid ASR Bottleneck Features for Voice Conversion
Figure 4 for Disentangleing Content and Fine-grained Prosody Information via Hybrid ASR Bottleneck Features for Voice Conversion
Viaarxiv icon

Study of Semi-supervised Approaches to Improving English-Mandarin Code-Switching Speech Recognition

Jun 16, 2018
Pengcheng Guo, Haihua Xu, Lei Xie, Eng Siong Chng

Figure 1 for Study of Semi-supervised Approaches to Improving English-Mandarin Code-Switching Speech Recognition
Figure 2 for Study of Semi-supervised Approaches to Improving English-Mandarin Code-Switching Speech Recognition
Figure 3 for Study of Semi-supervised Approaches to Improving English-Mandarin Code-Switching Speech Recognition
Figure 4 for Study of Semi-supervised Approaches to Improving English-Mandarin Code-Switching Speech Recognition
Viaarxiv icon

Enabling Deep Learning for All-in EDGE paradigm

Apr 07, 2022
Praveen Joshi, Haithem Afli, Mohammed Hasanuzzaman, Chandra Thapa, Ted Scully

Figure 1 for Enabling Deep Learning for All-in EDGE paradigm
Figure 2 for Enabling Deep Learning for All-in EDGE paradigm
Figure 3 for Enabling Deep Learning for All-in EDGE paradigm
Figure 4 for Enabling Deep Learning for All-in EDGE paradigm
Viaarxiv icon

Creating Speech-to-Speech Corpus from Dubbed Series

Mar 07, 2022
Massa Baali, Wassim El-Hajj, Ahmed Ali

Figure 1 for Creating Speech-to-Speech Corpus from Dubbed Series
Figure 2 for Creating Speech-to-Speech Corpus from Dubbed Series
Figure 3 for Creating Speech-to-Speech Corpus from Dubbed Series
Figure 4 for Creating Speech-to-Speech Corpus from Dubbed Series
Viaarxiv icon

'Beach' to 'Bitch': Inadvertent Unsafe Transcription of Kids' Content on YouTube

Add code
Bookmark button
Alert button
Feb 17, 2022
Krithika Ramesh, Ashiqur R. KhudaBukhsh, Sumeet Kumar

Figure 1 for 'Beach' to 'Bitch': Inadvertent Unsafe Transcription of Kids' Content on YouTube
Figure 2 for 'Beach' to 'Bitch': Inadvertent Unsafe Transcription of Kids' Content on YouTube
Figure 3 for 'Beach' to 'Bitch': Inadvertent Unsafe Transcription of Kids' Content on YouTube
Figure 4 for 'Beach' to 'Bitch': Inadvertent Unsafe Transcription of Kids' Content on YouTube
Viaarxiv icon

Improving End-to-End Models for Set Prediction in Spoken Language Understanding

Jan 28, 2022
Hong-Kwang J. Kuo, Zoltan Tuske, Samuel Thomas, Brian Kingsbury, George Saon

Figure 1 for Improving End-to-End Models for Set Prediction in Spoken Language Understanding
Figure 2 for Improving End-to-End Models for Set Prediction in Spoken Language Understanding
Figure 3 for Improving End-to-End Models for Set Prediction in Spoken Language Understanding
Figure 4 for Improving End-to-End Models for Set Prediction in Spoken Language Understanding
Viaarxiv icon

A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond

Add code
Bookmark button
Alert button
Apr 20, 2022
Yisheng Xiao, Lijun Wu, Junliang Guo, Juntao Li, Min Zhang, Tao Qin, Tie-yan Liu

Figure 1 for A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond
Figure 2 for A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond
Figure 3 for A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond
Figure 4 for A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond
Viaarxiv icon

Closing the Gap between Single-User and Multi-User VoiceFilter-Lite

Feb 24, 2022
Rajeev Rikhye, Quan Wang, Qiao Liang, Yanzhang He, Ian McGraw

Figure 1 for Closing the Gap between Single-User and Multi-User VoiceFilter-Lite
Figure 2 for Closing the Gap between Single-User and Multi-User VoiceFilter-Lite
Figure 3 for Closing the Gap between Single-User and Multi-User VoiceFilter-Lite
Figure 4 for Closing the Gap between Single-User and Multi-User VoiceFilter-Lite
Viaarxiv icon

Bridging the Gap Between Monaural Speech Enhancement and Recognition with Distortion-Independent Acoustic Modeling

Mar 13, 2019
Peidong Wang, Ke Tan, DeLiang Wang

Figure 1 for Bridging the Gap Between Monaural Speech Enhancement and Recognition with Distortion-Independent Acoustic Modeling
Figure 2 for Bridging the Gap Between Monaural Speech Enhancement and Recognition with Distortion-Independent Acoustic Modeling
Figure 3 for Bridging the Gap Between Monaural Speech Enhancement and Recognition with Distortion-Independent Acoustic Modeling
Figure 4 for Bridging the Gap Between Monaural Speech Enhancement and Recognition with Distortion-Independent Acoustic Modeling
Viaarxiv icon

CTC Variations Through New WFST Topologies

Add code
Bookmark button
Alert button
Oct 06, 2021
Aleksandr Laptev, Somshubra Majumdar, Boris Ginsburg

Figure 1 for CTC Variations Through New WFST Topologies
Figure 2 for CTC Variations Through New WFST Topologies
Figure 3 for CTC Variations Through New WFST Topologies
Figure 4 for CTC Variations Through New WFST Topologies
Viaarxiv icon