"speech recognition": models, code, and papers

Improving Distinction between ASR Errors and Speech Disfluencies with Feature Space Interpolation

Aug 04, 2021
Seongmin Park, Dongchan Shin, Sangyoun Paik, Subong Choi, Alena Kazakova, Jihwa Lee

Bounds on mutual information of mixture data for classification tasks

Jan 27, 2021
Yijun Ding, Amit Ashok

SEP-28k: A Dataset for Stuttering Event Detection From Podcasts With People Who Stutter

Feb 24, 2021
Colin Lea, Vikramjit Mitra, Aparna Joshi, Sachin Kajarekar, Jeffrey P. Bigham

User-Initiated Repetition-Based Recovery in Multi-Utterance Dialogue Systems

Aug 02, 2021
Hoang Long Nguyen, Vincent Renkens, Joris Pelemans, Srividya Pranavi Potharaju, Anil Kumar Nalamalapu, Murat Akbacak

KARI: KAnari/QCRI's End-to-End systems for the INTERSPEECH 2021 Indian Languages Code-Switching Challenge

Jun 10, 2021
Amir Hussein, Shammur Chowdhury, Ahmed Ali

TENET: A Time-reversal Enhancement Network for Noise-robust ASR

Jul 08, 2021
Fu-An Chao, Shao-Wei Fan Jiang, Bi-Cheng Yan, Jeih-weih Hung, Berlin Chen

Discriminative Self-training for Punctuation Prediction

Apr 21, 2021
Qian Chen, Wen Wang, Mengzhe Chen, Qinglin Zhang

Similarity-and-Independence-Aware Beamformer with Iterative Casting and Boost Start for Target Source Extraction Using Reference

Oct 18, 2021
Atsuo Hiroe

A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio

Jul 06, 2021
Naoyuki Kanda, Xiong Xiao, Jian Wu, Tianyan Zhou, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka

Mitigating Noisy Inputs for Question Answering

Aug 08, 2019
Denis Peskov, Joe Barrow, Pedro Rodriguez, Graham Neubig, Jordan Boyd-Graber
