"speech": models, code, and papers

Multitask-Based Joint Learning Approach To Robust ASR For Radio Communication Speech

Jul 22, 2021
Duo Ma, Nana Hou, Van Tung Pham, Haihua Xu, Eng Siong Chng

Improving Accent Identification and Accented Speech Recognition Under a Framework of Self-supervised Learning

Sep 15, 2021
Keqi Deng, Songjun Cao, Long Ma

A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation

Oct 11, 2021
Yosuke Higuchi, Nanxin Chen, Yuya Fujita, Hirofumi Inaguma, Tatsuya Komatsu, Jaesong Lee, Jumon Nozaki, Tianzi Wang, Shinji Watanabe

Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions

Nov 18, 2021
Chunxi Liu, Michael Picheny, Leda Sarı, Pooja Chitkara, Alex Xiao, Xiaohui Zhang, Mark Chou, Andres Alvarado, Caner Hazirbas, Yatharth Saraf

Multimodal Transformer Distillation for Audio-Visual Synchronization

Oct 27, 2022
Xuanjun Chen, Haibin Wu, Chung-Che Wang, Hung-yi Lee, Jyh-Shing Roger Jang

Opening the Black Box of wav2vec Feature Encoder

Oct 27, 2022
Kwanghee Choi, Eun Jung Yeo

Convolutive Prediction for Monaural Speech Dereverberation and Noisy-Reverberant Speaker Separation

Aug 16, 2021
Zhong-Qiu Wang, Gordon Wichern, Jonathan Le Roux

One to rule them all: Towards Joint Indic Language Hate Speech Detection

Sep 28, 2021
Mehar Bhatia, Tenzin Singhay Bhotia, Akshat Agarwal, Prakash Ramesh, Shubham Gupta, Kumar Shridhar, Felix Laumann, Ayushman Dash

WeKws: A production first small-footprint end-to-end Keyword Spotting Toolkit

Oct 30, 2022
Jie Wang, Menglong Xu, Jingyong Hou, Binbin Zhang, Xiao-Lei Zhang, Lei Xie, Fuping Pan

DuDe: Dual-Decoder Multilingual ASR for Indian Languages using Common Label Set

Oct 30, 2022
Arunkumar A, Mudit Batra, Umesh S
