"speech": models, code, and papers

XAI-Increment: A Novel Approach Leveraging LIME Explanations for Improved Incremental Learning

Nov 02, 2022
Arnab Neelim Mazumder, Niall Lyons, Anand Dubey, Ashutosh Pandey, Avik Santra

Via arXiv

Internal Language Model Estimation based Adaptive Language Model Fusion for Domain Adaptation

Nov 02, 2022
Rao Ma, Xiaobo Wu, Jin Qiu, Yanan Qin, Haihua Xu, Peihao Wu, Zejun Ma

Via arXiv

End-to-End Evaluation of a Spoken Dialogue System for Learning Basic Mathematics

Nov 07, 2022
Eda Okur, Saurav Sahay, Roddy Fuentes Alba, Lama Nachman

Via arXiv

Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification

Aug 05, 2021
Yidi Jiang, Bidisha Sharma, Maulik Madhavi, Haizhou Li

Via arXiv

Probing phoneme, language and speaker information in unsupervised speech representations

Mar 30, 2022
Maureen de Seyssel, Marvin Lavechin, Yossi Adi, Emmanuel Dupoux, Guillaume Wisniewski

Via arXiv

BirdSoundsDenoising: Deep Visual Audio Denoising for Bird Sounds

Oct 18, 2022
Youshan Zhang, Jialu Li

Via arXiv

AequeVox: Automated Fairness Testing of Speech Recognition Systems

Oct 19, 2021
Sai Sathiesh Rajan, Sakshi Udeshi, Sudipta Chattopadhyay

Via arXiv

Speech Representations and Phoneme Classification for Preserving the Endangered Language of Ladin

Aug 27, 2021
Zane Durante, Leena Mathur, Eric Ye, Sichong Zhao, Tejas Ramdas, Khalil Iskarous

Via arXiv

Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance

Oct 27, 2022
Yuanzhe Chen, Ming Tu, Tang Li, Xin Li, Qiuqiang Kong, Jiaxin Li, Zhichao Wang, Qiao Tian, Yuping Wang, Yuxuan Wang

Via arXiv

Acoustical Analysis of Speech Under Physical Stress in Relation to Physical Activities and Physical Literacy

Nov 20, 2021
Si-Ioi Ng, Rui-Si Ma, Tan Lee, Raymond Kim-Wai Sum

Via arXiv