"speech": models, code, and papers

Evaluating Parameter-Efficient Transfer Learning Approaches on SURE Benchmark for Speech Understanding

Mar 02, 2023
Yingting Li, Ambuj Mehrish, Shuai Zhao, Rishabh Bhardwaj, Amir Zadeh, Navonil Majumder, Rada Mihalcea, Soujanya Poria

On the Impact of Voice Anonymization on Speech-Based COVID-19 Detection

Apr 05, 2023
Yi Zhu, Mohamed Imoussaïne-Aïkous, Carolyn Côté-Lussier, Tiago H. Falk

Personalized speech enhancement combining band-split RNN and speaker attentive module

Feb 20, 2023
Xiaohuai Le, Zhongshu Hou, Li Chen, Chao He, Yiqing Guo, Cheng Chen, Xianjun Xia, Jing Lu

LipLearner: Customizable Silent Speech Interactions on Mobile Devices

Feb 14, 2023
Zixiong Su, Shitao Fang, Jun Rekimoto

Frequency bin-wise single channel speech presence probability estimation using multiple DNNs

Feb 23, 2023
Shuai Tao, Himavanth Reddy, Jesper Rindom Jensen, Mads Græsbøll Christensen

Robust Acoustic and Semantic Contextual Biasing in Neural Transducers for Speech Recognition

May 09, 2023
Xuandi Fu, Kanthashree Mysore Sathyendra, Ankur Gandhe, Jing Liu, Grant P. Strimel, Ross McGowan, Athanasios Mouchtaris

Parmesan: mathematical concept extraction for education

Jul 17, 2023
Jacob Collard, Valeria de Paiva, Eswaran Subrahmanian

End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations

Mar 21, 2023
Giovanni Morrone, Samuele Cornell, Luca Serafini, Enrico Zovato, Alessio Brutti, Stefano Squartini

The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios

Jun 23, 2023
Samuele Cornell, Matthew Wiesner, Shinji Watanabe, Desh Raj, Xuankai Chang, Paola Garcia, Yoshiki Masuyama, Zhong-Qiu Wang, Stefano Squartini, Sanjeev Khudanpur

Ed-Fed: A generic federated learning framework with resource-aware client selection for edge devices

Jul 14, 2023
Zitha Sasindran, Harsha Yelchuri, T. V. Prabhakar
