Alert button

"speech recognition": models, code, and papers
Alert button

Pre-trained Speech Processing Models Contain Human-Like Biases that Propagate to Speech Emotion Recognition

Add code
Bookmark button
Alert button
Oct 29, 2023
Isaac Slaughter, Craig Greenberg, Reva Schwartz, Aylin Caliskan

Viaarxiv icon

Combining Language Models For Specialized Domains: A Colorful Approach

Nov 01, 2023
Daniel Eitan, Menachem Pirchi, Neta Glazer, Shai Meital, Gil Ayach, Gidon Krendel, Aviv Shamsian, Aviv Navon, Gil Hetz, Joseph Keshet

Viaarxiv icon

Improved Child Text-to-Speech Synthesis through Fastpitch-based Transfer Learning

Add code
Bookmark button
Alert button
Nov 07, 2023
Rishabh Jain, Peter Corcoran

Viaarxiv icon

DistilWhisper: Efficient Distillation of Multi-task Speech Models via Language-Specific Experts

Nov 02, 2023
Thomas Palmeira Ferraz, Marcely Zanon Boito, Caroline Brun, Vassilina Nikoulina

Viaarxiv icon

Server-side Rescoring of Spoken Entity-centric Knowledge Queries for Virtual Assistants

Nov 02, 2023
Youyuan Zhang, Sashank Gondala, Thiago Fraga-Silva, Christophe Van Gysel

Viaarxiv icon

Careful Whisper -- leveraging advances in automatic speech recognition for robust and interpretable aphasia subtype classification

Aug 02, 2023
Laurin Wagner, Mario Zusag, Theresa Bloder

Figure 1 for Careful Whisper -- leveraging advances in automatic speech recognition for robust and interpretable aphasia subtype classification
Figure 2 for Careful Whisper -- leveraging advances in automatic speech recognition for robust and interpretable aphasia subtype classification
Figure 3 for Careful Whisper -- leveraging advances in automatic speech recognition for robust and interpretable aphasia subtype classification
Figure 4 for Careful Whisper -- leveraging advances in automatic speech recognition for robust and interpretable aphasia subtype classification
Viaarxiv icon

Churn Prediction via Multimodal Fusion Learning:Integrating Customer Financial Literacy, Voice, and Behavioral Data

Dec 03, 2023
David Hason Rudd, Huan Huo, Md Rafiqul Islam, Guandong Xu

Figure 1 for Churn Prediction via Multimodal Fusion Learning:Integrating Customer Financial Literacy, Voice, and Behavioral Data
Figure 2 for Churn Prediction via Multimodal Fusion Learning:Integrating Customer Financial Literacy, Voice, and Behavioral Data
Figure 3 for Churn Prediction via Multimodal Fusion Learning:Integrating Customer Financial Literacy, Voice, and Behavioral Data
Figure 4 for Churn Prediction via Multimodal Fusion Learning:Integrating Customer Financial Literacy, Voice, and Behavioral Data
Viaarxiv icon

Identifying Planetary Names in Astronomy Papers: A Multi-Step Approach

Dec 17, 2023
Golnaz Shapurian, Michael J Kurtz, Alberto Accomazzi

Viaarxiv icon

Augmenty: A Python Library for Structured Text Augmentation

Dec 09, 2023
Kenneth Enevoldsen

Viaarxiv icon

Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation

Oct 23, 2023
Sara Papi, Peidong Wang, Junkun Chen, Jian Xue, Naoyuki Kanda, Jinyu Li, Yashesh Gaur

Viaarxiv icon