Alert button

"speech recognition": models, code, and papers
Alert button

Server-side Rescoring of Spoken Entity-centric Knowledge Queries for Virtual Assistants

Nov 02, 2023
Youyuan Zhang, Sashank Gondala, Thiago Fraga-Silva, Christophe Van Gysel

Viaarxiv icon

Improved Child Text-to-Speech Synthesis through Fastpitch-based Transfer Learning

Add code
Bookmark button
Alert button
Nov 07, 2023
Rishabh Jain, Peter Corcoran

Viaarxiv icon

Churn Prediction via Multimodal Fusion Learning:Integrating Customer Financial Literacy, Voice, and Behavioral Data

Dec 03, 2023
David Hason Rudd, Huan Huo, Md Rafiqul Islam, Guandong Xu

Figure 1 for Churn Prediction via Multimodal Fusion Learning:Integrating Customer Financial Literacy, Voice, and Behavioral Data
Figure 2 for Churn Prediction via Multimodal Fusion Learning:Integrating Customer Financial Literacy, Voice, and Behavioral Data
Figure 3 for Churn Prediction via Multimodal Fusion Learning:Integrating Customer Financial Literacy, Voice, and Behavioral Data
Figure 4 for Churn Prediction via Multimodal Fusion Learning:Integrating Customer Financial Literacy, Voice, and Behavioral Data
Viaarxiv icon

RescueSpeech: A German Corpus for Speech Recognition in Search and Rescue Domain

Add code
Bookmark button
Alert button
Jun 06, 2023
Sangeet Sagar, Mirco Ravanelli, Bernd Kiefer, Ivana Kruijff Korbayova, Josef van Genabith

Figure 1 for RescueSpeech: A German Corpus for Speech Recognition in Search and Rescue Domain
Figure 2 for RescueSpeech: A German Corpus for Speech Recognition in Search and Rescue Domain
Figure 3 for RescueSpeech: A German Corpus for Speech Recognition in Search and Rescue Domain
Figure 4 for RescueSpeech: A German Corpus for Speech Recognition in Search and Rescue Domain
Viaarxiv icon

Identifying Planetary Names in Astronomy Papers: A Multi-Step Approach

Dec 17, 2023
Golnaz Shapurian, Michael J Kurtz, Alberto Accomazzi

Viaarxiv icon

Augmenty: A Python Library for Structured Text Augmentation

Dec 09, 2023
Kenneth Enevoldsen

Viaarxiv icon

Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation

Oct 23, 2023
Sara Papi, Peidong Wang, Junkun Chen, Jian Xue, Naoyuki Kanda, Jinyu Li, Yashesh Gaur

Viaarxiv icon

A Change of Heart: Improving Speech Emotion Recognition through Speech-to-Text Modality Conversion

Add code
Bookmark button
Alert button
Jul 21, 2023
Zeinab Sadat Taghavi, Ali Satvaty, Hossein Sameti

Viaarxiv icon

Use of Speech Impairment Severity for Dysarthric Speech Recognition

May 18, 2023
Mengzhe Geng, Zengrui Jin, Tianzi Wang, Shujie Hu, Jiajun Deng, Mingyu Cui, Guinan Li, Jianwei Yu, Xurong Xie, Xunying Liu

Figure 1 for Use of Speech Impairment Severity for Dysarthric Speech Recognition
Figure 2 for Use of Speech Impairment Severity for Dysarthric Speech Recognition
Figure 3 for Use of Speech Impairment Severity for Dysarthric Speech Recognition
Figure 4 for Use of Speech Impairment Severity for Dysarthric Speech Recognition
Viaarxiv icon

Towards training Bilingual and Code-Switched Speech Recognition models from Monolingual data sources

Add code
Bookmark button
Alert button
Jun 14, 2023
Kunal Dhawan, Dima Rekesh, Boris Ginsburg

Figure 1 for Towards training Bilingual and Code-Switched Speech Recognition models from Monolingual data sources
Figure 2 for Towards training Bilingual and Code-Switched Speech Recognition models from Monolingual data sources
Figure 3 for Towards training Bilingual and Code-Switched Speech Recognition models from Monolingual data sources
Figure 4 for Towards training Bilingual and Code-Switched Speech Recognition models from Monolingual data sources
Viaarxiv icon