Alert button

"speech recognition": models, code, and papers
Alert button

Fine-grained Generalization Analysis of Structured Output Prediction

May 31, 2021
Waleed Mustafa, Yunwen Lei, Antoine Ledent, Marius Kloft

Figure 1 for Fine-grained Generalization Analysis of Structured Output Prediction
Viaarxiv icon

Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction

Apr 26, 2021
David Qiu, Yanzhang He, Qiujia Li, Yu Zhang, Liangliang Cao, Ian McGraw

Figure 1 for Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction
Figure 2 for Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction
Figure 3 for Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction
Figure 4 for Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction
Viaarxiv icon

RyanSpeech: A Corpus for Conversational Text-to-Speech Synthesis

Jun 15, 2021
Rohola Zandie, Mohammad H. Mahoor, Julia Madsen, Eshrat S. Emamian

Figure 1 for RyanSpeech: A Corpus for Conversational Text-to-Speech Synthesis
Figure 2 for RyanSpeech: A Corpus for Conversational Text-to-Speech Synthesis
Figure 3 for RyanSpeech: A Corpus for Conversational Text-to-Speech Synthesis
Figure 4 for RyanSpeech: A Corpus for Conversational Text-to-Speech Synthesis
Viaarxiv icon

Fast Development of ASR in African Languages using Self Supervised Speech Representation Learning

Mar 16, 2021
Jama Hussein Mohamud, Lloyd Acquaye Thompson, Aissatou Ndoye, Laurent Besacier

Figure 1 for Fast Development of ASR in African Languages using Self Supervised Speech Representation Learning
Figure 2 for Fast Development of ASR in African Languages using Self Supervised Speech Representation Learning
Figure 3 for Fast Development of ASR in African Languages using Self Supervised Speech Representation Learning
Figure 4 for Fast Development of ASR in African Languages using Self Supervised Speech Representation Learning
Viaarxiv icon

"Notic My Speech" -- Blending Speech Patterns With Multimedia

Jun 12, 2020
Dhruva Sahrawat, Yaman Kumar, Shashwat Aggarwal, Yifang Yin, Rajiv Ratn Shah, Roger Zimmermann

Figure 1 for "Notic My Speech" -- Blending Speech Patterns With Multimedia
Figure 2 for "Notic My Speech" -- Blending Speech Patterns With Multimedia
Figure 3 for "Notic My Speech" -- Blending Speech Patterns With Multimedia
Figure 4 for "Notic My Speech" -- Blending Speech Patterns With Multimedia
Viaarxiv icon

Integrating Recurrence Dynamics for Speech Emotion Recognition

Nov 09, 2018
Efthymios Tzinis, Georgios Paraskevopoulos, Christos Baziotis, Alexandros Potamianos

Figure 1 for Integrating Recurrence Dynamics for Speech Emotion Recognition
Figure 2 for Integrating Recurrence Dynamics for Speech Emotion Recognition
Figure 3 for Integrating Recurrence Dynamics for Speech Emotion Recognition
Figure 4 for Integrating Recurrence Dynamics for Speech Emotion Recognition
Viaarxiv icon

Low Resource German ASR with Untranscribed Data Spoken by Non-native Children -- INTERSPEECH 2021 Shared Task SPAPL System

Jun 18, 2021
Jinhan Wang, Yunzheng Zhu, Ruchao Fan, Wei Chu, Abeer Alwan

Figure 1 for Low Resource German ASR with Untranscribed Data Spoken by Non-native Children -- INTERSPEECH 2021 Shared Task SPAPL System
Figure 2 for Low Resource German ASR with Untranscribed Data Spoken by Non-native Children -- INTERSPEECH 2021 Shared Task SPAPL System
Figure 3 for Low Resource German ASR with Untranscribed Data Spoken by Non-native Children -- INTERSPEECH 2021 Shared Task SPAPL System
Viaarxiv icon

The IWSLT 2021 BUT Speech Translation Systems

Jul 13, 2021
Hari Krishna Vydana, Martin Karafi'at, Luk'as Burget, "Honza" Cernock'y

Figure 1 for The IWSLT 2021 BUT Speech Translation Systems
Figure 2 for The IWSLT 2021 BUT Speech Translation Systems
Figure 3 for The IWSLT 2021 BUT Speech Translation Systems
Figure 4 for The IWSLT 2021 BUT Speech Translation Systems
Viaarxiv icon

Dynamic Gradient Aggregation for Federated Domain Adaptation

Jun 14, 2021
Dimitrios Dimitriadis, Kenichi Kumatani, Robert Gmyr, Yashesh Gaur, Sefik Emre Eskimez

Figure 1 for Dynamic Gradient Aggregation for Federated Domain Adaptation
Figure 2 for Dynamic Gradient Aggregation for Federated Domain Adaptation
Figure 3 for Dynamic Gradient Aggregation for Federated Domain Adaptation
Figure 4 for Dynamic Gradient Aggregation for Federated Domain Adaptation
Viaarxiv icon

May I Ask Who's Calling? Named Entity Recognition on Call Center Transcripts for Privacy Law Compliance

Oct 29, 2020
Micaela Kaplan

Figure 1 for May I Ask Who's Calling? Named Entity Recognition on Call Center Transcripts for Privacy Law Compliance
Figure 2 for May I Ask Who's Calling? Named Entity Recognition on Call Center Transcripts for Privacy Law Compliance
Figure 3 for May I Ask Who's Calling? Named Entity Recognition on Call Center Transcripts for Privacy Law Compliance
Figure 4 for May I Ask Who's Calling? Named Entity Recognition on Call Center Transcripts for Privacy Law Compliance
Viaarxiv icon