"speech recognition": models, code, and papers

End-to-End Speech Recognition with High-Frame-Rate Features Extraction

Jul 12, 2019
Cong-Thanh Do

Speech Emotion Recognition using Supervised Deep Recurrent System for Mental Health Monitoring

Aug 26, 2022
Nelly Elsayed, Zag ElSayed, Navid Asadizanjani, Murat Ozer, Ahmed Abdelgawad, Magdy Bayoumi

Automatic Speech Recognition with Very Large Conversational Finnish and Estonian Vocabularies

Sep 29, 2017
Seppo Enarvi, Peter Smit, Sami Virpioja, Mikko Kurimo

Why has (reasonably accurate) Automatic Speech Recognition been so hard to achieve?

Feb 28, 2010
Steven Wegmann, Larry Gillick

End-to-end multi-talker audio-visual ASR using an active speaker attention module

Apr 01, 2022
Richard Rose, Olivier Siohan

Language-Independent Speaker Anonymization Approach using Self-Supervised Pre-Trained Models

Feb 26, 2022
Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, N. Tomashenko

Transfer learning from High-Resource to Low-Resource Language Improves Speech Affect Recognition Classification Accuracy

Mar 04, 2021
Sara Durrani, Umair Arshad

Spectral feature mapping with mimic loss for robust speech recognition

Mar 26, 2018
Deblin Bagchi, Peter Plantinga, Adam Stiff, Eric Fosler-Lussier

CMGAN: Conformer-based Metric GAN for Speech Enhancement

Mar 28, 2022
Ruizhe Cao, Sherif Abdulatif, Bin Yang
