Alert button

"speech": models, code, and papers
Alert button

Multistage linguistic conditioning of convolutional layers for speech emotion recognition

Oct 13, 2021
Andreas Triantafyllopoulos, Uwe Reichel, Shuo Liu, Stephan Huber, Florian Eyben, Björn W. Schuller

Figure 1 for Multistage linguistic conditioning of convolutional layers for speech emotion recognition
Figure 2 for Multistage linguistic conditioning of convolutional layers for speech emotion recognition
Figure 3 for Multistage linguistic conditioning of convolutional layers for speech emotion recognition
Figure 4 for Multistage linguistic conditioning of convolutional layers for speech emotion recognition
Viaarxiv icon

Low-resource Low-footprint Wake-word Detection using Knowledge Distillation

Jul 06, 2022
Arindam Ghosh, Mark Fuhs, Deblin Bagchi, Bahman Farahani, Monika Woszczyna

Figure 1 for Low-resource Low-footprint Wake-word Detection using Knowledge Distillation
Figure 2 for Low-resource Low-footprint Wake-word Detection using Knowledge Distillation
Figure 3 for Low-resource Low-footprint Wake-word Detection using Knowledge Distillation
Figure 4 for Low-resource Low-footprint Wake-word Detection using Knowledge Distillation
Viaarxiv icon

Learning to Understand Child-directed and Adult-directed Speech

May 22, 2020
Lieke Gelderloos, Grzegorz Chrupała, Afra Alishahi

Figure 1 for Learning to Understand Child-directed and Adult-directed Speech
Figure 2 for Learning to Understand Child-directed and Adult-directed Speech
Figure 3 for Learning to Understand Child-directed and Adult-directed Speech
Figure 4 for Learning to Understand Child-directed and Adult-directed Speech
Viaarxiv icon

Transfer Learning based Speech Affect Recognition in Urdu

Mar 05, 2021
Sara Durrani, Muhammad Umair Arshad

Figure 1 for Transfer Learning based Speech Affect Recognition in Urdu
Figure 2 for Transfer Learning based Speech Affect Recognition in Urdu
Figure 3 for Transfer Learning based Speech Affect Recognition in Urdu
Figure 4 for Transfer Learning based Speech Affect Recognition in Urdu
Viaarxiv icon

Detecting English Speech in the Air Traffic Control Voice Communication

Apr 06, 2021
Igor Szoke, Santosh Kesiraju, Ondrej Novotny, Martin Kocour, Karel Vesely, Jan "Honza" Cernocky

Figure 1 for Detecting English Speech in the Air Traffic Control Voice Communication
Figure 2 for Detecting English Speech in the Air Traffic Control Voice Communication
Figure 3 for Detecting English Speech in the Air Traffic Control Voice Communication
Figure 4 for Detecting English Speech in the Air Traffic Control Voice Communication
Viaarxiv icon

Classification of Emotions and Evaluation of Customer Satisfaction from Speech in Real World Acoustic Environments

Aug 26, 2021
Luis Felipe Parra-Gallego, Juan Rafael Orozco-Arroyave

Figure 1 for Classification of Emotions and Evaluation of Customer Satisfaction from Speech in Real World Acoustic Environments
Figure 2 for Classification of Emotions and Evaluation of Customer Satisfaction from Speech in Real World Acoustic Environments
Figure 3 for Classification of Emotions and Evaluation of Customer Satisfaction from Speech in Real World Acoustic Environments
Figure 4 for Classification of Emotions and Evaluation of Customer Satisfaction from Speech in Real World Acoustic Environments
Viaarxiv icon

Three-Module Modeling For End-to-End Spoken Language Understanding Using Pre-trained DNN-HMM-Based Acoustic-Phonetic Model

Apr 07, 2022
Nick J. C. Wang, Lu Wang, Yandan Sun, Haimei Kang, Dejun Zhang

Figure 1 for Three-Module Modeling For End-to-End Spoken Language Understanding Using Pre-trained DNN-HMM-Based Acoustic-Phonetic Model
Figure 2 for Three-Module Modeling For End-to-End Spoken Language Understanding Using Pre-trained DNN-HMM-Based Acoustic-Phonetic Model
Figure 3 for Three-Module Modeling For End-to-End Spoken Language Understanding Using Pre-trained DNN-HMM-Based Acoustic-Phonetic Model
Figure 4 for Three-Module Modeling For End-to-End Spoken Language Understanding Using Pre-trained DNN-HMM-Based Acoustic-Phonetic Model
Viaarxiv icon

3D Convolutional Neural Networks for Ultrasound-Based Silent Speech Interfaces

Apr 23, 2021
László Tóth, Amin Honarmandi Shandiz

Figure 1 for 3D Convolutional Neural Networks for Ultrasound-Based Silent Speech Interfaces
Figure 2 for 3D Convolutional Neural Networks for Ultrasound-Based Silent Speech Interfaces
Figure 3 for 3D Convolutional Neural Networks for Ultrasound-Based Silent Speech Interfaces
Figure 4 for 3D Convolutional Neural Networks for Ultrasound-Based Silent Speech Interfaces
Viaarxiv icon

BembaSpeech: A Speech Recognition Corpus for the Bemba Language

Feb 09, 2021
Claytone Sikasote, Antonios Anastasopoulos

Figure 1 for BembaSpeech: A Speech Recognition Corpus for the Bemba Language
Figure 2 for BembaSpeech: A Speech Recognition Corpus for the Bemba Language
Figure 3 for BembaSpeech: A Speech Recognition Corpus for the Bemba Language
Figure 4 for BembaSpeech: A Speech Recognition Corpus for the Bemba Language
Viaarxiv icon

CarneliNet: Neural Mixture Model for Automatic Speech Recognition

Jul 22, 2021
Aleksei Kalinov, Somshubra Majumdar, Jagadeesh Balam, Boris Ginsburg

Figure 1 for CarneliNet: Neural Mixture Model for Automatic Speech Recognition
Figure 2 for CarneliNet: Neural Mixture Model for Automatic Speech Recognition
Figure 3 for CarneliNet: Neural Mixture Model for Automatic Speech Recognition
Figure 4 for CarneliNet: Neural Mixture Model for Automatic Speech Recognition
Viaarxiv icon