"speech recognition": models, code, and papers

Fixed-MAML for Few Shot Classification in Multilingual Speech Emotion Recognition

Jan 05, 2021
Anugunj Naman, Liliana Mancini

Via arXiv

Does Speech enhancement of publicly available data help build robust Speech Recognition Systems?

Oct 29, 2019
Bhavya Ghai, Buvana Ramanan, Klaus Mueller

Via arXiv

Improving sequence-to-sequence speech recognition training with on-the-fly data augmentation

Oct 29, 2019
Thai-Son Nguyen, Sebastian Stueker, Jan Niehues, Alex Waibel

Via arXiv

Parallel Composition of Weighted Finite-State Transducers

Oct 06, 2021
Shubho Sengupta, Vineel Pratap, Awni Hannun

Via arXiv

Dawn of the transformer era in speech emotion recognition: closing the valence gap

Mar 16, 2022
Johannes Wagner, Andreas Triantafyllopoulos, Hagen Wierstorf, Maximilian Schmitt, Felix Burkhardt, Florian Eyben, Björn W. Schuller

Via arXiv

Visual Speech Recognition Using PCA Networks and LSTMs in a Tandem GMM-HMM System

Oct 19, 2017
Marina Zimmermann, Mostafa Mehdipour Ghazi, Hazım Kemal Ekenel, Jean-Philippe Thiran

Via arXiv

Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition

Mar 22, 2019
Yao Qin, Nicholas Carlini, Ian Goodfellow, Garrison Cottrell, Colin Raffel

Via arXiv

SpeechMoE2: Mixture-of-Experts Model with Improved Routing

Nov 23, 2021
Zhao You, Shulin Feng, Dan Su, Dong Yu

Via arXiv

Privacy against Real-Time Speech Emotion Detection via Acoustic Adversarial Evasion of Machine Learning

Nov 17, 2022
Brian Testa, Yi Xiao, Avery Gump, Asif Salekin

Via arXiv

Automatic Spoken Language Identification using a Time-Delay Neural Network

May 19, 2022
Benjamin Kepecs, Homayoon Beigi

Via arXiv