Alert button

"speech recognition": models, code, and papers
Alert button

Beyond Isolated Utterances: Conversational Emotion Recognition

Sep 13, 2021
Raghavendra Pappagari, Piotr Żelasko, Jesús Villalba, Laureano Moro-Velazquez, Najim Dehak

Figure 1 for Beyond Isolated Utterances: Conversational Emotion Recognition
Figure 2 for Beyond Isolated Utterances: Conversational Emotion Recognition
Figure 3 for Beyond Isolated Utterances: Conversational Emotion Recognition
Figure 4 for Beyond Isolated Utterances: Conversational Emotion Recognition
Viaarxiv icon

Multi-modal embeddings using multi-task learning for emotion recognition

Sep 10, 2020
Aparna Khare, Srinivas Parthasarathy, Shiva Sundaram

Figure 1 for Multi-modal embeddings using multi-task learning for emotion recognition
Figure 2 for Multi-modal embeddings using multi-task learning for emotion recognition
Figure 3 for Multi-modal embeddings using multi-task learning for emotion recognition
Viaarxiv icon

The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS

Add code
Bookmark button
Alert button
Oct 06, 2020
Wen-Chin Huang, Tomoki Hayashi, Shinji Watanabe, Tomoki Toda

Figure 1 for The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS
Figure 2 for The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS
Figure 3 for The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS
Figure 4 for The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS
Viaarxiv icon

Split Computing and Early Exiting for Deep Learning Applications: Survey and Research Challenges

Add code
Bookmark button
Alert button
Mar 08, 2021
Yoshitomo Matsubara, Marco Levorato, Francesco Restuccia

Figure 1 for Split Computing and Early Exiting for Deep Learning Applications: Survey and Research Challenges
Figure 2 for Split Computing and Early Exiting for Deep Learning Applications: Survey and Research Challenges
Figure 3 for Split Computing and Early Exiting for Deep Learning Applications: Survey and Research Challenges
Figure 4 for Split Computing and Early Exiting for Deep Learning Applications: Survey and Research Challenges
Viaarxiv icon

Transferable Positive/Negative Speech Emotion Recognition via Class-wise Adversarial Domain Adaptation

Oct 30, 2018
Hao Zhou, Ke Chen

Figure 1 for Transferable Positive/Negative Speech Emotion Recognition via Class-wise Adversarial Domain Adaptation
Figure 2 for Transferable Positive/Negative Speech Emotion Recognition via Class-wise Adversarial Domain Adaptation
Figure 3 for Transferable Positive/Negative Speech Emotion Recognition via Class-wise Adversarial Domain Adaptation
Figure 4 for Transferable Positive/Negative Speech Emotion Recognition via Class-wise Adversarial Domain Adaptation
Viaarxiv icon

ConcealNet: An End-to-end Neural Network for Packet Loss Concealment in Deep Speech Emotion Recognition

May 15, 2020
Mostafa M. Mohamed, Björn W. Schuller

Figure 1 for ConcealNet: An End-to-end Neural Network for Packet Loss Concealment in Deep Speech Emotion Recognition
Figure 2 for ConcealNet: An End-to-end Neural Network for Packet Loss Concealment in Deep Speech Emotion Recognition
Figure 3 for ConcealNet: An End-to-end Neural Network for Packet Loss Concealment in Deep Speech Emotion Recognition
Figure 4 for ConcealNet: An End-to-end Neural Network for Packet Loss Concealment in Deep Speech Emotion Recognition
Viaarxiv icon

Robustness of on-device Models: Adversarial Attack to Deep Learning Models on Android Apps

Add code
Bookmark button
Alert button
Feb 04, 2021
Yujin Huang, Han Hu, Chunyang Chen

Figure 1 for Robustness of on-device Models: Adversarial Attack to Deep Learning Models on Android Apps
Figure 2 for Robustness of on-device Models: Adversarial Attack to Deep Learning Models on Android Apps
Figure 3 for Robustness of on-device Models: Adversarial Attack to Deep Learning Models on Android Apps
Figure 4 for Robustness of on-device Models: Adversarial Attack to Deep Learning Models on Android Apps
Viaarxiv icon

Vocoder-free End-to-End Voice Conversion with Transformer Network

Add code
Bookmark button
Alert button
Feb 05, 2020
June-Woo Kim, Ho-Young Jung, Minho Lee

Figure 1 for Vocoder-free End-to-End Voice Conversion with Transformer Network
Figure 2 for Vocoder-free End-to-End Voice Conversion with Transformer Network
Figure 3 for Vocoder-free End-to-End Voice Conversion with Transformer Network
Figure 4 for Vocoder-free End-to-End Voice Conversion with Transformer Network
Viaarxiv icon

Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords

Add code
Bookmark button
Alert button
Feb 03, 2021
Prashanth Gurunath Shivakumar, Panayiotis Georgiou, Shrikanth Narayanan

Figure 1 for Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords
Figure 2 for Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords
Figure 3 for Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords
Figure 4 for Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords
Viaarxiv icon

Voice based self help System: User Experience Vs Accuracy

Apr 07, 2015
Sunil Kumar Kopparapu

Figure 1 for Voice based self help System: User Experience Vs Accuracy
Viaarxiv icon