Alert button

"speech": models, code, and papers
Alert button

SANTLR: Speech Annotation Toolkit for Low Resource Languages

Aug 02, 2019
Xinjian Li, Zhong Zhou, Siddharth Dalmia, Alan W. Black, Florian Metze

Figure 1 for SANTLR: Speech Annotation Toolkit for Low Resource Languages
Viaarxiv icon

Enhancing Segment-Based Speech Emotion Recognition by Deep Self-Learning

Mar 30, 2021
Shuiyang Mao, P. C. Ching, Tan Lee

Figure 1 for Enhancing Segment-Based Speech Emotion Recognition by Deep Self-Learning
Figure 2 for Enhancing Segment-Based Speech Emotion Recognition by Deep Self-Learning
Figure 3 for Enhancing Segment-Based Speech Emotion Recognition by Deep Self-Learning
Figure 4 for Enhancing Segment-Based Speech Emotion Recognition by Deep Self-Learning
Viaarxiv icon

Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction

Apr 15, 2022
Zifeng Zhao, Rongzhi Gu, Dongchao Yang, Jinchuan Tian, Yuexian Zou

Figure 1 for Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction
Figure 2 for Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction
Figure 3 for Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction
Figure 4 for Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction
Viaarxiv icon

Personalization Strategies for End-to-End Speech Recognition Systems

Feb 15, 2021
Aditya Gourav, Linda Liu, Ankur Gandhe, Yile Gu, Guitang Lan, Xiangyang Huang, Shashank Kalmane, Gautam Tiwari, Denis Filimonov, Ariya Rastrow, Andreas Stolcke, Ivan Bulyko

Figure 1 for Personalization Strategies for End-to-End Speech Recognition Systems
Figure 2 for Personalization Strategies for End-to-End Speech Recognition Systems
Figure 3 for Personalization Strategies for End-to-End Speech Recognition Systems
Figure 4 for Personalization Strategies for End-to-End Speech Recognition Systems
Viaarxiv icon

A Likelihood Ratio based Domain Adaptation Method for E2E Models

Jan 10, 2022
Chhavi Choudhury, Ankur Gandhe, Xiaohan Ding, Ivan Bulyko

Figure 1 for A Likelihood Ratio based Domain Adaptation Method for E2E Models
Figure 2 for A Likelihood Ratio based Domain Adaptation Method for E2E Models
Figure 3 for A Likelihood Ratio based Domain Adaptation Method for E2E Models
Figure 4 for A Likelihood Ratio based Domain Adaptation Method for E2E Models
Viaarxiv icon

Detecting Escalation Level from Speech with Transfer Learning and Acoustic-Lexical Information Fusion

Apr 13, 2021
Ziang Zhou, Yanze Xu, Shilei Zhang, Ming Li

Figure 1 for Detecting Escalation Level from Speech with Transfer Learning and Acoustic-Lexical Information Fusion
Figure 2 for Detecting Escalation Level from Speech with Transfer Learning and Acoustic-Lexical Information Fusion
Figure 3 for Detecting Escalation Level from Speech with Transfer Learning and Acoustic-Lexical Information Fusion
Figure 4 for Detecting Escalation Level from Speech with Transfer Learning and Acoustic-Lexical Information Fusion
Viaarxiv icon

Towards countering hate speech and personal attack in social media

Dec 05, 2019
Polychronis Charitidis, Stavros Doropoulos, Stavros Vologiannidis, Ioannis Papastergiou, Sophia Karakeva

Figure 1 for Towards countering hate speech and personal attack in social media
Figure 2 for Towards countering hate speech and personal attack in social media
Figure 3 for Towards countering hate speech and personal attack in social media
Figure 4 for Towards countering hate speech and personal attack in social media
Viaarxiv icon

Moving fast and slow: Analysis of representations and post-processing in speech-driven automatic gesture generation

Jul 16, 2020
Taras Kucherenko, Dai Hasegawa, Naoshi Kaneko, Gustav Eje Henter, Hedvig Kjellström

Figure 1 for Moving fast and slow: Analysis of representations and post-processing in speech-driven automatic gesture generation
Figure 2 for Moving fast and slow: Analysis of representations and post-processing in speech-driven automatic gesture generation
Figure 3 for Moving fast and slow: Analysis of representations and post-processing in speech-driven automatic gesture generation
Figure 4 for Moving fast and slow: Analysis of representations and post-processing in speech-driven automatic gesture generation
Viaarxiv icon

Emotion Intensity and its Control for Emotional Voice Conversion

Jan 10, 2022
Kun Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li

Figure 1 for Emotion Intensity and its Control for Emotional Voice Conversion
Figure 2 for Emotion Intensity and its Control for Emotional Voice Conversion
Figure 3 for Emotion Intensity and its Control for Emotional Voice Conversion
Figure 4 for Emotion Intensity and its Control for Emotional Voice Conversion
Viaarxiv icon

Novel Hybrid DNN Approaches for Speaker Verification in Emotional and Stressful Talking Environments

Dec 26, 2021
Ismail Shahin, Ali Bou Nassif, Nawel Nemmour, Ashraf Elnagar, Adi Alhudhaif, Kemal Polat

Figure 1 for Novel Hybrid DNN Approaches for Speaker Verification in Emotional and Stressful Talking Environments
Figure 2 for Novel Hybrid DNN Approaches for Speaker Verification in Emotional and Stressful Talking Environments
Figure 3 for Novel Hybrid DNN Approaches for Speaker Verification in Emotional and Stressful Talking Environments
Figure 4 for Novel Hybrid DNN Approaches for Speaker Verification in Emotional and Stressful Talking Environments
Viaarxiv icon