"speech": models, code, and papers

SpeechNet: A Universal Modularized Model for Speech Processing Tasks

May 31, 2021
Yi-Chen Chen, Po-Han Chi, Shu-wen Yang, Kai-Wei Chang, Jheng-hao Lin, Sung-Feng Huang, Da-Rong Liu, Chi-Liang Liu, Cheng-Kuang Lee, Hung-yi Lee

Via arXiv

Contrastive Regularization for Multimodal Emotion Recognition Using Audio and Text

Nov 20, 2022
Fan Qian, Jiqing Han

Via arXiv

Robust Speech Representation Learning via Flow-based Embedding Regularization

Dec 07, 2021
Woo Hyun Kang, Jahangir Alam, Abderrahim Fathan

Via arXiv

DelightfulTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2021

Nov 19, 2021
Yanqing Liu, Zhihang Xu, Gang Wang, Kuan Chen, Bohan Li, Xu Tan, Jinzhu Li, Lei He, Sheng Zhao

Via arXiv

HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

Mar 24, 2022
Pavel Andreev, Aibek Alanov, Oleg Ivanov, Dmitry Vetrov

Via arXiv

Going In Style: Audio Backdoors Through Stylistic Transformations

Nov 11, 2022
Stefanos Koffas, Luca Pajola, Stjepan Picek, Mauro Conti

Via arXiv

Speaker Anonymization with Phonetic Intermediate Representations

Jul 11, 2022
Sarina Meyer, Florian Lux, Pavel Denisov, Julia Koch, Pascal Tilli, Ngoc Thang Vu

Via arXiv

Unsupervised Domain Adaptation in Speech Recognition using Phonetic Features

Aug 04, 2021
Rupam Ojha, C Chandra Sekhar

Via arXiv

Hope Speech detection in under-resourced Kannada language

Aug 10, 2021
Adeep Hande, Ruba Priyadharshini, Anbukkarasi Sampath, Kingston Pal Thamburaj, Prabakaran Chandran, Bharathi Raja Chakravarthi

Via arXiv

ASMDD: Arabic Speech Mispronunciation Detection Dataset

Nov 01, 2021
Salah A. Aly, Abdelrahman Salah, Hesham M. Eraqi

Via arXiv