Alert button

"speech": models, code, and papers
Alert button

How to Leverage DNN-based speech enhancement for multi-channel speaker verification?

Add code
Bookmark button
Alert button
Oct 17, 2022
Sandipana Dowerah, Romain Serizel, Denis Jouvet, Mohammad Mohammadamini, Driss Matrouf

Figure 1 for How to Leverage DNN-based speech enhancement for multi-channel speaker verification?
Viaarxiv icon

Contrastive Representation Learning for Acoustic Parameter Estimation

Feb 22, 2023
Philipp Götz, Cagdas Tuna, Andreas Walther, Emanuël A. P. Habets

Figure 1 for Contrastive Representation Learning for Acoustic Parameter Estimation
Figure 2 for Contrastive Representation Learning for Acoustic Parameter Estimation
Figure 3 for Contrastive Representation Learning for Acoustic Parameter Estimation
Figure 4 for Contrastive Representation Learning for Acoustic Parameter Estimation
Viaarxiv icon

A Study of Modeling Rising Intonation in Cantonese Neural Speech Synthesis

Add code
Bookmark button
Alert button
Aug 03, 2022
Qibing Bai, Tom Ko, Yu Zhang

Figure 1 for A Study of Modeling Rising Intonation in Cantonese Neural Speech Synthesis
Figure 2 for A Study of Modeling Rising Intonation in Cantonese Neural Speech Synthesis
Figure 3 for A Study of Modeling Rising Intonation in Cantonese Neural Speech Synthesis
Figure 4 for A Study of Modeling Rising Intonation in Cantonese Neural Speech Synthesis
Viaarxiv icon

Self-supervised speech unit discovery from articulatory and acoustic features using VQ-VAE

Add code
Bookmark button
Alert button
Jun 17, 2022
Marc-Antoine Georges, Jean-Luc Schwartz, Thomas Hueber

Figure 1 for Self-supervised speech unit discovery from articulatory and acoustic features using VQ-VAE
Figure 2 for Self-supervised speech unit discovery from articulatory and acoustic features using VQ-VAE
Figure 3 for Self-supervised speech unit discovery from articulatory and acoustic features using VQ-VAE
Viaarxiv icon

Speech Emotion: Investigating Model Representations, Multi-Task Learning and Knowledge Distillation

Jul 02, 2022
Vikramjit Mitra, Hsiang-Yun Sherry Chien, Vasudha Kowtha, Joseph Yitan Cheng, Erdrin Azemi

Figure 1 for Speech Emotion: Investigating Model Representations, Multi-Task Learning and Knowledge Distillation
Figure 2 for Speech Emotion: Investigating Model Representations, Multi-Task Learning and Knowledge Distillation
Figure 3 for Speech Emotion: Investigating Model Representations, Multi-Task Learning and Knowledge Distillation
Figure 4 for Speech Emotion: Investigating Model Representations, Multi-Task Learning and Knowledge Distillation
Viaarxiv icon

A Novel Speech-Driven Lip-Sync Model with CNN and LSTM

May 02, 2022
Xiaohong Li, Xiang Wang, Kai Wang, Shiguo Lian

Figure 1 for A Novel Speech-Driven Lip-Sync Model with CNN and LSTM
Figure 2 for A Novel Speech-Driven Lip-Sync Model with CNN and LSTM
Figure 3 for A Novel Speech-Driven Lip-Sync Model with CNN and LSTM
Figure 4 for A Novel Speech-Driven Lip-Sync Model with CNN and LSTM
Viaarxiv icon

FusionFormer: Fusing Operations in Transformer for Efficient Streaming Speech Recognition

Add code
Bookmark button
Alert button
Oct 31, 2022
Xingchen Song, Di Wu, Binbin Zhang, Zhiyong Wu, Wenpeng Li, Dongfang Li, Pengshen Zhang, Zhendong Peng, Fuping Pan, Changbao Zhu, Zhongqin Wu

Figure 1 for FusionFormer: Fusing Operations in Transformer for Efficient Streaming Speech Recognition
Figure 2 for FusionFormer: Fusing Operations in Transformer for Efficient Streaming Speech Recognition
Figure 3 for FusionFormer: Fusing Operations in Transformer for Efficient Streaming Speech Recognition
Figure 4 for FusionFormer: Fusing Operations in Transformer for Efficient Streaming Speech Recognition
Viaarxiv icon

Hybrid-SD ($\text{H}_{\text{SD}}$) : A new hybrid evaluation metric for automatic speech recognition tasks

Nov 03, 2022
Zitha Sasindran, Harsha Yelchuri, Supreeth Rao, T. V. Prabhakar

Figure 1 for Hybrid-SD ($\text{H}_{\text{SD}}$) : A new hybrid evaluation metric for automatic speech recognition tasks
Figure 2 for Hybrid-SD ($\text{H}_{\text{SD}}$) : A new hybrid evaluation metric for automatic speech recognition tasks
Figure 3 for Hybrid-SD ($\text{H}_{\text{SD}}$) : A new hybrid evaluation metric for automatic speech recognition tasks
Figure 4 for Hybrid-SD ($\text{H}_{\text{SD}}$) : A new hybrid evaluation metric for automatic speech recognition tasks
Viaarxiv icon

Leveraging Speaker Embeddings with Adversarial Multi-task Learning for Age Group Classification

Jan 22, 2023
Kwangje Baeg, Yeong-Gwan Kim, Young-Sub Han, Byoung-Ki Jeon

Figure 1 for Leveraging Speaker Embeddings with Adversarial Multi-task Learning for Age Group Classification
Figure 2 for Leveraging Speaker Embeddings with Adversarial Multi-task Learning for Age Group Classification
Figure 3 for Leveraging Speaker Embeddings with Adversarial Multi-task Learning for Age Group Classification
Figure 4 for Leveraging Speaker Embeddings with Adversarial Multi-task Learning for Age Group Classification
Viaarxiv icon

Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning

Add code
Bookmark button
Alert button
Jun 15, 2022
Rui Liu, Berrak Sisman, Björn Schuller, Guanglai Gao, Haizhou Li

Figure 1 for Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning
Figure 2 for Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning
Figure 3 for Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning
Figure 4 for Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning
Viaarxiv icon