Alert button

"speech": models, code, and papers
Alert button

Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech

Jun 25, 2021
Raahil Shah, Kamil Pokora, Abdelhamid Ezzerg, Viacheslav Klimkov, Goeric Huybrechts, Bartosz Putrycz, Daniel Korzekwa, Thomas Merritt

Figure 1 for Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech
Figure 2 for Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech
Figure 3 for Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech
Figure 4 for Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech
Viaarxiv icon

Contextual Hate Speech Detection in Code Mixed Text using Transformer Based Approaches

Nov 01, 2021
Ravindra Nayak, Raviraj Joshi

Figure 1 for Contextual Hate Speech Detection in Code Mixed Text using Transformer Based Approaches
Figure 2 for Contextual Hate Speech Detection in Code Mixed Text using Transformer Based Approaches
Figure 3 for Contextual Hate Speech Detection in Code Mixed Text using Transformer Based Approaches
Figure 4 for Contextual Hate Speech Detection in Code Mixed Text using Transformer Based Approaches
Viaarxiv icon

Generative Speech Coding with Predictive Variance Regularization

Feb 18, 2021
W. Bastiaan Kleijn, Andrew Storus, Michael Chinen, Tom Denton, Felicia S. C. Lim, Alejandro Luebs, Jan Skoglund, Hengchin Yeh

Figure 1 for Generative Speech Coding with Predictive Variance Regularization
Figure 2 for Generative Speech Coding with Predictive Variance Regularization
Figure 3 for Generative Speech Coding with Predictive Variance Regularization
Viaarxiv icon

Efficient Speech Emotion Recognition Using Multi-Scale CNN and Attention

Add code
Bookmark button
Alert button
Jun 08, 2021
Zixuan Peng, Yu Lu, Shengfeng Pan, Yunfeng Liu

Figure 1 for Efficient Speech Emotion Recognition Using Multi-Scale CNN and Attention
Figure 2 for Efficient Speech Emotion Recognition Using Multi-Scale CNN and Attention
Figure 3 for Efficient Speech Emotion Recognition Using Multi-Scale CNN and Attention
Viaarxiv icon

Context-Aware Transformer Transducer for Speech Recognition

Nov 05, 2021
Feng-Ju Chang, Jing Liu, Martin Radfar, Athanasios Mouchtaris, Maurizio Omologo, Ariya Rastrow, Siegfried Kunzmann

Figure 1 for Context-Aware Transformer Transducer for Speech Recognition
Figure 2 for Context-Aware Transformer Transducer for Speech Recognition
Figure 3 for Context-Aware Transformer Transducer for Speech Recognition
Figure 4 for Context-Aware Transformer Transducer for Speech Recognition
Viaarxiv icon

Low Bit-Rate Wideband Speech Coding: A Deep Generative Model based Approach

Feb 04, 2021
Gang Min, Xiongwei Zhang, Xia Zou, Xiangyang Liu

Figure 1 for Low Bit-Rate Wideband Speech Coding: A Deep Generative Model based Approach
Figure 2 for Low Bit-Rate Wideband Speech Coding: A Deep Generative Model based Approach
Figure 3 for Low Bit-Rate Wideband Speech Coding: A Deep Generative Model based Approach
Figure 4 for Low Bit-Rate Wideband Speech Coding: A Deep Generative Model based Approach
Viaarxiv icon

Time Domain Adversarial Voice Conversion for ADD 2022

Add code
Bookmark button
Alert button
Apr 20, 2022
Cheng Wen, Tingwei Guo, Xingjun Tan, Rui Yan, Shuran Zhou, Chuandong Xie, Wei Zou, Xiangang Li

Figure 1 for Time Domain Adversarial Voice Conversion for ADD 2022
Figure 2 for Time Domain Adversarial Voice Conversion for ADD 2022
Figure 3 for Time Domain Adversarial Voice Conversion for ADD 2022
Figure 4 for Time Domain Adversarial Voice Conversion for ADD 2022
Viaarxiv icon

Lightweight Adapter Tuning for Multilingual Speech Translation

Add code
Bookmark button
Alert button
Jun 02, 2021
Hang Le, Juan Pino, Changhan Wang, Jiatao Gu, Didier Schwab, Laurent Besacier

Figure 1 for Lightweight Adapter Tuning for Multilingual Speech Translation
Figure 2 for Lightweight Adapter Tuning for Multilingual Speech Translation
Figure 3 for Lightweight Adapter Tuning for Multilingual Speech Translation
Figure 4 for Lightweight Adapter Tuning for Multilingual Speech Translation
Viaarxiv icon

Generalized Representations Learning for Time Series Classification

Add code
Bookmark button
Alert button
Sep 15, 2022
Wang Lu, Jindong Wang, Xinwei Sun, Yiqiang Chen, Xing Xie

Figure 1 for Generalized Representations Learning for Time Series Classification
Figure 2 for Generalized Representations Learning for Time Series Classification
Figure 3 for Generalized Representations Learning for Time Series Classification
Figure 4 for Generalized Representations Learning for Time Series Classification
Viaarxiv icon

MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement

Add code
Bookmark button
Alert button
Apr 08, 2021
Szu-Wei Fu, Cheng Yu, Tsun-An Hsieh, Peter Plantinga, Mirco Ravanelli, Xugang Lu, Yu Tsao

Figure 1 for MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement
Figure 2 for MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement
Figure 3 for MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement
Figure 4 for MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement
Viaarxiv icon