Alert button

"speech": models, code, and papers
Alert button

Generative Pre-Training for Speech with Autoregressive Predictive Coding

Add code
Bookmark button
Alert button
Oct 23, 2019
Yu-An Chung, James Glass

Figure 1 for Generative Pre-Training for Speech with Autoregressive Predictive Coding
Figure 2 for Generative Pre-Training for Speech with Autoregressive Predictive Coding
Figure 3 for Generative Pre-Training for Speech with Autoregressive Predictive Coding
Figure 4 for Generative Pre-Training for Speech with Autoregressive Predictive Coding
Viaarxiv icon

Deep Residual Local Feature Learning for Speech Emotion Recognition

Nov 19, 2020
Sattaya Singkul, Thakorn Chatchaisathaporn, Boontawee Suntisrivaraporn, Kuntpong Woraratpanya

Figure 1 for Deep Residual Local Feature Learning for Speech Emotion Recognition
Figure 2 for Deep Residual Local Feature Learning for Speech Emotion Recognition
Figure 3 for Deep Residual Local Feature Learning for Speech Emotion Recognition
Figure 4 for Deep Residual Local Feature Learning for Speech Emotion Recognition
Viaarxiv icon

Voice Conversion Can Improve ASR in Very Low-Resource Settings

Add code
Bookmark button
Alert button
Nov 04, 2021
Matthew Baas, Herman Kamper

Figure 1 for Voice Conversion Can Improve ASR in Very Low-Resource Settings
Figure 2 for Voice Conversion Can Improve ASR in Very Low-Resource Settings
Figure 3 for Voice Conversion Can Improve ASR in Very Low-Resource Settings
Figure 4 for Voice Conversion Can Improve ASR in Very Low-Resource Settings
Viaarxiv icon

To BAN or not to BAN: Bayesian Attention Networks for Reliable Hate Speech Detection

Add code
Bookmark button
Alert button
Jul 10, 2020
Kristian Miok, Blaz Skrlj, Daniela Zaharie, Marko Robnik-Sikonja

Figure 1 for To BAN or not to BAN: Bayesian Attention Networks for Reliable Hate Speech Detection
Figure 2 for To BAN or not to BAN: Bayesian Attention Networks for Reliable Hate Speech Detection
Figure 3 for To BAN or not to BAN: Bayesian Attention Networks for Reliable Hate Speech Detection
Figure 4 for To BAN or not to BAN: Bayesian Attention Networks for Reliable Hate Speech Detection
Viaarxiv icon

An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition

Jul 22, 2021
Ruchao Fan, Wei Chu, Peng Chang, Jing Xiao, Abeer Alwan

Figure 1 for An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition
Figure 2 for An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition
Figure 3 for An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition
Figure 4 for An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition
Viaarxiv icon

Self-Supervised Learning from Contrastive Mixtures for Personalized Speech Enhancement

Nov 06, 2020
Aswin Sivaraman, Minje Kim

Figure 1 for Self-Supervised Learning from Contrastive Mixtures for Personalized Speech Enhancement
Figure 2 for Self-Supervised Learning from Contrastive Mixtures for Personalized Speech Enhancement
Figure 3 for Self-Supervised Learning from Contrastive Mixtures for Personalized Speech Enhancement
Figure 4 for Self-Supervised Learning from Contrastive Mixtures for Personalized Speech Enhancement
Viaarxiv icon

Towards the Objective Speech Assessment of Smoking Status based on Voice Features: A Review of the Literature

Jun 15, 2021
Zhizhong Ma, Chris Bullen, Joanna Ting Wai Chu, Ruili Wang, Yingchun Wang, Satwinder Singh

Figure 1 for Towards the Objective Speech Assessment of Smoking Status based on Voice Features: A Review of the Literature
Figure 2 for Towards the Objective Speech Assessment of Smoking Status based on Voice Features: A Review of the Literature
Figure 3 for Towards the Objective Speech Assessment of Smoking Status based on Voice Features: A Review of the Literature
Figure 4 for Towards the Objective Speech Assessment of Smoking Status based on Voice Features: A Review of the Literature
Viaarxiv icon

Single Channel Speech Enhancement Using Temporal Convolutional Recurrent Neural Networks

Feb 02, 2020
Jingdong Li, Hui Zhang, Xueliang Zhang, Changliang Li

Figure 1 for Single Channel Speech Enhancement Using Temporal Convolutional Recurrent Neural Networks
Figure 2 for Single Channel Speech Enhancement Using Temporal Convolutional Recurrent Neural Networks
Figure 3 for Single Channel Speech Enhancement Using Temporal Convolutional Recurrent Neural Networks
Figure 4 for Single Channel Speech Enhancement Using Temporal Convolutional Recurrent Neural Networks
Viaarxiv icon

Deficient Basis Estimation of Noise Spatial Covariance Matrix for Rank-Constrained Spatial Covariance Matrix Estimation Method in Blind Speech Extraction

May 06, 2021
Yuto Kondo, Yuki Kubo, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari

Figure 1 for Deficient Basis Estimation of Noise Spatial Covariance Matrix for Rank-Constrained Spatial Covariance Matrix Estimation Method in Blind Speech Extraction
Figure 2 for Deficient Basis Estimation of Noise Spatial Covariance Matrix for Rank-Constrained Spatial Covariance Matrix Estimation Method in Blind Speech Extraction
Figure 3 for Deficient Basis Estimation of Noise Spatial Covariance Matrix for Rank-Constrained Spatial Covariance Matrix Estimation Method in Blind Speech Extraction
Figure 4 for Deficient Basis Estimation of Noise Spatial Covariance Matrix for Rank-Constrained Spatial Covariance Matrix Estimation Method in Blind Speech Extraction
Viaarxiv icon

Learning to Rank Microphones for Distant Speech Recognition

Add code
Bookmark button
Alert button
Apr 13, 2021
Samuele Cornell, Alessio Brutti, Marco Matassoni, Stefano Squartini

Figure 1 for Learning to Rank Microphones for Distant Speech Recognition
Figure 2 for Learning to Rank Microphones for Distant Speech Recognition
Figure 3 for Learning to Rank Microphones for Distant Speech Recognition
Figure 4 for Learning to Rank Microphones for Distant Speech Recognition
Viaarxiv icon