Alert button

"speech": models, code, and papers
Alert button

An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition

Jul 22, 2021
Ruchao Fan, Wei Chu, Peng Chang, Jing Xiao, Abeer Alwan

Figure 1 for An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition
Figure 2 for An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition
Figure 3 for An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition
Figure 4 for An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition
Viaarxiv icon

Deficient Basis Estimation of Noise Spatial Covariance Matrix for Rank-Constrained Spatial Covariance Matrix Estimation Method in Blind Speech Extraction

May 06, 2021
Yuto Kondo, Yuki Kubo, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari

Figure 1 for Deficient Basis Estimation of Noise Spatial Covariance Matrix for Rank-Constrained Spatial Covariance Matrix Estimation Method in Blind Speech Extraction
Figure 2 for Deficient Basis Estimation of Noise Spatial Covariance Matrix for Rank-Constrained Spatial Covariance Matrix Estimation Method in Blind Speech Extraction
Figure 3 for Deficient Basis Estimation of Noise Spatial Covariance Matrix for Rank-Constrained Spatial Covariance Matrix Estimation Method in Blind Speech Extraction
Figure 4 for Deficient Basis Estimation of Noise Spatial Covariance Matrix for Rank-Constrained Spatial Covariance Matrix Estimation Method in Blind Speech Extraction
Viaarxiv icon

Emotion Recognition from Speech

Add code
Bookmark button
Alert button
Dec 22, 2019
Kannan Venkataramanan, Haresh Rengaraj Rajamohan

Figure 1 for Emotion Recognition from Speech
Figure 2 for Emotion Recognition from Speech
Figure 3 for Emotion Recognition from Speech
Figure 4 for Emotion Recognition from Speech
Viaarxiv icon

Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam

Add code
Bookmark button
Alert button
Jan 23, 2020
Marc Delcroix, Tsubasa Ochiai, Katerina Zmolikova, Keisuke Kinoshita, Naohiro Tawara, Tomohiro Nakatani, Shoko Araki

Figure 1 for Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam
Figure 2 for Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam
Figure 3 for Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam
Figure 4 for Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam
Viaarxiv icon

Voice Conversion Can Improve ASR in Very Low-Resource Settings

Add code
Bookmark button
Alert button
Nov 04, 2021
Matthew Baas, Herman Kamper

Figure 1 for Voice Conversion Can Improve ASR in Very Low-Resource Settings
Figure 2 for Voice Conversion Can Improve ASR in Very Low-Resource Settings
Figure 3 for Voice Conversion Can Improve ASR in Very Low-Resource Settings
Figure 4 for Voice Conversion Can Improve ASR in Very Low-Resource Settings
Viaarxiv icon

Adversarial Feature Learning and Unsupervised Clustering based Speech Synthesis for Found Data with Acoustic and Textual Noise

Add code
Bookmark button
Alert button
Apr 28, 2020
Shan Yang, Yuxuan Wang, Lei Xie

Figure 1 for Adversarial Feature Learning and Unsupervised Clustering based Speech Synthesis for Found Data with Acoustic and Textual Noise
Figure 2 for Adversarial Feature Learning and Unsupervised Clustering based Speech Synthesis for Found Data with Acoustic and Textual Noise
Figure 3 for Adversarial Feature Learning and Unsupervised Clustering based Speech Synthesis for Found Data with Acoustic and Textual Noise
Viaarxiv icon

Multilingual and Multi-Aspect Hate Speech Analysis

Add code
Bookmark button
Alert button
Aug 29, 2019
Nedjma Ousidhoum, Zizheng Lin, Hongming Zhang, Yangqiu Song, Dit-Yan Yeung

Figure 1 for Multilingual and Multi-Aspect Hate Speech Analysis
Figure 2 for Multilingual and Multi-Aspect Hate Speech Analysis
Figure 3 for Multilingual and Multi-Aspect Hate Speech Analysis
Figure 4 for Multilingual and Multi-Aspect Hate Speech Analysis
Viaarxiv icon

Optimizing Speech Emotion Recognition using Manta-Ray Based Feature Selection

Sep 18, 2020
Soham Chattopadhyay, Arijit Dey, Hritam Basak

Figure 1 for Optimizing Speech Emotion Recognition using Manta-Ray Based Feature Selection
Figure 2 for Optimizing Speech Emotion Recognition using Manta-Ray Based Feature Selection
Figure 3 for Optimizing Speech Emotion Recognition using Manta-Ray Based Feature Selection
Figure 4 for Optimizing Speech Emotion Recognition using Manta-Ray Based Feature Selection
Viaarxiv icon

PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss

Aug 11, 2020
Umut Isik, Ritwik Giri, Neerad Phansalkar, Jean-Marc Valin, Karim Helwani, Arvindh Krishnaswamy

Figure 1 for PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss
Figure 2 for PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss
Figure 3 for PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss
Figure 4 for PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss
Viaarxiv icon

Advancing CTC-CRF Based End-to-End Speech Recognition with Wordpieces and Conformers

Add code
Bookmark button
Alert button
Jul 08, 2021
Huahuan Zheng, Wenjie Peng, Zhijian Ou, Jinsong Zhang

Figure 1 for Advancing CTC-CRF Based End-to-End Speech Recognition with Wordpieces and Conformers
Figure 2 for Advancing CTC-CRF Based End-to-End Speech Recognition with Wordpieces and Conformers
Figure 3 for Advancing CTC-CRF Based End-to-End Speech Recognition with Wordpieces and Conformers
Figure 4 for Advancing CTC-CRF Based End-to-End Speech Recognition with Wordpieces and Conformers
Viaarxiv icon