Alert button

"speech": models, code, and papers
Alert button

Monaural Multi-Talker Speech Recognition using Factorial Speech Processing Models

Oct 05, 2016
Mahdi Khademian, Mohammad Mehdi Homayounpour

Figure 1 for Monaural Multi-Talker Speech Recognition using Factorial Speech Processing Models
Figure 2 for Monaural Multi-Talker Speech Recognition using Factorial Speech Processing Models
Figure 3 for Monaural Multi-Talker Speech Recognition using Factorial Speech Processing Models
Figure 4 for Monaural Multi-Talker Speech Recognition using Factorial Speech Processing Models
Viaarxiv icon

Perceptual-based deep-learning denoiser as a defense against adversarial attacks on ASR systems

Add code
Bookmark button
Alert button
Jul 12, 2021
Anirudh Sreeram, Nicholas Mehlman, Raghuveer Peri, Dillon Knox, Shrikanth Narayanan

Figure 1 for Perceptual-based deep-learning denoiser as a defense against adversarial attacks on ASR systems
Figure 2 for Perceptual-based deep-learning denoiser as a defense against adversarial attacks on ASR systems
Figure 3 for Perceptual-based deep-learning denoiser as a defense against adversarial attacks on ASR systems
Figure 4 for Perceptual-based deep-learning denoiser as a defense against adversarial attacks on ASR systems
Viaarxiv icon

TalkTive: A Conversational Agent Using Backchannels to Engage Older Adults in Neurocognitive Disorders Screening

Feb 16, 2022
Zijian Ding, Jiawen Kang, Tinky Oi Ting HO, Ka Ho Wong, Helene H. Fung, Helen Meng, Xiaojuan Ma

Figure 1 for TalkTive: A Conversational Agent Using Backchannels to Engage Older Adults in Neurocognitive Disorders Screening
Figure 2 for TalkTive: A Conversational Agent Using Backchannels to Engage Older Adults in Neurocognitive Disorders Screening
Figure 3 for TalkTive: A Conversational Agent Using Backchannels to Engage Older Adults in Neurocognitive Disorders Screening
Figure 4 for TalkTive: A Conversational Agent Using Backchannels to Engage Older Adults in Neurocognitive Disorders Screening
Viaarxiv icon

Pre-training in Deep Reinforcement Learning for Automatic Speech Recognition

Oct 24, 2019
Thejan Rajapakshe, Rajib Rana, Siddique Latif, Sara Khalifa, Björn W. Schuller

Figure 1 for Pre-training in Deep Reinforcement Learning for Automatic Speech Recognition
Figure 2 for Pre-training in Deep Reinforcement Learning for Automatic Speech Recognition
Figure 3 for Pre-training in Deep Reinforcement Learning for Automatic Speech Recognition
Figure 4 for Pre-training in Deep Reinforcement Learning for Automatic Speech Recognition
Viaarxiv icon

cif-based collaborative decoding for end-to-end contextual speech recognition

Add code
Bookmark button
Alert button
Dec 17, 2020
Minglun Han, Linhao Dong, Shiyu Zhou, Bo Xu

Figure 1 for cif-based collaborative decoding for end-to-end contextual speech recognition
Figure 2 for cif-based collaborative decoding for end-to-end contextual speech recognition
Figure 3 for cif-based collaborative decoding for end-to-end contextual speech recognition
Figure 4 for cif-based collaborative decoding for end-to-end contextual speech recognition
Viaarxiv icon

A Hierarchical Model for Spoken Language Recognition

Add code
Bookmark button
Alert button
Jan 04, 2022
Luciana Ferrer, Diego Castan, Mitchell McLaren, Aaron Lawson

Figure 1 for A Hierarchical Model for Spoken Language Recognition
Figure 2 for A Hierarchical Model for Spoken Language Recognition
Figure 3 for A Hierarchical Model for Spoken Language Recognition
Figure 4 for A Hierarchical Model for Spoken Language Recognition
Viaarxiv icon

Unsupervised Representation Learning with Future Observation Prediction for Speech Emotion Recognition

Oct 24, 2019
Zheng Lian, Jianhua Tao, Bin Liu, Jian Huang

Figure 1 for Unsupervised Representation Learning with Future Observation Prediction for Speech Emotion Recognition
Figure 2 for Unsupervised Representation Learning with Future Observation Prediction for Speech Emotion Recognition
Figure 3 for Unsupervised Representation Learning with Future Observation Prediction for Speech Emotion Recognition
Figure 4 for Unsupervised Representation Learning with Future Observation Prediction for Speech Emotion Recognition
Viaarxiv icon

Audio-Based Deep Learning Frameworks for Detecting COVID-19

Feb 10, 2022
Dat Ngo, Lam Pham, Truong Hoang, Sefki Kolozali, Delaram Jarchi

Figure 1 for Audio-Based Deep Learning Frameworks for Detecting COVID-19
Figure 2 for Audio-Based Deep Learning Frameworks for Detecting COVID-19
Figure 3 for Audio-Based Deep Learning Frameworks for Detecting COVID-19
Figure 4 for Audio-Based Deep Learning Frameworks for Detecting COVID-19
Viaarxiv icon

A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition

Add code
Bookmark button
Alert button
May 20, 2020
Dongwei Jiang, Wubo Li, Ruixiong Zhang, Miao Cao, Ne Luo, Yang Han, Wei Zou, Xiangang Li

Figure 1 for A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition
Figure 2 for A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition
Figure 3 for A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition
Figure 4 for A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition
Viaarxiv icon

Supervised and Unsupervised Approaches for Controlling Narrow Lexical Focus in Sequence-to-Sequence Speech Synthesis

Jan 25, 2021
Slava Shechtman, Raul Fernandez, David Haws

Figure 1 for Supervised and Unsupervised Approaches for Controlling Narrow Lexical Focus in Sequence-to-Sequence Speech Synthesis
Figure 2 for Supervised and Unsupervised Approaches for Controlling Narrow Lexical Focus in Sequence-to-Sequence Speech Synthesis
Figure 3 for Supervised and Unsupervised Approaches for Controlling Narrow Lexical Focus in Sequence-to-Sequence Speech Synthesis
Figure 4 for Supervised and Unsupervised Approaches for Controlling Narrow Lexical Focus in Sequence-to-Sequence Speech Synthesis
Viaarxiv icon