Alert button

"speech": models, code, and papers
Alert button

TS-RIR: Translated synthetic room impulse responses for speech augmentation

Add code
Bookmark button
Alert button
Apr 03, 2021
Anton Ratnarajah, Zhenyu Tang, Dinesh Manocha

Figure 1 for TS-RIR: Translated synthetic room impulse responses for speech augmentation
Figure 2 for TS-RIR: Translated synthetic room impulse responses for speech augmentation
Figure 3 for TS-RIR: Translated synthetic room impulse responses for speech augmentation
Figure 4 for TS-RIR: Translated synthetic room impulse responses for speech augmentation
Viaarxiv icon

Formant Estimation and Tracking using Probabilistic Heat-Maps

Add code
Bookmark button
Alert button
Jun 23, 2022
Yosi Shrem, Felix Kreuk, Joseph Keshet

Figure 1 for Formant Estimation and Tracking using Probabilistic Heat-Maps
Figure 2 for Formant Estimation and Tracking using Probabilistic Heat-Maps
Figure 3 for Formant Estimation and Tracking using Probabilistic Heat-Maps
Figure 4 for Formant Estimation and Tracking using Probabilistic Heat-Maps
Viaarxiv icon

Provable Subspace Identification Under Post-Nonlinear Mixtures

Oct 14, 2022
Qi Lyu, Xiao Fu

Figure 1 for Provable Subspace Identification Under Post-Nonlinear Mixtures
Figure 2 for Provable Subspace Identification Under Post-Nonlinear Mixtures
Viaarxiv icon

End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks

Add code
Bookmark button
Alert button
Apr 30, 2021
Rodrigo Mira, Konstantinos Vougioukas, Pingchuan Ma, Stavros Petridis, Björn W. Schuller, Maja Pantic

Figure 1 for End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks
Figure 2 for End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks
Figure 3 for End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks
Figure 4 for End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks
Viaarxiv icon

Out-of-Distribution Representation Learning for Time Series Classification

Add code
Bookmark button
Alert button
Sep 26, 2022
Wang Lu, Jindong Wang, Xinwei Sun, Yiqiang Chen, Xing Xie

Figure 1 for Out-of-Distribution Representation Learning for Time Series Classification
Figure 2 for Out-of-Distribution Representation Learning for Time Series Classification
Figure 3 for Out-of-Distribution Representation Learning for Time Series Classification
Figure 4 for Out-of-Distribution Representation Learning for Time Series Classification
Viaarxiv icon

BSTC: A Large-Scale Chinese-English Speech Translation Dataset

Add code
Bookmark button
Alert button
Apr 19, 2021
Ruiqing Zhang, Xiyang Wang, Chuanqiang Zhang, Zhongjun He, Hua Wu, Zhi Li, Haifeng Wang, Ying Chen, Qinfei Li

Figure 1 for BSTC: A Large-Scale Chinese-English Speech Translation Dataset
Figure 2 for BSTC: A Large-Scale Chinese-English Speech Translation Dataset
Figure 3 for BSTC: A Large-Scale Chinese-English Speech Translation Dataset
Figure 4 for BSTC: A Large-Scale Chinese-English Speech Translation Dataset
Viaarxiv icon

Unsupervised Cross-Lingual Speech Emotion Recognition Using Pseudo Multilabel

Add code
Bookmark button
Alert button
Aug 19, 2021
Jin Li, Nan Yan, Lan Wang

Figure 1 for Unsupervised Cross-Lingual Speech Emotion Recognition Using Pseudo Multilabel
Figure 2 for Unsupervised Cross-Lingual Speech Emotion Recognition Using Pseudo Multilabel
Figure 3 for Unsupervised Cross-Lingual Speech Emotion Recognition Using Pseudo Multilabel
Figure 4 for Unsupervised Cross-Lingual Speech Emotion Recognition Using Pseudo Multilabel
Viaarxiv icon

Personalized Speech Enhancement through Self-Supervised Data Augmentation and Purification

Apr 05, 2021
Aswin Sivaraman, Sunwoo Kim, Minje Kim

Figure 1 for Personalized Speech Enhancement through Self-Supervised Data Augmentation and Purification
Figure 2 for Personalized Speech Enhancement through Self-Supervised Data Augmentation and Purification
Figure 3 for Personalized Speech Enhancement through Self-Supervised Data Augmentation and Purification
Viaarxiv icon

Position Prediction as an Effective Pretraining Strategy

Add code
Bookmark button
Alert button
Jul 15, 2022
Shuangfei Zhai, Navdeep Jaitly, Jason Ramapuram, Dan Busbridge, Tatiana Likhomanenko, Joseph Yitan Cheng, Walter Talbott, Chen Huang, Hanlin Goh, Joshua Susskind

Figure 1 for Position Prediction as an Effective Pretraining Strategy
Figure 2 for Position Prediction as an Effective Pretraining Strategy
Figure 3 for Position Prediction as an Effective Pretraining Strategy
Figure 4 for Position Prediction as an Effective Pretraining Strategy
Viaarxiv icon

Understanding effect of speech perception in EEG based speech recognition systems

May 29, 2020
Gautam Krishna, Co Tran, Mason Carnahan, Ahmed Tewfik

Figure 1 for Understanding effect of speech perception in EEG based speech recognition systems
Figure 2 for Understanding effect of speech perception in EEG based speech recognition systems
Figure 3 for Understanding effect of speech perception in EEG based speech recognition systems
Figure 4 for Understanding effect of speech perception in EEG based speech recognition systems
Viaarxiv icon