"speech": models, code, and papers

Multi-Window Data Augmentation Approach for Speech Emotion Recognition

Oct 28, 2020
Sarala Padi, Dinesh Manocha, Ram D. Sriram

Continual Speaker Adaptation for Text-to-Speech Synthesis

Mar 26, 2021
Hamed Hemati, Damian Borth

Indoor optical fiber eavesdropping approach and its avoidance

Jul 12, 2022
Haiqing Hao, Zhongwang Pang, Guan Wang, Bo Wang

Audio-visual Speech Separation with Adversarially Disentangled Visual Representation

Nov 29, 2020
Peng Zhang, Jiaming Xu, Jing Shi, Yunzhe Hao, Bo Xu

Machine Learning based COVID-19 Detection from Smartphone Recordings: Cough, Breath and Speech

Apr 02, 2021
Madhurananda Pahar, Thomas Niesler

Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition

Sep 14, 2021
Felix Wu, Kwangyoun Kim, Jing Pan, Kyu Han, Kilian Q. Weinberger, Yoav Artzi

Continuously Controllable Facial Expression Editing in Talking Face Videos

Sep 17, 2022
Zhiyao Sun, Yu-Hui Wen, Tian Lv, Yanan Sun, Ziyang Zhang, Yaoyuan Wang, Yong-Jin Liu

A Post Auto-regressive GAN Vocoder Focused on Spectrum Fracture

Apr 12, 2022
Zhenxing Lu, Mengnan He, Ruixiong Zhang, Caixia Gong

Conformal prediction for text infilling and part-of-speech prediction

Nov 04, 2021
Neil Dey, Jing Ding, Jack Ferrell, Carolina Kapper, Maxwell Lovig, Emiliano Planchon, Jonathan P Williams

On the Role of Style in Parsing Speech with Neural Models

Oct 08, 2020
Trang Tran, Jiahong Yuan, Yang Liu, Mari Ostendorf
