Alert button

"speech": models, code, and papers
Alert button

Mathematical Vocoder Algorithm : Modified Spectral Inversion for Efficient Neural Speech Synthesis

Add code
Bookmark button
Alert button
Jun 16, 2021
Hyun Gon Ryu, Jeong-Hoon Kim, Simon See

Figure 1 for Mathematical Vocoder Algorithm : Modified Spectral Inversion for Efficient Neural Speech Synthesis
Figure 2 for Mathematical Vocoder Algorithm : Modified Spectral Inversion for Efficient Neural Speech Synthesis
Figure 3 for Mathematical Vocoder Algorithm : Modified Spectral Inversion for Efficient Neural Speech Synthesis
Figure 4 for Mathematical Vocoder Algorithm : Modified Spectral Inversion for Efficient Neural Speech Synthesis
Viaarxiv icon

End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks

Add code
Bookmark button
Alert button
Apr 27, 2021
Rodrigo Mira, Konstantinos Vougioukas, Pingchuan Ma, Stavros Petridis, Björn W. Schuller, Maja Pantic

Figure 1 for End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks
Figure 2 for End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks
Figure 3 for End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks
Figure 4 for End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks
Viaarxiv icon

Codes, Patterns and Shapes of Contemporary Online Antisemitism and Conspiracy Narratives -- an Annotation Guide and Labeled German-Language Dataset in the Context of COVID-19

Oct 13, 2022
Elisabeth Steffen, Helena Mihaljević, Milena Pustet, Nyco Bischoff, María do Mar Castro Varela, Yener Bayramoğlu, Bahar Oghalai

Figure 1 for Codes, Patterns and Shapes of Contemporary Online Antisemitism and Conspiracy Narratives -- an Annotation Guide and Labeled German-Language Dataset in the Context of COVID-19
Viaarxiv icon

An Investigation of End-to-End Models for Robust Speech Recognition

Add code
Bookmark button
Alert button
Feb 11, 2021
Archiki Prasad, Preethi Jyothi, Rajbabu Velmurugan

Figure 1 for An Investigation of End-to-End Models for Robust Speech Recognition
Figure 2 for An Investigation of End-to-End Models for Robust Speech Recognition
Figure 3 for An Investigation of End-to-End Models for Robust Speech Recognition
Figure 4 for An Investigation of End-to-End Models for Robust Speech Recognition
Viaarxiv icon

ASR4REAL: An extended benchmark for speech models

Oct 16, 2021
Morgane Riviere, Jade Copet, Gabriel Synnaeve

Figure 1 for ASR4REAL: An extended benchmark for speech models
Figure 2 for ASR4REAL: An extended benchmark for speech models
Figure 3 for ASR4REAL: An extended benchmark for speech models
Figure 4 for ASR4REAL: An extended benchmark for speech models
Viaarxiv icon

The Role of Phonetic Units in Speech Emotion Recognition

Aug 02, 2021
Jiahong Yuan, Xingyu Cai, Renjie Zheng, Liang Huang, Kenneth Church

Figure 1 for The Role of Phonetic Units in Speech Emotion Recognition
Figure 2 for The Role of Phonetic Units in Speech Emotion Recognition
Figure 3 for The Role of Phonetic Units in Speech Emotion Recognition
Figure 4 for The Role of Phonetic Units in Speech Emotion Recognition
Viaarxiv icon

Multimodal generation of upper-facial and head gestures with a Transformer Network using speech and text

Add code
Bookmark button
Alert button
Oct 09, 2021
Mireille Fares, Catherine Pelachaud, Nicolas Obin

Figure 1 for Multimodal generation of upper-facial and head gestures with a Transformer Network using speech and text
Figure 2 for Multimodal generation of upper-facial and head gestures with a Transformer Network using speech and text
Figure 3 for Multimodal generation of upper-facial and head gestures with a Transformer Network using speech and text
Figure 4 for Multimodal generation of upper-facial and head gestures with a Transformer Network using speech and text
Viaarxiv icon

AutoLV: Automatic Lecture Video Generator

Sep 19, 2022
Wenbin Wang, Yang Song, Sanjay Jha

Figure 1 for AutoLV: Automatic Lecture Video Generator
Figure 2 for AutoLV: Automatic Lecture Video Generator
Figure 3 for AutoLV: Automatic Lecture Video Generator
Figure 4 for AutoLV: Automatic Lecture Video Generator
Viaarxiv icon

PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition

Add code
Bookmark button
Alert button
Jun 10, 2021
Cheng-I Jeff Lai, Yang Zhang, Alexander H. Liu, Shiyu Chang, Yi-Lun Liao, Yung-Sung Chuang, Kaizhi Qian, Sameer Khurana, David Cox, James Glass

Figure 1 for PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition
Figure 2 for PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition
Figure 3 for PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition
Figure 4 for PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition
Viaarxiv icon

Convolutional Learning on Multigraphs

Sep 23, 2022
Landon Butler, Alejandro Parada-Mayorga, Alejandro Ribeiro

Figure 1 for Convolutional Learning on Multigraphs
Figure 2 for Convolutional Learning on Multigraphs
Figure 3 for Convolutional Learning on Multigraphs
Figure 4 for Convolutional Learning on Multigraphs
Viaarxiv icon