Picture for Gustav Eje Henter

Gustav Eje Henter

Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing

Add code
Nov 13, 2022
Viaarxiv icon

Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks

Add code
Sep 22, 2022
Figure 1 for Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks
Figure 2 for Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks
Figure 3 for Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks
Figure 4 for Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks
Viaarxiv icon

The GENEA Challenge 2022: A large evaluation of data-driven co-speech gesture generation

Add code
Aug 22, 2022
Figure 1 for The GENEA Challenge 2022: A large evaluation of data-driven co-speech gesture generation
Figure 2 for The GENEA Challenge 2022: A large evaluation of data-driven co-speech gesture generation
Figure 3 for The GENEA Challenge 2022: A large evaluation of data-driven co-speech gesture generation
Figure 4 for The GENEA Challenge 2022: A large evaluation of data-driven co-speech gesture generation
Viaarxiv icon

Wavebender GAN: An architecture for phonetically meaningful speech manipulation

Add code
Feb 22, 2022
Figure 1 for Wavebender GAN: An architecture for phonetically meaningful speech manipulation
Figure 2 for Wavebender GAN: An architecture for phonetically meaningful speech manipulation
Figure 3 for Wavebender GAN: An architecture for phonetically meaningful speech manipulation
Figure 4 for Wavebender GAN: An architecture for phonetically meaningful speech manipulation
Viaarxiv icon

Neural HMMs are all you need (for high-quality attention-free TTS)

Add code
Sep 03, 2021
Figure 1 for Neural HMMs are all you need (for high-quality attention-free TTS)
Figure 2 for Neural HMMs are all you need (for high-quality attention-free TTS)
Viaarxiv icon

Integrated Speech and Gesture Synthesis

Add code
Aug 25, 2021
Figure 1 for Integrated Speech and Gesture Synthesis
Figure 2 for Integrated Speech and Gesture Synthesis
Figure 3 for Integrated Speech and Gesture Synthesis
Figure 4 for Integrated Speech and Gesture Synthesis
Viaarxiv icon

Multimodal analysis of the predictability of hand-gesture properties

Add code
Aug 12, 2021
Figure 1 for Multimodal analysis of the predictability of hand-gesture properties
Figure 2 for Multimodal analysis of the predictability of hand-gesture properties
Figure 3 for Multimodal analysis of the predictability of hand-gesture properties
Figure 4 for Multimodal analysis of the predictability of hand-gesture properties
Viaarxiv icon

Normalizing Flow based Hidden Markov Models for Classification of Speech Phones with Explainability

Add code
Jul 01, 2021
Figure 1 for Normalizing Flow based Hidden Markov Models for Classification of Speech Phones with Explainability
Figure 2 for Normalizing Flow based Hidden Markov Models for Classification of Speech Phones with Explainability
Figure 3 for Normalizing Flow based Hidden Markov Models for Classification of Speech Phones with Explainability
Figure 4 for Normalizing Flow based Hidden Markov Models for Classification of Speech Phones with Explainability
Viaarxiv icon

Speech2Properties2Gestures: Gesture-Property Prediction as a Tool for Generating Representational Gestures from Speech

Add code
Jun 28, 2021
Figure 1 for Speech2Properties2Gestures: Gesture-Property Prediction as a Tool for Generating Representational Gestures from Speech
Viaarxiv icon

Transflower: probabilistic autoregressive dance generation with multimodal attention

Add code
Jun 25, 2021
Figure 1 for Transflower: probabilistic autoregressive dance generation with multimodal attention
Figure 2 for Transflower: probabilistic autoregressive dance generation with multimodal attention
Figure 3 for Transflower: probabilistic autoregressive dance generation with multimodal attention
Figure 4 for Transflower: probabilistic autoregressive dance generation with multimodal attention
Viaarxiv icon