Picture for Gustav Eje Henter

Gustav Eje Henter

Prosody-controllable spontaneous TTS with neural HMMs

Add code
Nov 24, 2022
Viaarxiv icon

Listen, denoise, action! Audio-driven motion synthesis with diffusion models

Add code
Nov 17, 2022
Figure 1 for Listen, denoise, action! Audio-driven motion synthesis with diffusion models
Figure 2 for Listen, denoise, action! Audio-driven motion synthesis with diffusion models
Figure 3 for Listen, denoise, action! Audio-driven motion synthesis with diffusion models
Figure 4 for Listen, denoise, action! Audio-driven motion synthesis with diffusion models
Viaarxiv icon

OverFlow: Putting flows on top of neural transducers for better TTS

Add code
Nov 13, 2022
Figure 1 for OverFlow: Putting flows on top of neural transducers for better TTS
Figure 2 for OverFlow: Putting flows on top of neural transducers for better TTS
Figure 3 for OverFlow: Putting flows on top of neural transducers for better TTS
Figure 4 for OverFlow: Putting flows on top of neural transducers for better TTS
Viaarxiv icon

Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing

Add code
Nov 13, 2022
Viaarxiv icon

Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks

Add code
Sep 22, 2022
Figure 1 for Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks
Figure 2 for Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks
Figure 3 for Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks
Figure 4 for Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks
Viaarxiv icon

The GENEA Challenge 2022: A large evaluation of data-driven co-speech gesture generation

Add code
Aug 22, 2022
Figure 1 for The GENEA Challenge 2022: A large evaluation of data-driven co-speech gesture generation
Figure 2 for The GENEA Challenge 2022: A large evaluation of data-driven co-speech gesture generation
Figure 3 for The GENEA Challenge 2022: A large evaluation of data-driven co-speech gesture generation
Figure 4 for The GENEA Challenge 2022: A large evaluation of data-driven co-speech gesture generation
Viaarxiv icon

Wavebender GAN: An architecture for phonetically meaningful speech manipulation

Add code
Feb 22, 2022
Figure 1 for Wavebender GAN: An architecture for phonetically meaningful speech manipulation
Figure 2 for Wavebender GAN: An architecture for phonetically meaningful speech manipulation
Figure 3 for Wavebender GAN: An architecture for phonetically meaningful speech manipulation
Figure 4 for Wavebender GAN: An architecture for phonetically meaningful speech manipulation
Viaarxiv icon

Neural HMMs are all you need (for high-quality attention-free TTS)

Add code
Sep 03, 2021
Figure 1 for Neural HMMs are all you need (for high-quality attention-free TTS)
Figure 2 for Neural HMMs are all you need (for high-quality attention-free TTS)
Viaarxiv icon

Integrated Speech and Gesture Synthesis

Add code
Aug 25, 2021
Figure 1 for Integrated Speech and Gesture Synthesis
Figure 2 for Integrated Speech and Gesture Synthesis
Figure 3 for Integrated Speech and Gesture Synthesis
Figure 4 for Integrated Speech and Gesture Synthesis
Viaarxiv icon

Multimodal analysis of the predictability of hand-gesture properties

Add code
Aug 12, 2021
Figure 1 for Multimodal analysis of the predictability of hand-gesture properties
Figure 2 for Multimodal analysis of the predictability of hand-gesture properties
Figure 3 for Multimodal analysis of the predictability of hand-gesture properties
Figure 4 for Multimodal analysis of the predictability of hand-gesture properties
Viaarxiv icon