Alert button
Picture for Shivam Mehta

Shivam Mehta

Alert button

Unified speech and gesture synthesis using flow matching

Add code
Bookmark button
Alert button
Oct 08, 2023
Shivam Mehta, Ruibo Tu, Simon Alexanderson, Jonas Beskow, Éva Székely, Gustav Eje Henter

Viaarxiv icon

Diffusion-Based Co-Speech Gesture Generation Using Joint Text and Audio Representation

Add code
Bookmark button
Alert button
Sep 11, 2023
Anna Deichler, Shivam Mehta, Simon Alexanderson, Jonas Beskow

Figure 1 for Diffusion-Based Co-Speech Gesture Generation Using Joint Text and Audio Representation
Figure 2 for Diffusion-Based Co-Speech Gesture Generation Using Joint Text and Audio Representation
Figure 3 for Diffusion-Based Co-Speech Gesture Generation Using Joint Text and Audio Representation
Figure 4 for Diffusion-Based Co-Speech Gesture Generation Using Joint Text and Audio Representation
Viaarxiv icon

Matcha-TTS: A fast TTS architecture with conditional flow matching

Add code
Bookmark button
Alert button
Sep 06, 2023
Shivam Mehta, Ruibo Tu, Jonas Beskow, Éva Székely, Gustav Eje Henter

Figure 1 for Matcha-TTS: A fast TTS architecture with conditional flow matching
Figure 2 for Matcha-TTS: A fast TTS architecture with conditional flow matching
Figure 3 for Matcha-TTS: A fast TTS architecture with conditional flow matching
Figure 4 for Matcha-TTS: A fast TTS architecture with conditional flow matching
Viaarxiv icon

Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis

Add code
Bookmark button
Alert button
Jun 15, 2023
Shivam Mehta, Siyang Wang, Simon Alexanderson, Jonas Beskow, Éva Székely, Gustav Eje Henter

Figure 1 for Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis
Figure 2 for Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis
Figure 3 for Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis
Viaarxiv icon

Prosody-controllable spontaneous TTS with neural HMMs

Add code
Bookmark button
Alert button
Nov 24, 2022
Harm Lameris, Shivam Mehta, Gustav Eje Henter, Joakim Gustafson, Éva Székely

Figure 1 for Prosody-controllable spontaneous TTS with neural HMMs
Figure 2 for Prosody-controllable spontaneous TTS with neural HMMs
Figure 3 for Prosody-controllable spontaneous TTS with neural HMMs
Figure 4 for Prosody-controllable spontaneous TTS with neural HMMs
Viaarxiv icon

OverFlow: Putting flows on top of neural transducers for better TTS

Add code
Bookmark button
Alert button
Nov 13, 2022
Shivam Mehta, Ambika Kirkland, Harm Lameris, Jonas Beskow, Éva Székely, Gustav Eje Henter

Figure 1 for OverFlow: Putting flows on top of neural transducers for better TTS
Figure 2 for OverFlow: Putting flows on top of neural transducers for better TTS
Figure 3 for OverFlow: Putting flows on top of neural transducers for better TTS
Figure 4 for OverFlow: Putting flows on top of neural transducers for better TTS
Viaarxiv icon

Neural HMMs are all you need (for high-quality attention-free TTS)

Add code
Bookmark button
Alert button
Sep 03, 2021
Shivam Mehta, Éva Székely, Jonas Beskow, Gustav Eje Henter

Figure 1 for Neural HMMs are all you need (for high-quality attention-free TTS)
Figure 2 for Neural HMMs are all you need (for high-quality attention-free TTS)
Viaarxiv icon