Picture for Tian Huey Teh

Tian Huey Teh

Papercup Technologies Ltd

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Ensemble prosody prediction for expressive speech synthesis

Add code
Apr 03, 2023
Figure 1 for Ensemble prosody prediction for expressive speech synthesis
Figure 2 for Ensemble prosody prediction for expressive speech synthesis
Figure 3 for Ensemble prosody prediction for expressive speech synthesis
Figure 4 for Ensemble prosody prediction for expressive speech synthesis
Viaarxiv icon

Controlling High-Dimensional Data With Sparse Input

Add code
Mar 14, 2023
Figure 1 for Controlling High-Dimensional Data With Sparse Input
Figure 2 for Controlling High-Dimensional Data With Sparse Input
Figure 3 for Controlling High-Dimensional Data With Sparse Input
Figure 4 for Controlling High-Dimensional Data With Sparse Input
Viaarxiv icon

Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis

Add code
Jun 15, 2021
Figure 1 for Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis
Figure 2 for Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis
Figure 3 for Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis
Figure 4 for Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis
Viaarxiv icon

ADEPT: A Dataset for Evaluating Prosody Transfer

Add code
Jun 15, 2021
Figure 1 for ADEPT: A Dataset for Evaluating Prosody Transfer
Figure 2 for ADEPT: A Dataset for Evaluating Prosody Transfer
Figure 3 for ADEPT: A Dataset for Evaluating Prosody Transfer
Viaarxiv icon

Incremental Text to Speech for Neural Sequence-to-Sequence Models using Reinforcement Learning

Add code
Aug 07, 2020
Figure 1 for Incremental Text to Speech for Neural Sequence-to-Sequence Models using Reinforcement Learning
Figure 2 for Incremental Text to Speech for Neural Sequence-to-Sequence Models using Reinforcement Learning
Figure 3 for Incremental Text to Speech for Neural Sequence-to-Sequence Models using Reinforcement Learning
Figure 4 for Incremental Text to Speech for Neural Sequence-to-Sequence Models using Reinforcement Learning
Viaarxiv icon

Phonological Features for 0-shot Multilingual Speech Synthesis

Add code
Aug 06, 2020
Figure 1 for Phonological Features for 0-shot Multilingual Speech Synthesis
Figure 2 for Phonological Features for 0-shot Multilingual Speech Synthesis
Figure 3 for Phonological Features for 0-shot Multilingual Speech Synthesis
Figure 4 for Phonological Features for 0-shot Multilingual Speech Synthesis
Viaarxiv icon