Simon King

Natural language guidance of high-fidelity text-to-speech with synthetic annotations

Feb 02, 2024
Dan Lyth, Simon King


Differentiable Grey-box Modelling of Phaser Effects using Frame-based Spectral Processing

Jun 02, 2023
Alistair Carson, Cassia Valentini-Botinhao, Simon King, Stefan Bilbao


Using a Large Language Model to Control Speaking Style for Expressive TTS

May 17, 2023
Atli Thor Sigurgeirsson, Simon King


Ensemble prosody prediction for expressive speech synthesis

Apr 03, 2023
Tian Huey Teh, Vivian Hu, Devang S Ram Mohan, Zack Hodari, Christopher G. R. Wallis, Tomás Gomez Ibarrondo, Alexandra Torresquintero, James Leoni, Mark Gales, Simon King


Do Prosody Transfer Models Transfer Prosody?

Mar 07, 2023
Atli Thor Sigurgeirsson, Simon King


Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing

Nov 13, 2022
Jacob J Webber, Cassia Valentini-Botinhao, Evelyn Williams, Gustav Eje Henter, Simon King


Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis

Jun 15, 2021
Devang S Ram Mohan, Vivian Hu, Tian Huey Teh, Alexandra Torresquintero, Christopher G. R. Wallis, Marlene Staib, Lorenzo Foglianti, Jiameng Gao, Simon King


ADEPT: A Dataset for Evaluating Prosody Transfer

Jun 15, 2021
Alexandra Torresquintero, Tian Huey Teh, Christopher G. R. Wallis, Marlene Staib, Devang S Ram Mohan, Vivian Hu, Lorenzo Foglianti, Jiameng Gao, Simon King


Using previous acoustic context to improve Text-to-Speech synthesis

Dec 07, 2020
Pilar Oplustil-Gallegos, Simon King


Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0

Mar 14, 2020
Zack Hodari, Catherine Lai, Simon King
