Simon King

Can we reconstruct a dysarthric voice with the large speech model Parler TTS?

Jun 04, 2025

Segmentation-Variant Codebooks for Preservation of Paralinguistic and Prosodic Information

May 21, 2025

Learning Nonlinear Dynamics in Physical Modelling Synthesis using Neural Ordinary Differential Equations

May 15, 2025

Do Discrete Self-Supervised Representations of Speech Capture Tone Distinctions?

Oct 25, 2024

Enabling Beam Search for Language Model-Based Text-to-Speech Synthesis

Aug 29, 2024

Natural language guidance of high-fidelity text-to-speech with synthetic annotations

Feb 02, 2024

Differentiable Grey-box Modelling of Phaser Effects using Frame-based Spectral Processing

Jun 02, 2023

Using a Large Language Model to Control Speaking Style for Expressive TTS

May 17, 2023

Ensemble prosody prediction for expressive speech synthesis

Apr 03, 2023

Do Prosody Transfer Models Transfer Prosody?

Mar 07, 2023