Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Thomas Drugman

Multi-Scale Spectrogram Modelling for Neural Text-to-Speech


Jun 29, 2021
Ammar Abbas, Bajibabu Bollepalli, Alexis Moinet, Arnaud Joly, Penny Karanasou, Peter Makarov, Simon Slangens, Sri Karlapati, Thomas Drugman

* Accepted for the 11th ISCA Speech Synthesis Workshop (SSW11) 

  Access Paper or Ask Questions

Voicy: Zero-Shot Non-Parallel Voice Conversion in Noisy Reverberant Environments


Jun 16, 2021
Alejandro Mottini, Jaime Lorenzo-Trueba, Sri Vishnu Kumar Karlapati, Thomas Drugman

* Presented at the Speech Synthesis Workshops 2021 (SSW11) 

  Access Paper or Ask Questions

A learned conditional prior for the VAE acoustic space of a TTS system


Jun 14, 2021
Penny Karanasou, Sri Karlapati, Alexis Moinet, Arnaud Joly, Ammar Abbas, Simon Slangen, Jaime Lorenzo Trueba, Thomas Drugman

* in Proceedings of Interspeech 2021 

  Access Paper or Ask Questions

Weakly-supervised word-level pronunciation error detection in non-native English speech


Jun 07, 2021
Daniel Korzekwa, Jaime Lorenzo-Trueba, Thomas Drugman, Shira Calamaro, Bozena Kostek

* Accepted to Interspeech 2021 

  Access Paper or Ask Questions

Mispronunciation Detection in Non-native (L2) English with Uncertainty Modeling


Feb 08, 2021
Daniel Korzekwa, Jaime Lorenzo-Trueba, Szymon Zaporowski, Shira Calamaro, Thomas Drugman, Bozena Kostek

* Accepted to ICASSP 2021 

  Access Paper or Ask Questions

EmoCat: Language-agnostic Emotional Voice Conversion


Jan 14, 2021
Bastian Schnell, Goeric Huybrechts, Bartek Perz, Thomas Drugman, Jaime Lorenzo-Trueba

* Submitted to IEEE ICASSP 2021 

  Access Paper or Ask Questions

Detection of Lexical Stress Errors in Non-native (L2) English with Data Augmentation and Attention


Dec 29, 2020
Daniel Korzekwa, Roberto Barra-Chicote, Szymon Zaporowski, Grzegorz Beringer, Jaime Lorenzo-Trueba, Alicja Serafinowicz, Jasha Droppo, Thomas Drugman, Bozena Kostek

* Submitted to ICASSP 2021 

  Access Paper or Ask Questions

Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech


Nov 04, 2020
Sri Karlapati, Ammar Abbas, Zack Hodari, Alexis Moinet, Arnaud Joly, Penny Karanasou, Thomas Drugman

* 5 pages and 3 figures 

  Access Paper or Ask Questions

Parametric Representation for Singing Voice Synthesis: a Comparative Evaluation


Jun 07, 2020
Onur Babacan, Thomas Drugman, Tuomo Raitio, Daniel Erro, Thierry Dutoit


  Access Paper or Ask Questions

Maximum Phase Modeling for Sparse Linear Prediction of Speech


Jun 07, 2020
Thomas Drugman


  Access Paper or Ask Questions

Analysis and Synthesis of Hypo and Hyperarticulated Speech


Jun 07, 2020
Benjamin Picart, Thomas Drugman, Thierry Dutoit


  Access Paper or Ask Questions

Residual Excitation Skewness for Automatic Speech Polarity Detection


May 31, 2020
Thomas Drugman


  Access Paper or Ask Questions

Maximum Voiced Frequency Estimation: Exploiting Amplitude and Phase Spectra


May 31, 2020
Thomas Drugman, Yannis Stylianou


  Access Paper or Ask Questions

Data-driven Detection and Analysis of the Patterns of Creaky Voice


May 31, 2020
Thomas Drugman, John Kane, Christer Gobl


  Access Paper or Ask Questions

Glottal source estimation robustness: A comparison of sensitivity of voice source estimation techniques


May 24, 2020
Thomas Drugman, Thomas Dubuisson, Alexis Moinet, Nicolas D'Alessandro, Thierry Dutoit


  Access Paper or Ask Questions

Oscillating Statistical Moments for Speech Polarity Detection


May 16, 2020
Thomas Drugman, Thierry Dutoit


  Access Paper or Ask Questions

Glottal Source Estimation using an Automatic Chirp Decomposition


May 16, 2020
Thomas Drugman, Baris Bozkurt, Thierry Dutoit


  Access Paper or Ask Questions

Chirp Complex Cepstrum-based Decomposition for Asynchronous Glottal Analysis


May 10, 2020
Thomas Drugman, Thierry Dutoit


  Access Paper or Ask Questions

Voice Conversion for Whispered Speech Synthesis


Jan 17, 2020
Marius Cotescu, Thomas Drugman, Goeric Huybrechts, Jaime Lorenzo-Trueba, Alexis Moinet

* Submitted to IEEE Signal Processing Letters 

  Access Paper or Ask Questions

On the Mutual Information between Source and Filter Contributions for Voice Pathology Detection


Jan 02, 2020
Thomas Drugman, Thomas Dubuisson, Thierry Dutoit


  Access Paper or Ask Questions

Phase-based Information for Voice Pathology Detection


Jan 02, 2020
Thomas Drugman, Thomas Dubuisson, Thierry Dutoit


  Access Paper or Ask Questions

Excitation-based Voice Quality Analysis and Modification


Jan 02, 2020
Thomas Drugman, Thierry Dutoit, Baris Bozkurt


  Access Paper or Ask Questions

Eigenresiduals for improved Parametric Speech Synthesis


Jan 02, 2020
Thomas Drugman, Geoffrey Wilfart, Thierry Dutoit


  Access Paper or Ask Questions

A Comparative Evaluation of Pitch Modification Techniques


Jan 02, 2020
Thomas Drugman, Thierry Dutoit


  Access Paper or Ask Questions

Using a Pitch-Synchronous Residual Codebook for Hybrid HMM/Frame Selection Speech Synthesis


Dec 30, 2019
Thomas Drugman, Alexis Moinet, Thierry Dutoit, Geoffrey Wilfart


  Access Paper or Ask Questions

Causal-Anticausal Decomposition of Speech using Complex Cepstrum for Glottal Source Estimation


Dec 30, 2019
Thomas Drugman, Baris Bozkurt, Thierry Dutoit


  Access Paper or Ask Questions

Glottal Source Processing: from Analysis to Applications


Dec 29, 2019
Thomas Drugman, Paavo Alku, Abeer Alwan, Bayya Yegnanarayana


  Access Paper or Ask Questions

Complex Cepstrum-based Decomposition of Speech for Glottal Source Estimation


Dec 29, 2019
Thomas Drugman, Baris Bozkurt, Thierry Dutoit


  Access Paper or Ask Questions

The Deterministic plus Stochastic Model of the Residual Signal and its Applications


Dec 29, 2019
Thomas Drugman, Thierry Dutoit


  Access Paper or Ask Questions