Ultra-Low-Bitrate Speech Coding with Pretrained Transformers


Jul 05, 2022
Ali Siahkoohi, Michael Chinen, Tom Denton, W. Bastiaan Kleijn, Jan Skoglund

* Proceedings of INTERSPEECH 2022 


A Comparison of Deep Learning MOS Predictors for Speech Synthesis Quality


Apr 05, 2022
Alessandro Ragano, Emmanouil Benetos, Michael Chinen, Helard B. Martinez, Chandan K. A. Reddy, Jan Skoglund, Andrew Hines

* Submitted to INTERSPEECH 2022 


SoundStream: An End-to-End Neural Audio Codec


Jul 07, 2021
Neil Zeghidour, Alejandro Luebs, Ahmed Omran, Jan Skoglund, Marco Tagliasacchi



Handling Background Noise in Neural Speech Generation


Feb 23, 2021
Tom Denton, Alejandro Luebs, Felicia S. C. Lim, Andrew Storus, Hengchin Yeh, W. Bastiaan Kleijn, Jan Skoglund

* 5 pages, 3 figures, presented at the Asilomar Conference on Signals, Systems, and Computers 2020 


WARP-Q: Quality Prediction For Generative Neural Speech Codecs


Feb 20, 2021
Wissam A. Jassim, Jan Skoglund, Michael Chinen, Andrew Hines

* Accepted for presentation at IEEE ICASSP 2021. Source code and data can be found on https://github.com/wjassim/WARP-Q.git 


Generative Speech Coding with Predictive Variance Regularization


Feb 18, 2021
W. Bastiaan Kleijn, Andrew Storus, Michael Chinen, Tom Denton, Felicia S. C. Lim, Alejandro Luebs, Jan Skoglund, Hengchin Yeh



A Real-Time Wideband Neural Vocoder at 1.6 kb/s Using LPCNet


Mar 28, 2019
Jean-Marc Valin, Jan Skoglund

* Submitted for Interspeech 2019, 5 pages 


LPCNet: Improving Neural Speech Synthesis Through Linear Prediction


Oct 28, 2018
Jean-Marc Valin, Jan Skoglund

* 5 pages 
