SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping


Mar 31, 2022
Yuma Koizumi, Heiga Zen, Kohei Yatabe, Nanxin Chen, Michiel Bacchiani

* Submitted to Interspeech 2022 


A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation


Oct 11, 2021
Yosuke Higuchi, Nanxin Chen, Yuya Fujita, Hirofumi Inaguma, Tatsuya Komatsu, Jaesong Lee, Jumon Nozaki, Tianzi Wang, Shinji Watanabe

* Accepted to ASRU2021 


WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis


Jun 19, 2021
Nanxin Chen, Yu Zhang, Heiga Zen, Ron J. Weiss, Mohammad Norouzi, Najim Dehak, William Chan

* Proceedings of INTERSPEECH 


Focus on the present: a regularization method for the ASR source-target attention layer


Nov 02, 2020
Nanxin Chen, Piotr Żelasko, Jesús Villalba, Najim Dehak

* Submitted to ICASSP 2021. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.


WaveGrad: Estimating Gradients for Waveform Generation


Sep 02, 2020
Nanxin Chen, Yu Zhang, Heiga Zen, Ron J. Weiss, Mohammad Norouzi, William Chan



Robust Training of Vector Quantized Bottleneck Models


May 18, 2020
Adrian Łańcucki, Jan Chorowski, Guillaume Sanchez, Ricard Marxer, Nanxin Chen, Hans J. G. A. Dolfing, Sameer Khurana, Tanel Alumäe, Antoine Laurent

* Published at IJCNN 2020 


x-vectors meet emotions: A study on dependencies between emotion and speaker recognition


Feb 12, 2020
Raghavendra Pappagari, Tianzi Wang, Jesus Villalba, Nanxin Chen, Najim Dehak

* 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020 


Improving Language Identification for Multilingual Speakers


Jan 29, 2020
Andrew Titus, Jan Silovsky, Nanxin Chen, Roger Hsiao, Mary Young, Arnab Ghoshal

* 5 pages, 2 figures. Submitted to ICASSP 2020 


Non-Autoregressive Transformer Automatic Speech Recognition


Nov 10, 2019
Nanxin Chen, Shinji Watanabe, Jesús Villalba, Najim Dehak



A Comparative Study on Transformer vs RNN in Speech Applications


Sep 28, 2019
Shigeki Karita, Nanxin Chen, Tomoki Hayashi, Takaaki Hori, Hirofumi Inaguma, Ziyan Jiang, Masao Someki, Nelson Enrique Yalta Soplin, Ryuichi Yamamoto, Xiaofei Wang, Shinji Watanabe, Takenori Yoshimura, Wangyou Zhang

* IEEE Automatic Speech Recognition and Understanding Workshop 2019 
* Accepted at ASRU 2019 
