MAESTRO: Matched Speech Text Representations through Modality Matching



Zhehuai Chen, Yu Zhang, Andrew Rosenberg, Bhuvana Ramabhadran, Pedro Moreno, Ankur Bapna, Heiga Zen

* Submitted to Interspeech 2022 


SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping



Yuma Koizumi, Heiga Zen, Kohei Yatabe, Nanxin Chen, Michiel Bacchiani

* Submitted to Interspeech 2022 


CVSS Corpus and Massively Multilingual Speech-to-Speech Translation



Ye Jia, Michelle Tadmor Ramanovich, Quan Wang, Heiga Zen

* Submitted to LREC 2022 


WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis



Nanxin Chen, Yu Zhang, Heiga Zen, Ron J. Weiss, Mohammad Norouzi, Najim Dehak, William Chan

* Proceedings of Interspeech


Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling



Isaac Elias, Heiga Zen, Jonathan Shen, Yu Zhang, Ye Jia, RJ Skerry-Ryan, Yonghui Wu

* Submitted to Interspeech 2021


PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS



Ye Jia, Heiga Zen, Jonathan Shen, Yu Zhang, Yonghui Wu

* Submitted to Interspeech 2021 

