Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Wei-Ning Hsu

Direct speech-to-speech translation with discrete units


Jul 12, 2021
Ann Lee, Peng-Jen Chen, Changhan Wang, Jiatao Gu, Xutai Ma, Adam Polyak, Yossi Adi, Qing He, Yun Tang, Juan Pino, Wei-Ning Hsu


  Access Paper or Ask Questions

Kaizen: Continuously improving teacher using Exponential Moving Average for semi-supervised speech recognition


Jun 14, 2021
Vimal Manohar, Tatiana Likhomanenko, Qiantong Xu, Wei-Ning Hsu, Ronan Collobert, Yatharth Saraf, Geoffrey Zweig, Abdelrahman Mohamed

* 4 figures, 7 pages; fixed author list going out of margin 

  Access Paper or Ask Questions

HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units


Jun 14, 2021
Wei-Ning Hsu, Benjamin Bolte, Yao-Hung Hubert Tsai, Kushal Lakhotia, Ruslan Salakhutdinov, Abdelrahman Mohamed


  Access Paper or Ask Questions

Unsupervised Speech Recognition


May 24, 2021
Alexei Baevski, Wei-Ning Hsu, Alexis Conneau, Michael Auli


  Access Paper or Ask Questions

Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training


Apr 02, 2021
Wei-Ning Hsu, Anuroop Sriram, Alexei Baevski, Tatiana Likhomanenko, Qiantong Xu, Vineel Pratap, Jacob Kahn, Ann Lee, Ronan Collobert, Gabriel Synnaeve, Michael Auli


  Access Paper or Ask Questions

Speech Resynthesis from Discrete Disentangled Self-Supervised Representations


Apr 02, 2021
Adam Polyak, Yossi Adi, Jade Copet, Eugene Kharitonov, Kushal Lakhotia, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux


  Access Paper or Ask Questions

Generative Spoken Language Modeling from Raw Audio


Feb 01, 2021
Kushal Lakhotia, Evgeny Kharitonov, Wei-Ning Hsu, Yossi Adi, Adam Polyak, Benjamin Bolte, Tu-Anh Nguyen, Jade Copet, Alexei Baevski, Adelrahman Mohamed, Emmanuel Dupoux


  Access Paper or Ask Questions

Text-Free Image-to-Speech Synthesis Using Learned Segmental Units


Dec 31, 2020
Wei-Ning Hsu, David Harwath, Christopher Song, James Glass


  Access Paper or Ask Questions

Differentiable Weighted Finite-State Transducers


Oct 02, 2020
Awni Hannun, Vineel Pratap, Jacob Kahn, Wei-Ning Hsu


  Access Paper or Ask Questions

A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning


Jun 03, 2020
Sameer Khurana, Antoine Laurent, Wei-Ning Hsu, Jan Chorowski, Adrian Lancucki, Ricard Marxer, James Glass

* Submitted to Interspeech 2020 

  Access Paper or Ask Questions

Semi-Supervised Speech Recognition via Local Prior Matching


Feb 24, 2020
Wei-Ning Hsu, Ann Lee, Gabriel Synnaeve, Awni Hannun


  Access Paper or Ask Questions

Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech


Nov 21, 2019
David Harwath, Wei-Ning Hsu, James Glass


  Access Paper or Ask Questions

Transfer Learning from Audio-Visual Grounding to Speech Recognition


Jul 09, 2019
Wei-Ning Hsu, David Harwath, James Glass

* Accepted to Interspeech 2019. 4 pages, 2 figures 

  Access Paper or Ask Questions

An Unsupervised Autoregressive Model for Speech Representation Learning


Apr 05, 2019
Yu-An Chung, Wei-Ning Hsu, Hao Tang, James Glass


  Access Paper or Ask Questions

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling


Feb 21, 2019
Jonathan Shen, Patrick Nguyen, Yonghui Wu, Zhifeng Chen, Mia X. Chen, Ye Jia, Anjuli Kannan, Tara Sainath, Yuan Cao, Chung-Cheng Chiu, Yanzhang He, Jan Chorowski, Smit Hinsu, Stella Laurenzo, James Qin, Orhan Firat, Wolfgang Macherey, Suyog Gupta, Ankur Bapna, Shuyuan Zhang, Ruoming Pang, Ron J. Weiss, Rohit Prabhavalkar, Qiao Liang, Benoit Jacob, Bowen Liang, HyoukJoong Lee, Ciprian Chelba, Sébastien Jean, Bo Li, Melvin Johnson, Rohan Anil, Rajat Tibrewal, Xiaobing Liu, Akiko Eriguchi, Navdeep Jaitly, Naveen Ari, Colin Cherry, Parisa Haghani, Otavio Good, Youlong Cheng, Raziel Alvarez, Isaac Caswell, Wei-Ning Hsu, Zongheng Yang, Kuan-Chieh Wang, Ekaterina Gonina, Katrin Tomanek, Ben Vanik, Zelin Wu, Llion Jones, Mike Schuster, Yanping Huang, Dehao Chen, Kazuki Irie, George Foster, John Richardson, Klaus Macherey, Antoine Bruguier, Heiga Zen, Colin Raffel, Shankar Kumar, Kanishka Rao, David Rybach, Matthew Murray, Vijayaditya Peddinti, Maxim Krikun, Michiel A. U. Bacchiani, Thomas B. Jablin, Rob Suderman, Ian Williams, Benjamin Lee, Deepti Bhatia, Justin Carlson, Semih Yavuz, Yu Zhang, Ian McGraw, Max Galkin, Qi Ge, Golan Pundak, Chad Whipkey, Todd Wang, Uri Alon, Dmitry Lepikhin, Ye Tian, Sara Sabour, William Chan, Shubham Toshniwal, Baohua Liao, Michael Nirschl, Pat Rondon


  Access Paper or Ask Questions

Hierarchical Generative Modeling for Controllable Speech Synthesis


Oct 16, 2018
Wei-Ning Hsu, Yu Zhang, Ron J. Weiss, Heiga Zen, Yonghui Wu, Yuxuan Wang, Yuan Cao, Ye Jia, Zhifeng Chen, Jonathan Shen, Patrick Nguyen, Ruoming Pang


  Access Paper or Ask Questions

Unsupervised Representation Learning of Speech for Dialect Identification


Sep 12, 2018
Suwon Shon, Wei-Ning Hsu, James Glass

* Accepted at SLT 2018 

  Access Paper or Ask Questions

Semi-Supervised Training for Improving Data Efficiency in End-to-End Speech Synthesis


Aug 30, 2018
Yu-An Chung, Yuxuan Wang, Wei-Ning Hsu, Yu Zhang, RJ Skerry-Ryan


  Access Paper or Ask Questions

Scalable Factorized Hierarchical Variational Autoencoder Training


Jun 15, 2018
Wei-Ning Hsu, James Glass

* Interspeech 2018 

  Access Paper or Ask Questions

Unsupervised Adaptation with Interpretable Disentangled Representations for Distant Conversational Speech Recognition


Jun 13, 2018
Wei-Ning Hsu, Hao Tang, James Glass

* to appear in Interspeech 2018 

  Access Paper or Ask Questions

A Study of Enhancement, Augmentation, and Autoencoder Methods for Domain Adaptation in Distant Speech Recognition


Jun 13, 2018
Hao Tang, Wei-Ning Hsu, Francois Grondin, James Glass

* Interspeech, 2018 

  Access Paper or Ask Questions

Disentangling by Partitioning: A Representation Learning Framework for Multimodal Sensory Data


May 29, 2018
Wei-Ning Hsu, James Glass


  Access Paper or Ask Questions

Extracting Domain Invariant Features by Unsupervised Learning for Robust Automatic Speech Recognition


Mar 07, 2018
Wei-Ning Hsu, James Glass

* accepted by 2018 International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2018) 

  Access Paper or Ask Questions

Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data


Sep 22, 2017
Wei-Ning Hsu, Yu Zhang, James Glass

* Accepted to NIPS 2017 

  Access Paper or Ask Questions

Learning Latent Representations for Speech Generation and Transformation


Sep 22, 2017
Wei-Ning Hsu, Yu Zhang, James Glass

* Interspeech 2017, pp 1273-1277 
* Accepted to Interspeech 2017 

  Access Paper or Ask Questions

Unsupervised Domain Adaptation for Robust Speech Recognition via Variational Autoencoder-Based Data Augmentation


Sep 22, 2017
Wei-Ning Hsu, Yu Zhang, James Glass

* Accepted to IEEE ASRU 2017 

  Access Paper or Ask Questions

Recurrent Neural Network Encoder with Attention for Community Question Answering


Mar 23, 2016
Wei-Ning Hsu, Yu Zhang, James Glass


  Access Paper or Ask Questions

Enhancing Automatically Discovered Multi-level Acoustic Patterns Considering Context Consistency With Applications in Spoken Term Detection


Sep 07, 2015
Cheng-Tao Chung, Wei-Ning Hsu, Cheng-Yi Lee, Lin-Shan Lee

* Accepted by ICASSP 2015 

  Access Paper or Ask Questions