Semi-Supervised Speech Recognition via Local Prior Matching

Feb 24, 2020
Wei-Ning Hsu, Ann Lee, Gabriel Synnaeve, Awni Hannun


Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech

Nov 21, 2019
David Harwath, Wei-Ning Hsu, James Glass


Transfer Learning from Audio-Visual Grounding to Speech Recognition

Jul 09, 2019
Wei-Ning Hsu, David Harwath, James Glass

* Accepted to Interspeech 2019. 4 pages, 2 figures 

An Unsupervised Autoregressive Model for Speech Representation Learning

Apr 05, 2019
Yu-An Chung, Wei-Ning Hsu, Hao Tang, James Glass


Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling

Feb 21, 2019
Jonathan Shen, Patrick Nguyen, Yonghui Wu, Zhifeng Chen, Mia X. Chen, Ye Jia, Anjuli Kannan, Tara Sainath, Yuan Cao, Chung-Cheng Chiu, Yanzhang He, Jan Chorowski, Smit Hinsu, Stella Laurenzo, James Qin, Orhan Firat, Wolfgang Macherey, Suyog Gupta, Ankur Bapna, Shuyuan Zhang, Ruoming Pang, Ron J. Weiss, Rohit Prabhavalkar, Qiao Liang, Benoit Jacob, Bowen Liang, HyoukJoong Lee, Ciprian Chelba, Sébastien Jean, Bo Li, Melvin Johnson, Rohan Anil, Rajat Tibrewal, Xiaobing Liu, Akiko Eriguchi, Navdeep Jaitly, Naveen Ari, Colin Cherry, Parisa Haghani, Otavio Good, Youlong Cheng, Raziel Alvarez, Isaac Caswell, Wei-Ning Hsu, Zongheng Yang, Kuan-Chieh Wang, Ekaterina Gonina, Katrin Tomanek, Ben Vanik, Zelin Wu, Llion Jones, Mike Schuster, Yanping Huang, Dehao Chen, Kazuki Irie, George Foster, John Richardson, Klaus Macherey, Antoine Bruguier, Heiga Zen, Colin Raffel, Shankar Kumar, Kanishka Rao, David Rybach, Matthew Murray, Vijayaditya Peddinti, Maxim Krikun, Michiel A. U. Bacchiani, Thomas B. Jablin, Rob Suderman, Ian Williams, Benjamin Lee, Deepti Bhatia, Justin Carlson, Semih Yavuz, Yu Zhang, Ian McGraw, Max Galkin, Qi Ge, Golan Pundak, Chad Whipkey, Todd Wang, Uri Alon, Dmitry Lepikhin, Ye Tian, Sara Sabour, William Chan, Shubham Toshniwal, Baohua Liao, Michael Nirschl, Pat Rondon


Hierarchical Generative Modeling for Controllable Speech Synthesis

Oct 16, 2018
Wei-Ning Hsu, Yu Zhang, Ron J. Weiss, Heiga Zen, Yonghui Wu, Yuxuan Wang, Yuan Cao, Ye Jia, Zhifeng Chen, Jonathan Shen, Patrick Nguyen, Ruoming Pang


Unsupervised Representation Learning of Speech for Dialect Identification

Sep 12, 2018
Suwon Shon, Wei-Ning Hsu, James Glass

* Accepted at SLT 2018 

Semi-Supervised Training for Improving Data Efficiency in End-to-End Speech Synthesis

Aug 30, 2018
Yu-An Chung, Yuxuan Wang, Wei-Ning Hsu, Yu Zhang, RJ Skerry-Ryan


Scalable Factorized Hierarchical Variational Autoencoder Training

Jun 15, 2018
Wei-Ning Hsu, James Glass

* Interspeech 2018 

Unsupervised Adaptation with Interpretable Disentangled Representations for Distant Conversational Speech Recognition

Jun 13, 2018
Wei-Ning Hsu, Hao Tang, James Glass

* To appear in Interspeech 2018

A Study of Enhancement, Augmentation, and Autoencoder Methods for Domain Adaptation in Distant Speech Recognition

Jun 13, 2018
Hao Tang, Wei-Ning Hsu, Francois Grondin, James Glass

* Interspeech 2018

Disentangling by Partitioning: A Representation Learning Framework for Multimodal Sensory Data

May 29, 2018
Wei-Ning Hsu, James Glass


Extracting Domain Invariant Features by Unsupervised Learning for Robust Automatic Speech Recognition

Mar 07, 2018
Wei-Ning Hsu, James Glass

* Accepted to the 2018 International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2018)

Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data

Sep 22, 2017
Wei-Ning Hsu, Yu Zhang, James Glass

* Accepted to NIPS 2017 

Learning Latent Representations for Speech Generation and Transformation

Sep 22, 2017
Wei-Ning Hsu, Yu Zhang, James Glass

* Accepted to Interspeech 2017, pp. 1273-1277

Unsupervised Domain Adaptation for Robust Speech Recognition via Variational Autoencoder-Based Data Augmentation

Sep 22, 2017
Wei-Ning Hsu, Yu Zhang, James Glass

* Accepted to IEEE ASRU 2017 

Recurrent Neural Network Encoder with Attention for Community Question Answering

Mar 23, 2016
Wei-Ning Hsu, Yu Zhang, James Glass


Enhancing Automatically Discovered Multi-level Acoustic Patterns Considering Context Consistency With Applications in Spoken Term Detection

Sep 07, 2015
Cheng-Tao Chung, Wei-Ning Hsu, Cheng-Yi Lee, Lin-Shan Lee

* Accepted by ICASSP 2015 
