Zhehuai Chen

Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR

Oct 18, 2022
Zhehuai Chen, Ankur Bapna, Andrew Rosenberg, Yu Zhang, Bhuvana Ramabhadran, Pedro Moreno, Nanxin Chen


JOIST: A Joint Speech and Text Streaming Model For ASR

Oct 13, 2022
Tara N. Sainath, Rohit Prabhavalkar, Ankur Bapna, Yu Zhang, Zhouyuan Huo, Zhehuai Chen, Bo Li, Weiran Wang, Trevor Strohman


Accented Speech Recognition: Benchmarking, Pre-training, and Diverse Data

May 16, 2022
Alëna Aksënova, Zhehuai Chen, Chung-Cheng Chiu, Daan van Esch, Pavel Golik, Wei Han, Levi King, Bhuvana Ramabhadran, Andrew Rosenberg, Suzan Schwartz, Gary Wang


MAESTRO: Matched Speech Text Representations through Modality Matching

Apr 07, 2022
Zhehuai Chen, Yu Zhang, Andrew Rosenberg, Bhuvana Ramabhadran, Pedro Moreno, Ankur Bapna, Heiga Zen


Unsupervised Data Selection via Discrete Speech Representation for ASR

Apr 05, 2022
Zhiyun Lu, Yongqiang Wang, Yu Zhang, Wei Han, Zhehuai Chen, Parisa Haghani


Injecting Text in Self-Supervised Speech Pretraining

Aug 27, 2021
Zhehuai Chen, Yu Zhang, Andrew Rosenberg, Bhuvana Ramabhadran, Gary Wang, Pedro Moreno


An Asynchronous WFST-Based Decoder For Automatic Speech Recognition

Mar 16, 2021
Hang Lv, Zhehuai Chen, Hainan Xu, Daniel Povey, Lei Xie, Sanjeev Khudanpur


End-to-end contextual speech recognition using class language models and a token passing decoder

Dec 05, 2018
Zhehuai Chen, Mahaveer Jain, Yongqiang Wang, Michael L. Seltzer, Christian Fuegen


Linguistic Search Optimization for Deep Learning Based LVCSR

Aug 02, 2018
Zhehuai Chen


Sequence Discriminative Training for Deep Learning based Acoustic Keyword Spotting

Aug 02, 2018
Zhehuai Chen, Yanmin Qian, Kai Yu
