Alert button
Picture for Yanzhang He

Yanzhang He

Alert button

Google Inc. USA

Large-scale ASR Domain Adaptation using Self- and Semi-supervised Learning

Add code
Bookmark button
Alert button
Oct 13, 2021
Dongseong Hwang, Ananya Misra, Zhouyuan Huo, Nikhil Siddhartha, Shefali Garg, David Qiu, Khe Chai Sim, Trevor Strohman, Françoise Beaufays, Yanzhang He

Figure 1 for Large-scale ASR Domain Adaptation using Self- and Semi-supervised Learning
Figure 2 for Large-scale ASR Domain Adaptation using Self- and Semi-supervised Learning
Figure 3 for Large-scale ASR Domain Adaptation using Self- and Semi-supervised Learning
Figure 4 for Large-scale ASR Domain Adaptation using Self- and Semi-supervised Learning
Viaarxiv icon

Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition

Add code
Bookmark button
Alert button
Oct 07, 2021
Qiujia Li, Yu Zhang, David Qiu, Yanzhang He, Liangliang Cao, Philip C. Woodland

Figure 1 for Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition
Figure 2 for Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition
Figure 3 for Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition
Figure 4 for Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition
Viaarxiv icon

Large-scale ASR Domain Adaptation by Self- and Semi-supervised Learning

Add code
Bookmark button
Alert button
Oct 01, 2021
Dongseong Hwang, Ananya Misra, Zhouyuan Huo, Nikhil Siddhartha, Shefali Garg, David Qiu, Khe Chai Sim, Trevor Strohman, Françoise Beaufays, Yanzhang He

Figure 1 for Large-scale ASR Domain Adaptation by Self- and Semi-supervised Learning
Figure 2 for Large-scale ASR Domain Adaptation by Self- and Semi-supervised Learning
Figure 3 for Large-scale ASR Domain Adaptation by Self- and Semi-supervised Learning
Figure 4 for Large-scale ASR Domain Adaptation by Self- and Semi-supervised Learning
Viaarxiv icon

Tied & Reduced RNN-T Decoder

Add code
Bookmark button
Alert button
Sep 15, 2021
Rami Botros, Tara N. Sainath, Robert David, Emmanuel Guzman, Wei Li, Yanzhang He

Figure 1 for Tied & Reduced RNN-T Decoder
Figure 2 for Tied & Reduced RNN-T Decoder
Figure 3 for Tied & Reduced RNN-T Decoder
Figure 4 for Tied & Reduced RNN-T Decoder
Viaarxiv icon

Multi-user VoiceFilter-Lite via Attentive Speaker Embedding

Add code
Bookmark button
Alert button
Jul 02, 2021
Rajeev Rikhye, Quan Wang, Qiao Liang, Yanzhang He, Ian McGraw

Figure 1 for Multi-user VoiceFilter-Lite via Attentive Speaker Embedding
Figure 2 for Multi-user VoiceFilter-Lite via Attentive Speaker Embedding
Figure 3 for Multi-user VoiceFilter-Lite via Attentive Speaker Embedding
Figure 4 for Multi-user VoiceFilter-Lite via Attentive Speaker Embedding
Viaarxiv icon

Personalized Keyphrase Detection using Speaker and Environment Information

Add code
Bookmark button
Alert button
Apr 28, 2021
Rajeev Rikhye, Quan Wang, Qiao Liang, Yanzhang He, Ding Zhao, Yiteng, Huang, Arun Narayanan, Ian McGraw

Figure 1 for Personalized Keyphrase Detection using Speaker and Environment Information
Figure 2 for Personalized Keyphrase Detection using Speaker and Environment Information
Figure 3 for Personalized Keyphrase Detection using Speaker and Environment Information
Figure 4 for Personalized Keyphrase Detection using Speaker and Environment Information
Viaarxiv icon

Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction

Add code
Bookmark button
Alert button
Apr 26, 2021
David Qiu, Yanzhang He, Qiujia Li, Yu Zhang, Liangliang Cao, Ian McGraw

Figure 1 for Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction
Figure 2 for Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction
Figure 3 for Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction
Figure 4 for Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction
Viaarxiv icon

Learning Word-Level Confidence For Subword End-to-End ASR

Add code
Bookmark button
Alert button
Mar 11, 2021
David Qiu, Qiujia Li, Yanzhang He, Yu Zhang, Bo Li, Liangliang Cao, Rohit Prabhavalkar, Deepti Bhatia, Wei Li, Ke Hu, Tara N. Sainath, Ian McGraw

Figure 1 for Learning Word-Level Confidence For Subword End-to-End ASR
Figure 2 for Learning Word-Level Confidence For Subword End-to-End ASR
Figure 3 for Learning Word-Level Confidence For Subword End-to-End ASR
Figure 4 for Learning Word-Level Confidence For Subword End-to-End ASR
Viaarxiv icon

Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging

Add code
Bookmark button
Alert button
Dec 12, 2020
Rohit Prabhavalkar, Yanzhang He, David Rybach, Sean Campbell, Arun Narayanan, Trevor Strohman, Tara N. Sainath

Figure 1 for Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging
Figure 2 for Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging
Figure 3 for Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging
Figure 4 for Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging
Viaarxiv icon

Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition

Add code
Bookmark button
Alert button
Oct 23, 2020
Qiujia Li, David Qiu, Yu Zhang, Bo Li, Yanzhang He, Philip C. Woodland, Liangliang Cao, Trevor Strohman

Figure 1 for Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition
Figure 2 for Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition
Figure 3 for Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition
Figure 4 for Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition
Viaarxiv icon