Alert button
Picture for Yanzhang He

Yanzhang He

Alert button

A Language Agnostic Multilingual Streaming On-Device ASR System

Add code
Bookmark button
Alert button
Aug 29, 2022
Bo Li, Tara N. Sainath, Ruoming Pang, Shuo-yiin Chang, Qiumin Xu, Trevor Strohman, Vince Chen, Qiao Liang, Heguang Liu, Yanzhang He, Parisa Haghani, Sameer Bidichandani

Figure 1 for A Language Agnostic Multilingual Streaming On-Device ASR System
Figure 2 for A Language Agnostic Multilingual Streaming On-Device ASR System
Figure 3 for A Language Agnostic Multilingual Streaming On-Device ASR System
Figure 4 for A Language Agnostic Multilingual Streaming On-Device ASR System
Viaarxiv icon

Turn-Taking Prediction for Natural Conversational Speech

Add code
Bookmark button
Alert button
Aug 29, 2022
Shuo-yiin Chang, Bo Li, Tara N. Sainath, Chao Zhang, Trevor Strohman, Qiao Liang, Yanzhang He

Figure 1 for Turn-Taking Prediction for Natural Conversational Speech
Figure 2 for Turn-Taking Prediction for Natural Conversational Speech
Figure 3 for Turn-Taking Prediction for Natural Conversational Speech
Figure 4 for Turn-Taking Prediction for Natural Conversational Speech
Viaarxiv icon

Improving Deliberation by Text-Only and Semi-Supervised Training

Add code
Bookmark button
Alert button
Jun 29, 2022
Ke Hu, Tara N. Sainath, Yanzhang He, Rohit Prabhavalkar, Trevor Strohman, Sepand Mavandadi, Weiran Wang

Figure 1 for Improving Deliberation by Text-Only and Semi-Supervised Training
Figure 2 for Improving Deliberation by Text-Only and Semi-Supervised Training
Figure 3 for Improving Deliberation by Text-Only and Semi-Supervised Training
Figure 4 for Improving Deliberation by Text-Only and Semi-Supervised Training
Viaarxiv icon

A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes

Add code
Bookmark button
Alert button
Apr 20, 2022
Shaojin Ding, Weiran Wang, Ding Zhao, Tara N. Sainath, Yanzhang He, Robert David, Rami Botros, Xin Wang, Rina Panigrahy, Qiao Liang, Dongseong Hwang, Ian McGraw, Rohit Prabhavalkar, Trevor Strohman

Figure 1 for A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes
Figure 2 for A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes
Figure 3 for A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes
Figure 4 for A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes
Viaarxiv icon

Improving Rare Word Recognition with LM-aware MWER Training

Add code
Bookmark button
Alert button
Apr 15, 2022
Weiran Wang, Tongzhou Chen, Tara N. Sainath, Ehsan Variani, Rohit Prabhavalkar, Ronny Huang, Bhuvana Ramabhadran, Neeraj Gaur, Sepand Mavandadi, Cal Peyser, Trevor Strohman, Yanzhang He, David Rybach

Figure 1 for Improving Rare Word Recognition with LM-aware MWER Training
Figure 2 for Improving Rare Word Recognition with LM-aware MWER Training
Figure 3 for Improving Rare Word Recognition with LM-aware MWER Training
Figure 4 for Improving Rare Word Recognition with LM-aware MWER Training
Viaarxiv icon

Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition

Add code
Bookmark button
Alert button
Apr 13, 2022
Shaojin Ding, Rajeev Rikhye, Qiao Liang, Yanzhang He, Quan Wang, Arun Narayanan, Tom O'Malley, Ian McGraw

Figure 1 for Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
Figure 2 for Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
Figure 3 for Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
Figure 4 for Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
Viaarxiv icon

4-bit Conformer with Native Quantization Aware Training for Speech Recognition

Add code
Bookmark button
Alert button
Mar 29, 2022
Shaojin Ding, Phoenix Meadowlark, Yanzhang He, Lukasz Lew, Shivani Agrawal, Oleg Rybakov

Figure 1 for 4-bit Conformer with Native Quantization Aware Training for Speech Recognition
Figure 2 for 4-bit Conformer with Native Quantization Aware Training for Speech Recognition
Figure 3 for 4-bit Conformer with Native Quantization Aware Training for Speech Recognition
Figure 4 for 4-bit Conformer with Native Quantization Aware Training for Speech Recognition
Viaarxiv icon

Closing the Gap between Single-User and Multi-User VoiceFilter-Lite

Add code
Bookmark button
Alert button
Feb 24, 2022
Rajeev Rikhye, Quan Wang, Qiao Liang, Yanzhang He, Ian McGraw

Figure 1 for Closing the Gap between Single-User and Multi-User VoiceFilter-Lite
Figure 2 for Closing the Gap between Single-User and Multi-User VoiceFilter-Lite
Figure 3 for Closing the Gap between Single-User and Multi-User VoiceFilter-Lite
Figure 4 for Closing the Gap between Single-User and Multi-User VoiceFilter-Lite
Viaarxiv icon

Cross-attention conformer for context modeling in speech enhancement for ASR

Add code
Bookmark button
Alert button
Oct 30, 2021
Arun Narayanan, Chung-Cheng Chiu, Tom O'Malley, Quan Wang, Yanzhang He

Figure 1 for Cross-attention conformer for context modeling in speech enhancement for ASR
Figure 2 for Cross-attention conformer for context modeling in speech enhancement for ASR
Figure 3 for Cross-attention conformer for context modeling in speech enhancement for ASR
Figure 4 for Cross-attention conformer for context modeling in speech enhancement for ASR
Viaarxiv icon