Picture for Yanzhang He

Yanzhang He

Google Inc. USA

Turn-Taking Prediction for Natural Conversational Speech

Add code
Aug 29, 2022
Figure 1 for Turn-Taking Prediction for Natural Conversational Speech
Figure 2 for Turn-Taking Prediction for Natural Conversational Speech
Figure 3 for Turn-Taking Prediction for Natural Conversational Speech
Figure 4 for Turn-Taking Prediction for Natural Conversational Speech
Viaarxiv icon

Improving Deliberation by Text-Only and Semi-Supervised Training

Add code
Jun 29, 2022
Figure 1 for Improving Deliberation by Text-Only and Semi-Supervised Training
Figure 2 for Improving Deliberation by Text-Only and Semi-Supervised Training
Figure 3 for Improving Deliberation by Text-Only and Semi-Supervised Training
Figure 4 for Improving Deliberation by Text-Only and Semi-Supervised Training
Viaarxiv icon

A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes

Add code
Apr 20, 2022
Figure 1 for A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes
Figure 2 for A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes
Figure 3 for A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes
Figure 4 for A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes
Viaarxiv icon

Improving Rare Word Recognition with LM-aware MWER Training

Add code
Apr 15, 2022
Figure 1 for Improving Rare Word Recognition with LM-aware MWER Training
Figure 2 for Improving Rare Word Recognition with LM-aware MWER Training
Figure 3 for Improving Rare Word Recognition with LM-aware MWER Training
Figure 4 for Improving Rare Word Recognition with LM-aware MWER Training
Viaarxiv icon

Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition

Add code
Apr 13, 2022
Figure 1 for Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
Figure 2 for Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
Figure 3 for Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
Figure 4 for Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
Viaarxiv icon

4-bit Conformer with Native Quantization Aware Training for Speech Recognition

Add code
Mar 29, 2022
Figure 1 for 4-bit Conformer with Native Quantization Aware Training for Speech Recognition
Figure 2 for 4-bit Conformer with Native Quantization Aware Training for Speech Recognition
Figure 3 for 4-bit Conformer with Native Quantization Aware Training for Speech Recognition
Figure 4 for 4-bit Conformer with Native Quantization Aware Training for Speech Recognition
Viaarxiv icon

Closing the Gap between Single-User and Multi-User VoiceFilter-Lite

Add code
Feb 24, 2022
Figure 1 for Closing the Gap between Single-User and Multi-User VoiceFilter-Lite
Figure 2 for Closing the Gap between Single-User and Multi-User VoiceFilter-Lite
Figure 3 for Closing the Gap between Single-User and Multi-User VoiceFilter-Lite
Figure 4 for Closing the Gap between Single-User and Multi-User VoiceFilter-Lite
Viaarxiv icon

Cross-attention conformer for context modeling in speech enhancement for ASR

Add code
Oct 30, 2021
Figure 1 for Cross-attention conformer for context modeling in speech enhancement for ASR
Figure 2 for Cross-attention conformer for context modeling in speech enhancement for ASR
Figure 3 for Cross-attention conformer for context modeling in speech enhancement for ASR
Figure 4 for Cross-attention conformer for context modeling in speech enhancement for ASR
Viaarxiv icon

Large-scale ASR Domain Adaptation using Self- and Semi-supervised Learning

Add code
Oct 13, 2021
Figure 1 for Large-scale ASR Domain Adaptation using Self- and Semi-supervised Learning
Figure 2 for Large-scale ASR Domain Adaptation using Self- and Semi-supervised Learning
Figure 3 for Large-scale ASR Domain Adaptation using Self- and Semi-supervised Learning
Figure 4 for Large-scale ASR Domain Adaptation using Self- and Semi-supervised Learning
Viaarxiv icon

Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition

Add code
Oct 07, 2021
Figure 1 for Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition
Figure 2 for Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition
Figure 3 for Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition
Figure 4 for Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition
Viaarxiv icon