Picture for Yanzhang He

Yanzhang He

Google Inc. USA

Tied & Reduced RNN-T Decoder

Add code
Sep 15, 2021
Figure 1 for Tied & Reduced RNN-T Decoder
Figure 2 for Tied & Reduced RNN-T Decoder
Figure 3 for Tied & Reduced RNN-T Decoder
Figure 4 for Tied & Reduced RNN-T Decoder
Viaarxiv icon

Multi-user VoiceFilter-Lite via Attentive Speaker Embedding

Add code
Jul 02, 2021
Figure 1 for Multi-user VoiceFilter-Lite via Attentive Speaker Embedding
Figure 2 for Multi-user VoiceFilter-Lite via Attentive Speaker Embedding
Figure 3 for Multi-user VoiceFilter-Lite via Attentive Speaker Embedding
Figure 4 for Multi-user VoiceFilter-Lite via Attentive Speaker Embedding
Viaarxiv icon

Personalized Keyphrase Detection using Speaker and Environment Information

Add code
Apr 28, 2021
Figure 1 for Personalized Keyphrase Detection using Speaker and Environment Information
Figure 2 for Personalized Keyphrase Detection using Speaker and Environment Information
Figure 3 for Personalized Keyphrase Detection using Speaker and Environment Information
Figure 4 for Personalized Keyphrase Detection using Speaker and Environment Information
Viaarxiv icon

Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction

Add code
Apr 26, 2021
Figure 1 for Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction
Figure 2 for Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction
Figure 3 for Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction
Figure 4 for Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction
Viaarxiv icon

Learning Word-Level Confidence For Subword End-to-End ASR

Add code
Mar 11, 2021
Figure 1 for Learning Word-Level Confidence For Subword End-to-End ASR
Figure 2 for Learning Word-Level Confidence For Subword End-to-End ASR
Figure 3 for Learning Word-Level Confidence For Subword End-to-End ASR
Figure 4 for Learning Word-Level Confidence For Subword End-to-End ASR
Viaarxiv icon

Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging

Add code
Dec 12, 2020
Figure 1 for Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging
Figure 2 for Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging
Figure 3 for Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging
Figure 4 for Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging
Viaarxiv icon

Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition

Add code
Oct 23, 2020
Figure 1 for Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition
Figure 2 for Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition
Figure 3 for Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition
Figure 4 for Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition
Viaarxiv icon

FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization

Add code
Oct 21, 2020
Figure 1 for FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization
Figure 2 for FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization
Figure 3 for FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization
Figure 4 for FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization
Viaarxiv icon

VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition

Add code
Sep 09, 2020
Figure 1 for VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition
Figure 2 for VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition
Figure 3 for VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition
Viaarxiv icon

Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition

Add code
Sep 02, 2020
Figure 1 for Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition
Figure 2 for Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition
Figure 3 for Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition
Figure 4 for Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition
Viaarxiv icon