Picture for Alex Xiao

Alex Xiao

Scaling ASR Improves Zero and Few Shot Learning

Add code
Nov 29, 2021
Figure 1 for Scaling ASR Improves Zero and Few Shot Learning
Figure 2 for Scaling ASR Improves Zero and Few Shot Learning
Figure 3 for Scaling ASR Improves Zero and Few Shot Learning
Figure 4 for Scaling ASR Improves Zero and Few Shot Learning
Viaarxiv icon

Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions

Add code
Nov 18, 2021
Figure 1 for Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions
Figure 2 for Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions
Viaarxiv icon

Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution

Add code
Oct 07, 2021
Figure 1 for Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution
Figure 2 for Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution
Figure 3 for Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution
Figure 4 for Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution
Viaarxiv icon

Transferring Voice Knowledge for Acoustic Event Detection: An Empirical Study

Add code
Oct 07, 2021
Figure 1 for Transferring Voice Knowledge for Acoustic Event Detection: An Empirical Study
Figure 2 for Transferring Voice Knowledge for Acoustic Event Detection: An Empirical Study
Figure 3 for Transferring Voice Knowledge for Acoustic Event Detection: An Empirical Study
Figure 4 for Transferring Voice Knowledge for Acoustic Event Detection: An Empirical Study
Viaarxiv icon

Flexi-Transducer: Optimizing Latency, Accuracy and Compute forMulti-Domain On-Device Scenarios

Add code
Apr 06, 2021
Figure 1 for Flexi-Transducer: Optimizing Latency, Accuracy and Compute forMulti-Domain On-Device Scenarios
Figure 2 for Flexi-Transducer: Optimizing Latency, Accuracy and Compute forMulti-Domain On-Device Scenarios
Figure 3 for Flexi-Transducer: Optimizing Latency, Accuracy and Compute forMulti-Domain On-Device Scenarios
Figure 4 for Flexi-Transducer: Optimizing Latency, Accuracy and Compute forMulti-Domain On-Device Scenarios
Viaarxiv icon

Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency

Add code
Apr 05, 2021
Figure 1 for Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency
Figure 2 for Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency
Figure 3 for Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency
Figure 4 for Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency
Viaarxiv icon

Contrastive Semi-supervised Learning for ASR

Add code
Mar 09, 2021
Figure 1 for Contrastive Semi-supervised Learning for ASR
Figure 2 for Contrastive Semi-supervised Learning for ASR
Figure 3 for Contrastive Semi-supervised Learning for ASR
Figure 4 for Contrastive Semi-supervised Learning for ASR
Viaarxiv icon

Transformer in action: a comparative study of transformer-based acoustic models for large scale speech recognition applications

Add code
Oct 29, 2020
Figure 1 for Transformer in action: a comparative study of transformer-based acoustic models for large scale speech recognition applications
Figure 2 for Transformer in action: a comparative study of transformer-based acoustic models for large scale speech recognition applications
Figure 3 for Transformer in action: a comparative study of transformer-based acoustic models for large scale speech recognition applications
Figure 4 for Transformer in action: a comparative study of transformer-based acoustic models for large scale speech recognition applications
Viaarxiv icon

Large scale weakly and semi-supervised learning for low-resource video ASR

Add code
May 16, 2020
Figure 1 for Large scale weakly and semi-supervised learning for low-resource video ASR
Figure 2 for Large scale weakly and semi-supervised learning for low-resource video ASR
Figure 3 for Large scale weakly and semi-supervised learning for low-resource video ASR
Viaarxiv icon

Transformer-based Acoustic Modeling for Hybrid Speech Recognition

Add code
Oct 22, 2019
Figure 1 for Transformer-based Acoustic Modeling for Hybrid Speech Recognition
Figure 2 for Transformer-based Acoustic Modeling for Hybrid Speech Recognition
Figure 3 for Transformer-based Acoustic Modeling for Hybrid Speech Recognition
Figure 4 for Transformer-based Acoustic Modeling for Hybrid Speech Recognition
Viaarxiv icon