Picture for Trevor Strohman

Trevor Strohman

Modular Hybrid Autoregressive Transducer

Add code
Oct 31, 2022
Figure 1 for Modular Hybrid Autoregressive Transducer
Figure 2 for Modular Hybrid Autoregressive Transducer
Figure 3 for Modular Hybrid Autoregressive Transducer
Figure 4 for Modular Hybrid Autoregressive Transducer
Viaarxiv icon

JOIST: A Joint Speech and Text Streaming Model For ASR

Add code
Oct 13, 2022
Figure 1 for JOIST: A Joint Speech and Text Streaming Model For ASR
Figure 2 for JOIST: A Joint Speech and Text Streaming Model For ASR
Figure 3 for JOIST: A Joint Speech and Text Streaming Model For ASR
Figure 4 for JOIST: A Joint Speech and Text Streaming Model For ASR
Viaarxiv icon

Comparison of Soft and Hard Target RNN-T Distillation for Large-scale ASR

Add code
Oct 11, 2022
Figure 1 for Comparison of Soft and Hard Target RNN-T Distillation for Large-scale ASR
Figure 2 for Comparison of Soft and Hard Target RNN-T Distillation for Large-scale ASR
Figure 3 for Comparison of Soft and Hard Target RNN-T Distillation for Large-scale ASR
Figure 4 for Comparison of Soft and Hard Target RNN-T Distillation for Large-scale ASR
Viaarxiv icon

Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification

Add code
Sep 13, 2022
Figure 1 for Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification
Figure 2 for Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification
Figure 3 for Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification
Figure 4 for Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification
Viaarxiv icon

A Language Agnostic Multilingual Streaming On-Device ASR System

Add code
Aug 29, 2022
Figure 1 for A Language Agnostic Multilingual Streaming On-Device ASR System
Figure 2 for A Language Agnostic Multilingual Streaming On-Device ASR System
Figure 3 for A Language Agnostic Multilingual Streaming On-Device ASR System
Figure 4 for A Language Agnostic Multilingual Streaming On-Device ASR System
Viaarxiv icon

Streaming Intended Query Detection using E2E Modeling for Continued Conversation

Add code
Aug 29, 2022
Figure 1 for Streaming Intended Query Detection using E2E Modeling for Continued Conversation
Figure 2 for Streaming Intended Query Detection using E2E Modeling for Continued Conversation
Figure 3 for Streaming Intended Query Detection using E2E Modeling for Continued Conversation
Figure 4 for Streaming Intended Query Detection using E2E Modeling for Continued Conversation
Viaarxiv icon

Turn-Taking Prediction for Natural Conversational Speech

Add code
Aug 29, 2022
Figure 1 for Turn-Taking Prediction for Natural Conversational Speech
Figure 2 for Turn-Taking Prediction for Natural Conversational Speech
Figure 3 for Turn-Taking Prediction for Natural Conversational Speech
Figure 4 for Turn-Taking Prediction for Natural Conversational Speech
Viaarxiv icon

Improving Deliberation by Text-Only and Semi-Supervised Training

Add code
Jun 29, 2022
Figure 1 for Improving Deliberation by Text-Only and Semi-Supervised Training
Figure 2 for Improving Deliberation by Text-Only and Semi-Supervised Training
Figure 3 for Improving Deliberation by Text-Only and Semi-Supervised Training
Figure 4 for Improving Deliberation by Text-Only and Semi-Supervised Training
Viaarxiv icon

A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes

Add code
Apr 20, 2022
Figure 1 for A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes
Figure 2 for A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes
Figure 3 for A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes
Figure 4 for A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes
Viaarxiv icon

Improving Rare Word Recognition with LM-aware MWER Training

Add code
Apr 15, 2022
Figure 1 for Improving Rare Word Recognition with LM-aware MWER Training
Figure 2 for Improving Rare Word Recognition with LM-aware MWER Training
Figure 3 for Improving Rare Word Recognition with LM-aware MWER Training
Figure 4 for Improving Rare Word Recognition with LM-aware MWER Training
Viaarxiv icon