Picture for Ian McGraw

Ian McGraw

Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models

Add code
Mar 15, 2023
Figure 1 for Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models
Figure 2 for Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models
Figure 3 for Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models
Figure 4 for Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models
Viaarxiv icon

A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes

Add code
Apr 20, 2022
Figure 1 for A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes
Figure 2 for A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes
Figure 3 for A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes
Figure 4 for A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes
Viaarxiv icon

Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition

Add code
Apr 13, 2022
Figure 1 for Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
Figure 2 for Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
Figure 3 for Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
Figure 4 for Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
Viaarxiv icon

Closing the Gap between Single-User and Multi-User VoiceFilter-Lite

Add code
Feb 24, 2022
Figure 1 for Closing the Gap between Single-User and Multi-User VoiceFilter-Lite
Figure 2 for Closing the Gap between Single-User and Multi-User VoiceFilter-Lite
Figure 3 for Closing the Gap between Single-User and Multi-User VoiceFilter-Lite
Figure 4 for Closing the Gap between Single-User and Multi-User VoiceFilter-Lite
Viaarxiv icon

Multi-user VoiceFilter-Lite via Attentive Speaker Embedding

Add code
Jul 02, 2021
Figure 1 for Multi-user VoiceFilter-Lite via Attentive Speaker Embedding
Figure 2 for Multi-user VoiceFilter-Lite via Attentive Speaker Embedding
Figure 3 for Multi-user VoiceFilter-Lite via Attentive Speaker Embedding
Figure 4 for Multi-user VoiceFilter-Lite via Attentive Speaker Embedding
Viaarxiv icon

Personalized Keyphrase Detection using Speaker and Environment Information

Add code
Apr 28, 2021
Figure 1 for Personalized Keyphrase Detection using Speaker and Environment Information
Figure 2 for Personalized Keyphrase Detection using Speaker and Environment Information
Figure 3 for Personalized Keyphrase Detection using Speaker and Environment Information
Figure 4 for Personalized Keyphrase Detection using Speaker and Environment Information
Viaarxiv icon

Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction

Add code
Apr 26, 2021
Figure 1 for Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction
Figure 2 for Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction
Figure 3 for Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction
Figure 4 for Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction
Viaarxiv icon

Learning Word-Level Confidence For Subword End-to-End ASR

Add code
Mar 11, 2021
Figure 1 for Learning Word-Level Confidence For Subword End-to-End ASR
Figure 2 for Learning Word-Level Confidence For Subword End-to-End ASR
Figure 3 for Learning Word-Level Confidence For Subword End-to-End ASR
Figure 4 for Learning Word-Level Confidence For Subword End-to-End ASR
Viaarxiv icon

Analyzing the Quality and Stability of a Streaming End-to-End On-Device Speech Recognizer

Add code
Jun 02, 2020
Figure 1 for Analyzing the Quality and Stability of a Streaming End-to-End On-Device Speech Recognizer
Figure 2 for Analyzing the Quality and Stability of a Streaming End-to-End On-Device Speech Recognizer
Figure 3 for Analyzing the Quality and Stability of a Streaming End-to-End On-Device Speech Recognizer
Figure 4 for Analyzing the Quality and Stability of a Streaming End-to-End On-Device Speech Recognizer
Viaarxiv icon

A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency

Add code
Mar 28, 2020
Figure 1 for A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Figure 2 for A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Figure 3 for A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Figure 4 for A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Viaarxiv icon