Andrew Rosenberg

Using Text Injection to Improve Recognition of Personal Identifiers in Speech

Aug 14, 2023
Via arXiv

Improving Joint Speech-Text Representations Without Alignment

Aug 11, 2023
Via arXiv

Understanding Shared Speech-Text Representations

Apr 27, 2023
Via arXiv

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages

Mar 03, 2023
Via arXiv

JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition

Feb 16, 2023
Via arXiv

Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech

Oct 27, 2022
Via arXiv

G-Augment: Searching For The Meta-Structure Of Data Augmentation Policies For ASR

Oct 19, 2022
Via arXiv

Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR

Oct 18, 2022
Via arXiv

Non-Parallel Voice Conversion for ASR Augmentation

Sep 15, 2022
Via arXiv

Accented Speech Recognition: Benchmarking, Pre-training, and Diverse Data

May 16, 2022
Via arXiv