Picture for Andrew Rosenberg

Andrew Rosenberg

Speculative Speech Recognition by Audio-Prefixed Low-Rank Adaptation of Language Models

Add code
Jul 05, 2024
Viaarxiv icon

Speech Prefix-Tuning with RNNT Loss for Improving LLM Predictions

Add code
Jun 20, 2024
Viaarxiv icon

ASTRA: Aligning Speech and Text Representations for Asr without Sampling

Add code
Jun 10, 2024
Viaarxiv icon

Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data

Add code
Feb 29, 2024
Viaarxiv icon

High-precision Voice Search Query Correction via Retrievable Speech-text Embedings

Add code
Jan 08, 2024
Viaarxiv icon

Using Text Injection to Improve Recognition of Personal Identifiers in Speech

Add code
Aug 14, 2023
Figure 1 for Using Text Injection to Improve Recognition of Personal Identifiers in Speech
Figure 2 for Using Text Injection to Improve Recognition of Personal Identifiers in Speech
Figure 3 for Using Text Injection to Improve Recognition of Personal Identifiers in Speech
Figure 4 for Using Text Injection to Improve Recognition of Personal Identifiers in Speech
Viaarxiv icon

O-1: Self-training with Oracle and 1-best Hypothesis

Add code
Aug 14, 2023
Figure 1 for O-1: Self-training with Oracle and 1-best Hypothesis
Figure 2 for O-1: Self-training with Oracle and 1-best Hypothesis
Figure 3 for O-1: Self-training with Oracle and 1-best Hypothesis
Figure 4 for O-1: Self-training with Oracle and 1-best Hypothesis
Viaarxiv icon

Improving Joint Speech-Text Representations Without Alignment

Add code
Aug 11, 2023
Figure 1 for Improving Joint Speech-Text Representations Without Alignment
Figure 2 for Improving Joint Speech-Text Representations Without Alignment
Figure 3 for Improving Joint Speech-Text Representations Without Alignment
Figure 4 for Improving Joint Speech-Text Representations Without Alignment
Viaarxiv icon

Understanding Shared Speech-Text Representations

Add code
Apr 27, 2023
Figure 1 for Understanding Shared Speech-Text Representations
Figure 2 for Understanding Shared Speech-Text Representations
Figure 3 for Understanding Shared Speech-Text Representations
Figure 4 for Understanding Shared Speech-Text Representations
Viaarxiv icon

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages

Add code
Mar 03, 2023
Figure 1 for Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Figure 2 for Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Figure 3 for Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Figure 4 for Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Viaarxiv icon