Picture for Pedro Moreno

Pedro Moreno

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages

Add code
Mar 03, 2023
Figure 1 for Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Figure 2 for Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Figure 3 for Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Figure 4 for Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Viaarxiv icon

Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR

Add code
Oct 18, 2022
Figure 1 for Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR
Figure 2 for Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR
Figure 3 for Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR
Figure 4 for Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR
Viaarxiv icon

MAESTRO: Matched Speech Text Representations through Modality Matching

Add code
Apr 07, 2022
Figure 1 for MAESTRO: Matched Speech Text Representations through Modality Matching
Figure 2 for MAESTRO: Matched Speech Text Representations through Modality Matching
Figure 3 for MAESTRO: Matched Speech Text Representations through Modality Matching
Figure 4 for MAESTRO: Matched Speech Text Representations through Modality Matching
Viaarxiv icon

Ask2Mask: Guided Data Selection for Masked Speech Modeling

Add code
Feb 24, 2022
Figure 1 for Ask2Mask: Guided Data Selection for Masked Speech Modeling
Figure 2 for Ask2Mask: Guided Data Selection for Masked Speech Modeling
Figure 3 for Ask2Mask: Guided Data Selection for Masked Speech Modeling
Figure 4 for Ask2Mask: Guided Data Selection for Masked Speech Modeling
Viaarxiv icon

Injecting Text in Self-Supervised Speech Pretraining

Add code
Aug 27, 2021
Figure 1 for Injecting Text in Self-Supervised Speech Pretraining
Figure 2 for Injecting Text in Self-Supervised Speech Pretraining
Figure 3 for Injecting Text in Self-Supervised Speech Pretraining
Figure 4 for Injecting Text in Self-Supervised Speech Pretraining
Viaarxiv icon

Speech Recognition with Augmented Synthesized Speech

Add code
Sep 25, 2019
Figure 1 for Speech Recognition with Augmented Synthesized Speech
Figure 2 for Speech Recognition with Augmented Synthesized Speech
Figure 3 for Speech Recognition with Augmented Synthesized Speech
Figure 4 for Speech Recognition with Augmented Synthesized Speech
Viaarxiv icon

From Audio to Semantics: Approaches to end-to-end spoken language understanding

Add code
Sep 24, 2018
Figure 1 for From Audio to Semantics: Approaches to end-to-end spoken language understanding
Figure 2 for From Audio to Semantics: Approaches to end-to-end spoken language understanding
Figure 3 for From Audio to Semantics: Approaches to end-to-end spoken language understanding
Figure 4 for From Audio to Semantics: Approaches to end-to-end spoken language understanding
Viaarxiv icon

Multilingual Speech Recognition With A Single End-To-End Model

Add code
Feb 15, 2018
Figure 1 for Multilingual Speech Recognition With A Single End-To-End Model
Figure 2 for Multilingual Speech Recognition With A Single End-To-End Model
Figure 3 for Multilingual Speech Recognition With A Single End-To-End Model
Figure 4 for Multilingual Speech Recognition With A Single End-To-End Model
Viaarxiv icon