Pedro Moreno

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages

Mar 03, 2023
Yu Zhang, Wei Han, James Qin, Yongqiang Wang, Ankur Bapna, Zhehuai Chen, Nanxin Chen, Bo Li, Vera Axelrod, Gary Wang, Zhong Meng, Ke Hu, Andrew Rosenberg, Rohit Prabhavalkar, Daniel S. Park, Parisa Haghani, Jason Riesa, Ginger Perng, Hagen Soltau, Trevor Strohman, Bhuvana Ramabhadran, Tara Sainath, Pedro Moreno, Chung-Cheng Chiu, Johan Schalkwyk, Françoise Beaufays, Yonghui Wu

Via arXiv

Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR

Oct 18, 2022
Zhehuai Chen, Ankur Bapna, Andrew Rosenberg, Yu Zhang, Bhuvana Ramabhadran, Pedro Moreno, Nanxin Chen

Via arXiv

MAESTRO: Matched Speech Text Representations through Modality Matching

Apr 07, 2022
Zhehuai Chen, Yu Zhang, Andrew Rosenberg, Bhuvana Ramabhadran, Pedro Moreno, Ankur Bapna, Heiga Zen

Via arXiv

Ask2Mask: Guided Data Selection for Masked Speech Modeling

Feb 24, 2022
Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran, Yu Zhang, Pedro Moreno

Via arXiv

Injecting Text in Self-Supervised Speech Pretraining

Aug 27, 2021
Zhehuai Chen, Yu Zhang, Andrew Rosenberg, Bhuvana Ramabhadran, Gary Wang, Pedro Moreno

Via arXiv

Speech Recognition with Augmented Synthesized Speech

Sep 25, 2019
Andrew Rosenberg, Yu Zhang, Bhuvana Ramabhadran, Ye Jia, Pedro Moreno, Yonghui Wu, Zelin Wu

Via arXiv

From Audio to Semantics: Approaches to end-to-end spoken language understanding

Sep 24, 2018
Parisa Haghani, Arun Narayanan, Michiel Bacchiani, Galen Chuang, Neeraj Gaur, Pedro Moreno, Rohit Prabhavalkar, Zhongdi Qu, Austin Waters

Via arXiv

Multilingual Speech Recognition With A Single End-To-End Model

Feb 15, 2018
Shubham Toshniwal, Tara N. Sainath, Ron J. Weiss, Bo Li, Pedro Moreno, Eugene Weinstein, Kanishka Rao

Via arXiv