Andrew Rosenberg

G-Augment: Searching For The Meta-Structure Of Data Augmentation Policies For ASR

Oct 19, 2022
Gary Wang, Ekin D. Cubuk, Andrew Rosenberg, Shuyang Cheng, Ron J. Weiss, Bhuvana Ramabhadran, Pedro J. Moreno, Quoc V. Le, Daniel S. Park

Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR

Oct 18, 2022
Zhehuai Chen, Ankur Bapna, Andrew Rosenberg, Yu Zhang, Bhuvana Ramabhadran, Pedro Moreno, Nanxin Chen

Non-Parallel Voice Conversion for ASR Augmentation

Sep 15, 2022
Gary Wang, Andrew Rosenberg, Bhuvana Ramabhadran, Fadi Biadsy, Yinghui Huang, Jesse Emond, Pedro Moreno Mengibar

Accented Speech Recognition: Benchmarking, Pre-training, and Diverse Data

May 16, 2022
Alëna Aksënova, Zhehuai Chen, Chung-Cheng Chiu, Daan van Esch, Pavel Golik, Wei Han, Levi King, Bhuvana Ramabhadran, Andrew Rosenberg, Suzan Schwartz, Gary Wang

MAESTRO: Matched Speech Text Representations through Modality Matching

Apr 07, 2022
Zhehuai Chen, Yu Zhang, Andrew Rosenberg, Bhuvana Ramabhadran, Pedro Moreno, Ankur Bapna, Heiga Zen

A Scalable Model Specialization Framework for Training and Inference using Submodels and its Application to Speech Model Personalization

Mar 23, 2022
Fadi Biadsy, Youzheng Chen, Xia Zhang, Oleg Rybakov, Andrew Rosenberg, Pedro J. Moreno

Ask2Mask: Guided Data Selection for Masked Speech Modeling

Feb 24, 2022
Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran, Yu Zhang, Pedro Moreno

Injecting Text in Self-Supervised Speech Pretraining

Aug 27, 2021
Zhehuai Chen, Yu Zhang, Andrew Rosenberg, Bhuvana Ramabhadran, Gary Wang, Pedro Moreno

Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior

Feb 06, 2020
Guangzhi Sun, Yu Zhang, Ron J. Weiss, Yuan Cao, Heiga Zen, Andrew Rosenberg, Bhuvana Ramabhadran, Yonghui Wu

Speech Recognition with Augmented Synthesized Speech

Sep 25, 2019
Andrew Rosenberg, Yu Zhang, Bhuvana Ramabhadran, Ye Jia, Pedro Moreno, Yonghui Wu, Zelin Wu
