Alert button
Picture for Wei-Ning Hsu

Wei-Ning Hsu

Alert button

Continual Learning for On-Device Speech Recognition using Disentangled Conformers

Add code
Bookmark button
Alert button
Dec 02, 2022
Anuj Diwan, Ching-Feng Yeh, Wei-Ning Hsu, Paden Tomasello, Eunsol Choi, David Harwath, Abdelrahman Mohamed

Figure 1 for Continual Learning for On-Device Speech Recognition using Disentangled Conformers
Figure 2 for Continual Learning for On-Device Speech Recognition using Disentangled Conformers
Figure 3 for Continual Learning for On-Device Speech Recognition using Disentangled Conformers
Figure 4 for Continual Learning for On-Device Speech Recognition using Disentangled Conformers
Viaarxiv icon

Speech-to-Speech Translation For A Real-world Unwritten Language

Add code
Bookmark button
Alert button
Nov 11, 2022
Peng-Jen Chen, Kevin Tran, Yilin Yang, Jingfei Du, Justine Kao, Yu-An Chung, Paden Tomasello, Paul-Ambroise Duquenne, Holger Schwenk, Hongyu Gong, Hirofumi Inaguma, Sravya Popuri, Changhan Wang, Juan Pino, Wei-Ning Hsu, Ann Lee

Figure 1 for Speech-to-Speech Translation For A Real-world Unwritten Language
Figure 2 for Speech-to-Speech Translation For A Real-world Unwritten Language
Figure 3 for Speech-to-Speech Translation For A Real-world Unwritten Language
Figure 4 for Speech-to-Speech Translation For A Real-world Unwritten Language
Viaarxiv icon

Simple and Effective Unsupervised Speech Translation

Add code
Bookmark button
Alert button
Oct 18, 2022
Changhan Wang, Hirofumi Inaguma, Peng-Jen Chen, Ilia Kulikov, Yun Tang, Wei-Ning Hsu, Michael Auli, Juan Pino

Figure 1 for Simple and Effective Unsupervised Speech Translation
Figure 2 for Simple and Effective Unsupervised Speech Translation
Figure 3 for Simple and Effective Unsupervised Speech Translation
Figure 4 for Simple and Effective Unsupervised Speech Translation
Viaarxiv icon

A Single Self-Supervised Model for Many Speech Modalities Enables Zero-Shot Modality Transfer

Add code
Bookmark button
Alert button
Jul 14, 2022
Wei-Ning Hsu, Bowen Shi

Figure 1 for A Single Self-Supervised Model for Many Speech Modalities Enables Zero-Shot Modality Transfer
Figure 2 for A Single Self-Supervised Model for Many Speech Modalities Enables Zero-Shot Modality Transfer
Figure 3 for A Single Self-Supervised Model for Many Speech Modalities Enables Zero-Shot Modality Transfer
Figure 4 for A Single Self-Supervised Model for Many Speech Modalities Enables Zero-Shot Modality Transfer
Viaarxiv icon

Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT

Add code
Bookmark button
Alert button
May 15, 2022
Bowen Shi, Abdelrahman Mohamed, Wei-Ning Hsu

Figure 1 for Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT
Figure 2 for Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT
Figure 3 for Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT
Figure 4 for Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT
Viaarxiv icon

On-demand compute reduction with stochastic wav2vec 2.0

Add code
Bookmark button
Alert button
Apr 25, 2022
Apoorv Vyas, Wei-Ning Hsu, Michael Auli, Alexei Baevski

Figure 1 for On-demand compute reduction with stochastic wav2vec 2.0
Figure 2 for On-demand compute reduction with stochastic wav2vec 2.0
Figure 3 for On-demand compute reduction with stochastic wav2vec 2.0
Figure 4 for On-demand compute reduction with stochastic wav2vec 2.0
Viaarxiv icon

Simple and Effective Unsupervised Speech Synthesis

Add code
Bookmark button
Alert button
Apr 20, 2022
Alexander H. Liu, Cheng-I Jeff Lai, Wei-Ning Hsu, Michael Auli, Alexei Baevski, James Glass

Figure 1 for Simple and Effective Unsupervised Speech Synthesis
Figure 2 for Simple and Effective Unsupervised Speech Synthesis
Figure 3 for Simple and Effective Unsupervised Speech Synthesis
Figure 4 for Simple and Effective Unsupervised Speech Synthesis
Viaarxiv icon

Unified Speech-Text Pre-training for Speech Translation and Recognition

Add code
Bookmark button
Alert button
Apr 11, 2022
Yun Tang, Hongyu Gong, Ning Dong, Changhan Wang, Wei-Ning Hsu, Jiatao Gu, Alexei Baevski, Xian Li, Abdelrahman Mohamed, Michael Auli, Juan Pino

Figure 1 for Unified Speech-Text Pre-training for Speech Translation and Recognition
Figure 2 for Unified Speech-Text Pre-training for Speech Translation and Recognition
Figure 3 for Unified Speech-Text Pre-training for Speech Translation and Recognition
Figure 4 for Unified Speech-Text Pre-training for Speech Translation and Recognition
Viaarxiv icon

Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation

Add code
Bookmark button
Alert button
Apr 06, 2022
Sravya Popuri, Peng-Jen Chen, Changhan Wang, Juan Pino, Yossi Adi, Jiatao Gu, Wei-Ning Hsu, Ann Lee

Figure 1 for Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation
Figure 2 for Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation
Figure 3 for Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation
Figure 4 for Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation
Viaarxiv icon