Wei-Ning Hsu

Continual Learning for On-Device Speech Recognition using Disentangled Conformers

Dec 13, 2022
Via arXiv

Speech-to-Speech Translation For A Real-world Unwritten Language

Nov 11, 2022
Via arXiv

Simple and Effective Unsupervised Speech Translation

Oct 18, 2022
Via arXiv

A Single Self-Supervised Model for Many Speech Modalities Enables Zero-Shot Modality Transfer

Jul 14, 2022
Via arXiv

Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT

May 15, 2022
Via arXiv

On-demand compute reduction with stochastic wav2vec 2.0

Apr 25, 2022
Via arXiv

Simple and Effective Unsupervised Speech Synthesis

Apr 20, 2022
Via arXiv

Unified Speech-Text Pre-training for Speech Translation and Recognition

Apr 11, 2022
Via arXiv

Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation

Apr 06, 2022
Via arXiv

Towards End-to-end Unsupervised Speech Recognition

Apr 05, 2022
Via arXiv