Chung-Ming Chien

Toward Joint Language Modeling for Speech Units and Text

Oct 12, 2023
Ju-Chieh Chou, Chung-Ming Chien, Wei-Ning Hsu, Karen Livescu, Arun Babu, Alexis Conneau, Alexei Baevski, Michael Auli

Few-Shot Spoken Language Understanding via Joint Speech-Text Models

Oct 09, 2023
Chung-Ming Chien, Mingjiamei Zhang, Ju-Chieh Chou, Karen Livescu

AV2Wav: Diffusion-Based Re-synthesis from Continuous Self-supervised Features for Audio-Visual Speech Enhancement

Sep 14, 2023
Ju-Chieh Chou, Chung-Ming Chien, Karen Livescu

What do self-supervised speech models know about words?

Jun 30, 2023
Ankita Pasad, Chung-Ming Chien, Shane Settle, Karen Livescu

Voice Filter: Few-shot text-to-speech speaker adaptation using voice conversion as a post-processing module

Feb 16, 2022
Adam Gabryś, Goeric Huybrechts, Manuel Sam Ribeiro, Chung-Ming Chien, Julian Roth, Giulia Comini, Roberto Barra-Chicote, Bartek Perz, Jaime Lorenzo-Trueba

S2VC: A Framework for Any-to-Any Voice Conversion with Self-Supervised Pretrained Representations

Apr 07, 2021
Jheng-hao Lin, Yist Y. Lin, Chung-Ming Chien, Hung-yi Lee

Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-Speech

Mar 20, 2021
Chung-Ming Chien, Jheng-Hao Lin, Chien-yu Huang, Po-chun Hsu, Hung-yi Lee

Hierarchical Prosody Modeling for Non-Autoregressive Speech Synthesis

Nov 17, 2020
Chung-Ming Chien, Hung-yi Lee
