Alert button
Picture for Hung-yi Lee

Hung-yi Lee

Alert button

Towards General-Purpose Text-Instruction-Guided Voice Conversion

Add code
Bookmark button
Alert button
Sep 25, 2023
Chun-Yi Kuan, Chen An Li, Tsu-Yuan Hsu, Tse-Yang Lin, Ho-Lam Chung, Kai-Wei Chang, Shuo-yiin Chang, Hung-yi Lee

Figure 1 for Towards General-Purpose Text-Instruction-Guided Voice Conversion
Figure 2 for Towards General-Purpose Text-Instruction-Guided Voice Conversion
Figure 3 for Towards General-Purpose Text-Instruction-Guided Voice Conversion
Figure 4 for Towards General-Purpose Text-Instruction-Guided Voice Conversion
Viaarxiv icon

AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models

Add code
Bookmark button
Alert button
Sep 19, 2023
Yuan Tseng, Layne Berry, Yi-Ting Chen, I-Hsiang Chiu, Hsuan-Hao Lin, Max Liu, Puyuan Peng, Yi-Jen Shih, Hung-Yu Wang, Haibin Wu, Po-Yao Huang, Chun-Mao Lai, Shang-Wen Li, David Harwath, Yu Tsao, Shinji Watanabe, Abdelrahman Mohamed, Chi-Luen Feng, Hung-yi Lee

Figure 1 for AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models
Figure 2 for AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models
Figure 3 for AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models
Viaarxiv icon

Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech

Add code
Bookmark button
Alert button
Sep 18, 2023
Chien-yu Huang, Ke-Han Lu, Shih-Heng Wang, Chi-Yuan Hsiao, Chun-Yi Kuan, Haibin Wu, Siddhant Arora, Kai-Wei Chang, Jiatong Shi, Yifan Peng, Roshan Sharma, Shinji Watanabe, Bhiksha Ramakrishnan, Shady Shehata, Hung-yi Lee

Figure 1 for Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech
Figure 2 for Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech
Figure 3 for Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech
Figure 4 for Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech
Viaarxiv icon

SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts

Add code
Bookmark button
Alert button
Jun 19, 2023
Haibin Wu, Kai-Wei Chang, Yuan-Kuei Wu, Hung-yi Lee

Figure 1 for SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts
Figure 2 for SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts
Figure 3 for SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts
Figure 4 for SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts
Viaarxiv icon

Revealing the Blind Spot of Sentence Encoder Evaluation by HEROS

Add code
Bookmark button
Alert button
Jun 13, 2023
Cheng-Han Chiang, Yung-Sung Chuang, James Glass, Hung-yi Lee

Figure 1 for Revealing the Blind Spot of Sentence Encoder Evaluation by HEROS
Figure 2 for Revealing the Blind Spot of Sentence Encoder Evaluation by HEROS
Figure 3 for Revealing the Blind Spot of Sentence Encoder Evaluation by HEROS
Figure 4 for Revealing the Blind Spot of Sentence Encoder Evaluation by HEROS
Viaarxiv icon

Improving Non-autoregressive Translation Quality with Pretrained Language Model, Embedding Distillation and Upsampling Strategy for CTC

Add code
Bookmark button
Alert button
Jun 10, 2023
Shen-sian Syu, Juncheng Xie, Hung-yi Lee

Figure 1 for Improving Non-autoregressive Translation Quality with Pretrained Language Model, Embedding Distillation and Upsampling Strategy for CTC
Figure 2 for Improving Non-autoregressive Translation Quality with Pretrained Language Model, Embedding Distillation and Upsampling Strategy for CTC
Figure 3 for Improving Non-autoregressive Translation Quality with Pretrained Language Model, Embedding Distillation and Upsampling Strategy for CTC
Figure 4 for Improving Non-autoregressive Translation Quality with Pretrained Language Model, Embedding Distillation and Upsampling Strategy for CTC
Viaarxiv icon

Why We Should Report the Details in Subjective Evaluation of TTS More Rigorously

Add code
Bookmark button
Alert button
Jun 03, 2023
Cheng-Han Chiang, Wei-Ping Huang, Hung-yi Lee

Figure 1 for Why We Should Report the Details in Subjective Evaluation of TTS More Rigorously
Figure 2 for Why We Should Report the Details in Subjective Evaluation of TTS More Rigorously
Figure 3 for Why We Should Report the Details in Subjective Evaluation of TTS More Rigorously
Figure 4 for Why We Should Report the Details in Subjective Evaluation of TTS More Rigorously
Viaarxiv icon

How to Estimate Model Transferability of Pre-Trained Speech Models?

Add code
Bookmark button
Alert button
Jun 01, 2023
Zih-Ching Chen, Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Shou-Yiin Chang, Rohit Prabhavalkar, Hung-yi Lee, Tara N. Sainath

Figure 1 for How to Estimate Model Transferability of Pre-Trained Speech Models?
Figure 2 for How to Estimate Model Transferability of Pre-Trained Speech Models?
Figure 3 for How to Estimate Model Transferability of Pre-Trained Speech Models?
Figure 4 for How to Estimate Model Transferability of Pre-Trained Speech Models?
Viaarxiv icon

MiniSUPERB: Lightweight Benchmark for Self-supervised Speech Models

Add code
Bookmark button
Alert button
May 30, 2023
Yu-Hsiang Wang, Huang-Yu Chen, Kai-Wei Chang, Winston Hsu, Hung-yi Lee

Figure 1 for MiniSUPERB: Lightweight Benchmark for Self-supervised Speech Models
Figure 2 for MiniSUPERB: Lightweight Benchmark for Self-supervised Speech Models
Figure 3 for MiniSUPERB: Lightweight Benchmark for Self-supervised Speech Models
Figure 4 for MiniSUPERB: Lightweight Benchmark for Self-supervised Speech Models
Viaarxiv icon