Haibin Wu

EMO-SUPERB: An In-depth Look at Speech Emotion Recognition

Feb 22, 2024
Haibin Wu, Huang-Cheng Chou, Kai-Wei Chang, Lucas Goncalves, Jiawei Du, Jyh-Shing Roger Jang, Chi-Chun Lee, Hung-yi Lee

Towards audio language modeling -- an overview

Feb 20, 2024
Haibin Wu, Xuanjun Chen, Yi-Cheng Lin, Kai-Wei Chang, Ho-Lam Chung, Alexander H. Liu, Hung-yi Lee

Codec-SUPERB: An In-Depth Analysis of Sound Codec Models

Feb 20, 2024
Haibin Wu, Ho-Lam Chung, Yi-Cheng Lin, Yuan-Kuei Wu, Xuanjun Chen, Yu-Chi Pai, Hsiu-Hsuan Wang, Kai-Wei Chang, Alexander H. Liu, Hung-yi Lee

Scalable Ensemble-based Detection Method against Adversarial Attacks for speaker verification

Dec 14, 2023
Haibin Wu, Heng-Cheng Kuo, Yu Tsao, Hung-yi Lee

AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models

Sep 19, 2023
Yuan Tseng, Layne Berry, Yi-Ting Chen, I-Hsiang Chiu, Hsuan-Hao Lin, Max Liu, Puyuan Peng, Yi-Jen Shih, Hung-Yu Wang, Haibin Wu, Po-Yao Huang, Chun-Mao Lai, Shang-Wen Li, David Harwath, Yu Tsao, Shinji Watanabe, Abdelrahman Mohamed, Chi-Luen Feng, Hung-yi Lee

Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech

Sep 18, 2023
Chien-yu Huang, Ke-Han Lu, Shih-Heng Wang, Chi-Yuan Hsiao, Chun-Yi Kuan, Haibin Wu, Siddhant Arora, Kai-Wei Chang, Jiatong Shi, Yifan Peng, Roshan Sharma, Shinji Watanabe, Bhiksha Ramakrishnan, Shady Shehata, Hung-yi Lee

SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts

Jun 19, 2023
Haibin Wu, Kai-Wei Chang, Yuan-Kuei Wu, Hung-yi Lee

Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator

May 25, 2023
Lingwei Meng, Jiawen Kang, Mingyu Cui, Haibin Wu, Xixin Wu, Helen Meng

The defender's perspective on automatic speaker verification: An overview

May 22, 2023
Haibin Wu, Jiawen Kang, Lingwei Meng, Helen Meng, Hung-yi Lee
