Alert button
Picture for Hung-yi Lee

Hung-yi Lee

Alert button

A Large-Scale Evaluation of Speech Foundation Models

Add code
Bookmark button
Alert button
Apr 15, 2024
Shu-wen Yang, Heng-Jui Chang, Zili Huang, Andy T. Liu, Cheng-I Lai, Haibin Wu, Jiatong Shi, Xuankai Chang, Hsiang-Sheng Tsai, Wen-Chin Huang, Tzu-hsun Feng, Po-Han Chi, Yist Y. Lin, Yung-Sung Chuang, Tzu-Hsien Huang, Wei-Cheng Tseng, Kushal Lakhotia, Shang-Wen Li, Abdelrahman Mohamed, Shinji Watanabe, Hung-yi Lee

Viaarxiv icon

Merging Facts, Crafting Fallacies: Evaluating the Contradictory Nature of Aggregated Factual Claims in Long-Form Generations

Add code
Bookmark button
Alert button
Feb 23, 2024
Cheng-Han Chiang, Hung-yi Lee

Viaarxiv icon

Towards audio language modeling -- an overview

Add code
Bookmark button
Alert button
Feb 20, 2024
Haibin Wu, Xuanjun Chen, Yi-Cheng Lin, Kai-wei Chang, Ho-Lam Chung, Alexander H. Liu, Hung-yi Lee

Viaarxiv icon

Codec-SUPERB: An In-Depth Analysis of Sound Codec Models

Add code
Bookmark button
Alert button
Feb 20, 2024
Haibin Wu, Ho-Lam Chung, Yi-Cheng Lin, Yuan-Kuei Wu, Xuanjun Chen, Yu-Chi Pai, Hsiu-Hsuan Wang, Kai-Wei Chang, Alexander H. Liu, Hung-yi Lee

Viaarxiv icon

Advancing Large Language Models to Capture Varied Speaking Styles and Respond Properly in Spoken Conversations

Add code
Bookmark button
Alert button
Feb 20, 2024
Guan-Ting Lin, Cheng-Han Chiang, Hung-yi Lee

Viaarxiv icon

SpeechCLIP+: Self-supervised multi-task representation learning for speech via CLIP and speech-image data

Add code
Bookmark button
Alert button
Feb 10, 2024
Hsuan-Fu Wang, Yi-Jen Shih, Heng-Jui Chang, Layne Berry, Puyuan Peng, Hung-yi Lee, Hsin-Min Wang, David Harwath

Viaarxiv icon

Integrating Self-supervised Speech Model with Pseudo Word-level Targets from Visually-grounded Speech Model

Add code
Bookmark button
Alert button
Feb 08, 2024
Hung-Chieh Fang, Nai-Xuan Ye, Yi-Jen Shih, Puyuan Peng, Hsuan-Fu Wang, Layne Berry, Hung-yi Lee, David Harwath

Viaarxiv icon

REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR

Add code
Bookmark button
Alert button
Feb 06, 2024
Liang-Hsuan Tseng, En-Pei Hu, Cheng-Han Chiang, Yuan Tseng, Hung-yi Lee, Lin-shan Lee, Shao-Hua Sun

Viaarxiv icon

SpeechDPR: End-to-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering

Add code
Bookmark button
Alert button
Jan 24, 2024
Chyi-Jiunn Lin, Guan-Ting Lin, Yung-Sung Chuang, Wei-Lun Wu, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Lin-shan Lee

Viaarxiv icon