Alert button
Picture for Heyang Liu

Heyang Liu

Alert button

M$^3$AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset

Add code
Bookmark button
Alert button
Mar 21, 2024
Zhe Chen, Heyang Liu, Wenyi Yu, Guangzhi Sun, Hongcheng Liu, Ji Wu, Chao Zhang, Yu Wang, Yanfeng Wang

Figure 1 for M$^3$AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset
Figure 2 for M$^3$AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset
Figure 3 for M$^3$AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset
Figure 4 for M$^3$AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset
Viaarxiv icon

Post-decoder Biasing for End-to-End Speech Recognition of Multi-turn Medical Interview

Add code
Bookmark button
Alert button
Mar 01, 2024
Heyang Liu, Yu Wang, Yanfeng Wang

Figure 1 for Post-decoder Biasing for End-to-End Speech Recognition of Multi-turn Medical Interview
Figure 2 for Post-decoder Biasing for End-to-End Speech Recognition of Multi-turn Medical Interview
Figure 3 for Post-decoder Biasing for End-to-End Speech Recognition of Multi-turn Medical Interview
Figure 4 for Post-decoder Biasing for End-to-End Speech Recognition of Multi-turn Medical Interview
Viaarxiv icon

MM-SAP: A Comprehensive Benchmark for Assessing Self-Awareness of Multimodal Large Language Models in Perception

Add code
Bookmark button
Alert button
Jan 15, 2024
Yuhao Wang, Yusheng Liao, Heyang Liu, Hongcheng Liu, Yu Wang, Yanfeng Wang

Viaarxiv icon

LibriSQA: Advancing Free-form and Open-ended Spoken Question Answering with a Novel Dataset and Framework

Add code
Bookmark button
Alert button
Aug 30, 2023
Zihan Zhao, Yiyang Jiang, Heyang Liu, Yanfeng Wang, Yu Wang

Viaarxiv icon