Yunlong Tang

V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning

Apr 18, 2024
Hang Hua, Yunlong Tang, Chenliang Xu, Jiebo Luo

Via arXiv

DPStyler: Dynamic PromptStyler for Source-Free Domain Generalization

Mar 25, 2024
Yunlong Tang, Yuxuan Wan, Lei Qi, Xin Geng

Via arXiv

AVicuna: Audio-Visual LLM with Interleaver and Context-Boundary Alignment for Temporal Referential Dialogue

Mar 24, 2024
Yunlong Tang, Daiki Shimada, Jing Bi, Chenliang Xu

Via arXiv

Emo-Avatar: Efficient Monocular Video Style Avatar through Texture Rendering

Feb 01, 2024
Pinxin Liu, Luchuan Song, Daoan Zhang, Hang Hua, Yunlong Tang, Huaijin Tu, Jiebo Luo, Chenliang Xu

Via arXiv

Video Understanding with Large Language Models: A Survey

Jan 04, 2024
Yunlong Tang, Jing Bi, Siting Xu, Luchuan Song, Susan Liang, Teng Wang, Daoan Zhang, Jie An, Jingyang Lin, Rongyi Zhu, Ali Vosoughi, Chao Huang, Zeliang Zhang, Feng Zheng, Jianguo Zhang, Ping Luo, Jiebo Luo, Chenliang Xu

Via arXiv

LaunchpadGPT: Language Model as Music Visualization Designer on Launchpad

Jul 23, 2023
Siting Xu, Yunlong Tang, Feng Zheng

Via arXiv

LLMVA-GEBC: Large Language Model with Video Adapter for Generic Event Boundary Captioning

Jun 17, 2023
Yunlong Tang, Jinrui Zhang, Xiangchen Wang, Teng Wang, Feng Zheng

Via arXiv

Caption Anything: Interactive Image Description with Diverse Multimodal Controls

May 08, 2023
Teng Wang, Jinrui Zhang, Junjie Fei, Hao Zheng, Yunlong Tang, Zhe Li, Mingqi Gao, Shanshan Zhao

Via arXiv