Alert button
Picture for Xu Sun

Xu Sun

Alert button

Towards Multimodal Video Paragraph Captioning Models Robust to Missing Modality

Add code
Bookmark button
Alert button
Mar 28, 2024
Sishuo Chen, Lei Li, Shuhuai Ren, Rundong Gao, Yuanxin Liu, Xiaohan Bi, Xu Sun, Lu Hou

Viaarxiv icon

TempCompass: Do Video LLMs Really Understand Videos?

Add code
Bookmark button
Alert button
Mar 01, 2024
Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, Lei Li, Sishuo Chen, Xu Sun, Lu Hou

Figure 1 for TempCompass: Do Video LLMs Really Understand Videos?
Figure 2 for TempCompass: Do Video LLMs Really Understand Videos?
Figure 3 for TempCompass: Do Video LLMs Really Understand Videos?
Figure 4 for TempCompass: Do Video LLMs Really Understand Videos?
Viaarxiv icon

Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents

Add code
Bookmark button
Alert button
Feb 17, 2024
Wenkai Yang, Xiaohan Bi, Yankai Lin, Sishuo Chen, Jie Zhou, Xu Sun

Viaarxiv icon

TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

Add code
Bookmark button
Alert button
Dec 04, 2023
Shuhuai Ren, Linli Yao, Shicheng Li, Xu Sun, Lu Hou

Viaarxiv icon

VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models

Add code
Bookmark button
Alert button
Nov 29, 2023
Shicheng Li, Lei Li, Shuhuai Ren, Yuanxin Liu, Yi Liu, Rundong Gao, Xu Sun, Lu Hou

Viaarxiv icon

RECALL: A Benchmark for LLMs Robustness against External Counterfactual Knowledge

Add code
Bookmark button
Alert button
Nov 14, 2023
Yi Liu, Lianzhe Huang, Shicheng Li, Sishuo Chen, Hao Zhou, Fandong Meng, Jie Zhou, Xu Sun

Viaarxiv icon

FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation

Add code
Bookmark button
Alert button
Nov 08, 2023
Yuanxin Liu, Lei Li, Shuhuai Ren, Rundong Gao, Shicheng Li, Sishuo Chen, Xu Sun, Lu Hou

Figure 1 for FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation
Figure 2 for FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation
Figure 3 for FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation
Figure 4 for FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation
Viaarxiv icon

TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding

Add code
Bookmark button
Alert button
Oct 29, 2023
Shuhuai Ren, Sishuo Chen, Shicheng Li, Xu Sun, Lu Hou

Viaarxiv icon

Incorporating Pre-trained Model Prompting in Multimodal Stock Volume Movement Prediction

Add code
Bookmark button
Alert button
Sep 11, 2023
Ruibo Chen, Zhiyuan Zhang, Yi Liu, Ruihan Bao, Keiko Harimoto, Xu Sun

Figure 1 for Incorporating Pre-trained Model Prompting in Multimodal Stock Volume Movement Prediction
Figure 2 for Incorporating Pre-trained Model Prompting in Multimodal Stock Volume Movement Prediction
Figure 3 for Incorporating Pre-trained Model Prompting in Multimodal Stock Volume Movement Prediction
Figure 4 for Incorporating Pre-trained Model Prompting in Multimodal Stock Volume Movement Prediction
Viaarxiv icon