Yuanxin Liu

Towards Multimodal Video Paragraph Captioning Models Robust to Missing Modality

Mar 28, 2024
Sishuo Chen, Lei Li, Shuhuai Ren, Rundong Gao, Yuanxin Liu, Xiaohan Bi, Xu Sun, Lu Hou

Via arXiv

TempCompass: Do Video LLMs Really Understand Videos?

Mar 01, 2024
Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, Lei Li, Sishuo Chen, Xu Sun, Lu Hou

Via arXiv

VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models

Nov 29, 2023
Shicheng Li, Lei Li, Shuhuai Ren, Yuanxin Liu, Yi Liu, Rundong Gao, Xu Sun, Lu Hou

Via arXiv

FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation

Nov 08, 2023
Yuanxin Liu, Lei Li, Shuhuai Ren, Rundong Gao, Shicheng Li, Sishuo Chen, Xu Sun, Lu Hou

Via arXiv

COST-EFF: Collaborative Optimization of Spatial and Temporal Efficiency with Slenderized Multi-exit Language Models

Oct 27, 2022
Bowen Shen, Zheng Lin, Yuanxin Liu, Zhengxiao Liu, Lei Wang, Weiping Wang

Via arXiv

Compressing And Debiasing Vision-Language Pre-Trained Models for Visual Question Answering

Oct 26, 2022
Qingyi Si, Yuanxin Liu, Zheng Lin, Peng Fu, Weiping Wang

Via arXiv

A Win-win Deal: Towards Sparse and Robust Pre-trained Language Models

Oct 11, 2022
Yuanxin Liu, Fandong Meng, Zheng Lin, Jiangnan Li, Peng Fu, Yanan Cao, Weiping Wang, Jie Zhou

Via arXiv

Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA

Oct 10, 2022
Qingyi Si, Fandong Meng, Mingyu Zheng, Zheng Lin, Yuanxin Liu, Peng Fu, Yanan Cao, Weiping Wang, Jie Zhou

Via arXiv

Towards Robust Visual Question Answering: Making the Most of Biased Samples via Contrastive Learning

Oct 10, 2022
Qingyi Si, Yuanxin Liu, Fandong Meng, Zheng Lin, Peng Fu, Yanan Cao, Weiping Wang, Jie Zhou

Via arXiv