Alert button
Picture for Sirui Zhao

Sirui Zhao

Alert button

A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise

Add code
Bookmark button
Alert button
Dec 20, 2023
Chaoyou Fu, Renrui Zhang, Zihan Wang, Yubo Huang, Zhengye Zhang, Longtian Qiu, Gaoxiang Ye, Yunhang Shen, Mengdan Zhang, Peixian Chen, Sirui Zhao, Shaohui Lin, Deqiang Jiang, Di Yin, Peng Gao, Ke Li, Hongsheng Li, Xing Sun

Viaarxiv icon

APGL4SR: A Generic Framework with Adaptive and Personalized Global Collaborative Information in Sequential Recommendation

Add code
Bookmark button
Alert button
Nov 06, 2023
Mingjia Yin, Hao Wang, Xiang Xu, Likang Wu, Sirui Zhao, Wei Guo, Yong Liu, Ruiming Tang, Defu Lian, Enhong Chen

Figure 1 for APGL4SR: A Generic Framework with Adaptive and Personalized Global Collaborative Information in Sequential Recommendation
Figure 2 for APGL4SR: A Generic Framework with Adaptive and Personalized Global Collaborative Information in Sequential Recommendation
Figure 3 for APGL4SR: A Generic Framework with Adaptive and Personalized Global Collaborative Information in Sequential Recommendation
Figure 4 for APGL4SR: A Generic Framework with Adaptive and Personalized Global Collaborative Information in Sequential Recommendation
Viaarxiv icon

Woodpecker: Hallucination Correction for Multimodal Large Language Models

Add code
Bookmark button
Alert button
Oct 24, 2023
Shukang Yin, Chaoyou Fu, Sirui Zhao, Tong Xu, Hao Wang, Dianbo Sui, Yunhang Shen, Ke Li, Xing Sun, Enhong Chen

Figure 1 for Woodpecker: Hallucination Correction for Multimodal Large Language Models
Figure 2 for Woodpecker: Hallucination Correction for Multimodal Large Language Models
Figure 3 for Woodpecker: Hallucination Correction for Multimodal Large Language Models
Figure 4 for Woodpecker: Hallucination Correction for Multimodal Large Language Models
Viaarxiv icon

A Solution to CVPR'2023 AQTC Challenge: Video Alignment for Multi-Step Inference

Add code
Bookmark button
Alert button
Jun 26, 2023
Chao Zhang, Shiwei Wu, Sirui Zhao, Tong Xu, Enhong Chen

Figure 1 for A Solution to CVPR'2023 AQTC Challenge: Video Alignment for Multi-Step Inference
Figure 2 for A Solution to CVPR'2023 AQTC Challenge: Video Alignment for Multi-Step Inference
Figure 3 for A Solution to CVPR'2023 AQTC Challenge: Video Alignment for Multi-Step Inference
Figure 4 for A Solution to CVPR'2023 AQTC Challenge: Video Alignment for Multi-Step Inference
Viaarxiv icon

A Survey on Multimodal Large Language Models

Add code
Bookmark button
Alert button
Jun 23, 2023
Shukang Yin, Chaoyou Fu, Sirui Zhao, Ke Li, Xing Sun, Tong Xu, Enhong Chen

Figure 1 for A Survey on Multimodal Large Language Models
Figure 2 for A Survey on Multimodal Large Language Models
Figure 3 for A Survey on Multimodal Large Language Models
Figure 4 for A Survey on Multimodal Large Language Models
Viaarxiv icon

AU-aware graph convolutional network for Macro- and Micro-expression spotting

Add code
Bookmark button
Alert button
Mar 16, 2023
Shukang Yin, Shiwei Wu, Tong Xu, Shifeng Liu, Sirui Zhao, Enhong Chen

Figure 1 for AU-aware graph convolutional network for Macro- and Micro-expression spotting
Figure 2 for AU-aware graph convolutional network for Macro- and Micro-expression spotting
Figure 3 for AU-aware graph convolutional network for Macro- and Micro-expression spotting
Figure 4 for AU-aware graph convolutional network for Macro- and Micro-expression spotting
Viaarxiv icon

More is Better: A Database for Spontaneous Micro-Expression with High Frame Rates

Add code
Bookmark button
Alert button
Jan 03, 2023
Sirui Zhao, Huaying Tang, Xinglong Mao, Shifeng Liu, Hanqing Tao, Hao Wang, Tong Xu, Enhong Chen

Figure 1 for More is Better: A Database for Spontaneous Micro-Expression with High Frame Rates
Figure 2 for More is Better: A Database for Spontaneous Micro-Expression with High Frame Rates
Figure 3 for More is Better: A Database for Spontaneous Micro-Expression with High Frame Rates
Figure 4 for More is Better: A Database for Spontaneous Micro-Expression with High Frame Rates
Viaarxiv icon