Alert button
Picture for Xiang-Dong Zhou

Xiang-Dong Zhou

Alert button

Efficient End-to-End Video Question Answering with Pyramidal Multimodal Transformer

Add code
Bookmark button
Alert button
Feb 04, 2023
Min Peng, Chongyang Wang, Yu Shi, Xiang-Dong Zhou

Figure 1 for Efficient End-to-End Video Question Answering with Pyramidal Multimodal Transformer
Figure 2 for Efficient End-to-End Video Question Answering with Pyramidal Multimodal Transformer
Figure 3 for Efficient End-to-End Video Question Answering with Pyramidal Multimodal Transformer
Figure 4 for Efficient End-to-End Video Question Answering with Pyramidal Multimodal Transformer
Viaarxiv icon

Multilevel Hierarchical Network with Multiscale Sampling for Video Question Answering

Add code
Bookmark button
Alert button
May 09, 2022
Min Peng, Chongyang Wang, Yuan Gao, Yu Shi, Xiang-Dong Zhou

Figure 1 for Multilevel Hierarchical Network with Multiscale Sampling for Video Question Answering
Figure 2 for Multilevel Hierarchical Network with Multiscale Sampling for Video Question Answering
Figure 3 for Multilevel Hierarchical Network with Multiscale Sampling for Video Question Answering
Figure 4 for Multilevel Hierarchical Network with Multiscale Sampling for Video Question Answering
Viaarxiv icon

Temporal Pyramid Transformer with Multimodal Interaction for Video Question Answering

Add code
Bookmark button
Alert button
Sep 10, 2021
Min Peng, Chongyang Wang, Yuan Gao, Yu Shi, Xiang-Dong Zhou

Figure 1 for Temporal Pyramid Transformer with Multimodal Interaction for Video Question Answering
Figure 2 for Temporal Pyramid Transformer with Multimodal Interaction for Video Question Answering
Figure 3 for Temporal Pyramid Transformer with Multimodal Interaction for Video Question Answering
Figure 4 for Temporal Pyramid Transformer with Multimodal Interaction for Video Question Answering
Viaarxiv icon

STA-VPR: Spatio-temporal Alignment for Visual Place Recognition

Add code
Bookmark button
Alert button
Apr 09, 2021
Feng Lu, Baifan Chen, Xiang-Dong Zhou, Dezhen Song

Figure 1 for STA-VPR: Spatio-temporal Alignment for Visual Place Recognition
Figure 2 for STA-VPR: Spatio-temporal Alignment for Visual Place Recognition
Figure 3 for STA-VPR: Spatio-temporal Alignment for Visual Place Recognition
Figure 4 for STA-VPR: Spatio-temporal Alignment for Visual Place Recognition
Viaarxiv icon

Recognizing Micro-expression in Video Clip with Adaptive Key-frame Mining

Add code
Bookmark button
Alert button
Sep 19, 2020
Min Peng, Chongyang Wang, Yuan Gao, Tao Bi, Tong Chen, Yu Shi, Xiang-Dong Zhou

Figure 1 for Recognizing Micro-expression in Video Clip with Adaptive Key-frame Mining
Figure 2 for Recognizing Micro-expression in Video Clip with Adaptive Key-frame Mining
Figure 3 for Recognizing Micro-expression in Video Clip with Adaptive Key-frame Mining
Figure 4 for Recognizing Micro-expression in Video Clip with Adaptive Key-frame Mining
Viaarxiv icon