Alert button
Picture for Xiatian Zhu

Xiatian Zhu

Alert button

Post-Processing Temporal Action Detection

Nov 27, 2022
Sauradip Nag, Xiatian Zhu, Yi-Zhe Song, Tao Xiang

Figure 1 for Post-Processing Temporal Action Detection
Figure 2 for Post-Processing Temporal Action Detection
Figure 3 for Post-Processing Temporal Action Detection
Figure 4 for Post-Processing Temporal Action Detection
Viaarxiv icon

Multi-Modal Few-Shot Temporal Action Detection via Vision-Language Meta-Adaptation

Nov 27, 2022
Sauradip Nag, Mengmeng Xu, Xiatian Zhu, Juan-Manuel Perez-Rua, Bernard Ghanem, Yi-Zhe Song, Tao Xiang

Figure 1 for Multi-Modal Few-Shot Temporal Action Detection via Vision-Language Meta-Adaptation
Figure 2 for Multi-Modal Few-Shot Temporal Action Detection via Vision-Language Meta-Adaptation
Figure 3 for Multi-Modal Few-Shot Temporal Action Detection via Vision-Language Meta-Adaptation
Figure 4 for Multi-Modal Few-Shot Temporal Action Detection via Vision-Language Meta-Adaptation
Viaarxiv icon

Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders

Oct 09, 2022
Haosen Yang, Deng Huang, Bin Wen, Jiannan Wu, Hongxun Yao, Yi Jiang, Xiatian Zhu, Zehuan Yuan

Figure 1 for Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders
Figure 2 for Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders
Figure 3 for Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders
Figure 4 for Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders
Viaarxiv icon

DeepInteraction: 3D Object Detection via Modality Interaction

Aug 24, 2022
Zeyu Yang, Jiaqi Chen, Zhenwei Miao, Wei Li, Xiatian Zhu, Li Zhang

Figure 1 for DeepInteraction: 3D Object Detection via Modality Interaction
Figure 2 for DeepInteraction: 3D Object Detection via Modality Interaction
Figure 3 for DeepInteraction: 3D Object Detection via Modality Interaction
Figure 4 for DeepInteraction: 3D Object Detection via Modality Interaction
Viaarxiv icon

Semi-Supervised and Unsupervised Deep Visual Learning: A Survey

Aug 24, 2022
Yanbei Chen, Massimiliano Mancini, Xiatian Zhu, Zeynep Akata

Figure 1 for Semi-Supervised and Unsupervised Deep Visual Learning: A Survey
Figure 2 for Semi-Supervised and Unsupervised Deep Visual Learning: A Survey
Figure 3 for Semi-Supervised and Unsupervised Deep Visual Learning: A Survey
Figure 4 for Semi-Supervised and Unsupervised Deep Visual Learning: A Survey
Viaarxiv icon

Accelerating Score-based Generative Models with Preconditioned Diffusion Sampling

Jul 19, 2022
Hengyuan Ma, Li Zhang, Xiatian Zhu, Jianfeng Feng

Figure 1 for Accelerating Score-based Generative Models with Preconditioned Diffusion Sampling
Figure 2 for Accelerating Score-based Generative Models with Preconditioned Diffusion Sampling
Figure 3 for Accelerating Score-based Generative Models with Preconditioned Diffusion Sampling
Figure 4 for Accelerating Score-based Generative Models with Preconditioned Diffusion Sampling
Viaarxiv icon

Visual Representation Learning with Transformer: A Sequence-to-Sequence Perspective

Jul 19, 2022
Li Zhang, Sixiao Zheng, Jiachen Lu, Xinxuan Zhao, Xiatian Zhu, Yanwei Fu, Tao Xiang, Jianfeng Feng

Viaarxiv icon

Zero-Shot Temporal Action Detection via Vision-Language Prompting

Jul 17, 2022
Sauradip Nag, Xiatian Zhu, Yi-Zhe Song, Tao Xiang

Figure 1 for Zero-Shot Temporal Action Detection via Vision-Language Prompting
Figure 2 for Zero-Shot Temporal Action Detection via Vision-Language Prompting
Figure 3 for Zero-Shot Temporal Action Detection via Vision-Language Prompting
Figure 4 for Zero-Shot Temporal Action Detection via Vision-Language Prompting
Viaarxiv icon

FashionViL: Fashion-Focused Vision-and-Language Representation Learning

Jul 17, 2022
Xiao Han, Licheng Yu, Xiatian Zhu, Li Zhang, Yi-Zhe Song, Tao Xiang

Figure 1 for FashionViL: Fashion-Focused Vision-and-Language Representation Learning
Figure 2 for FashionViL: Fashion-Focused Vision-and-Language Representation Learning
Figure 3 for FashionViL: Fashion-Focused Vision-and-Language Representation Learning
Figure 4 for FashionViL: Fashion-Focused Vision-and-Language Representation Learning
Viaarxiv icon

Semi-Supervised Temporal Action Detection with Proposal-Free Masking

Jul 14, 2022
Sauradip Nag, Xiatian Zhu, Yi-Zhe Song, Tao Xiang

Figure 1 for Semi-Supervised Temporal Action Detection with Proposal-Free Masking
Figure 2 for Semi-Supervised Temporal Action Detection with Proposal-Free Masking
Figure 3 for Semi-Supervised Temporal Action Detection with Proposal-Free Masking
Figure 4 for Semi-Supervised Temporal Action Detection with Proposal-Free Masking
Viaarxiv icon