Zuxuan Wu

Unified Multimodal Pre-training and Prompt-based Tuning for Vision-Language Understanding and Generation

Dec 15, 2021
Tianyi Liu, Zuxuan Wu, Wenhan Xiong, Jingjing Chen, Yu-Gang Jiang

Via arXiv

Cross-Modal Transferable Adversarial Attacks from Images to Videos

Dec 10, 2021
Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang

Via arXiv

BEVT: BERT Pretraining of Video Transformers

Dec 02, 2021
Rui Wang, Dongdong Chen, Zuxuan Wu, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Yu-Gang Jiang, Luowei Zhou, Lu Yuan

Via arXiv

AdaViT: Adaptive Vision Transformers for Efficient Image Recognition

Nov 30, 2021
Lingchen Meng, Hengduo Li, Bor-Chun Chen, Shiyi Lan, Zuxuan Wu, Yu-Gang Jiang, Ser-Nam Lim

Via arXiv

Efficient Video Transformers with Spatial-Temporal Token Selection

Nov 23, 2021
Junke Wang, Xitong Yang, Hengduo Li, Zuxuan Wu, Yu-Gang Jiang

Via arXiv

Semi-Supervised Vision Transformers

Nov 22, 2021
Zejia Weng, Xitong Yang, Ang Li, Zuxuan Wu, Yu-Gang Jiang

Via arXiv

Attacking Video Recognition Models with Bullet-Screen Comments

Oct 29, 2021
Kai Chen, Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang

Via arXiv

Boosting the Transferability of Video Adversarial Examples via Temporal Translation

Oct 18, 2021
Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang

Via arXiv

Self-supervised Learning for Semi-supervised Temporal Language Grounding

Sep 23, 2021
Fan Luo, Shaoxiang Chen, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang

Via arXiv