Alert button
Picture for Dongdong Chen

Dongdong Chen

Alert button

Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles

Nov 29, 2022
Shuquan Ye, Yujia Xie, Dongdong Chen, Yichong Xu, Lu Yuan, Chenguang Zhu, Jing Liao

Figure 1 for Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles
Figure 2 for Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles
Figure 3 for Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles
Figure 4 for Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles
Viaarxiv icon

Self-Supervised Learning based on Heat Equation

Nov 23, 2022
Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Lu Yuan, Zicheng Liu, Youzuo Lin

Figure 1 for Self-Supervised Learning based on Heat Equation
Figure 2 for Self-Supervised Learning based on Heat Equation
Figure 3 for Self-Supervised Learning based on Heat Equation
Figure 4 for Self-Supervised Learning based on Heat Equation
Viaarxiv icon

SinDiffusion: Learning a Diffusion Model from a Single Natural Image

Nov 22, 2022
Weilun Wang, Jianmin Bao, Wengang Zhou, Dongdong Chen, Dong Chen, Lu Yuan, Houqiang Li

Figure 1 for SinDiffusion: Learning a Diffusion Model from a Single Natural Image
Figure 2 for SinDiffusion: Learning a Diffusion Model from a Single Natural Image
Figure 3 for SinDiffusion: Learning a Diffusion Model from a Single Natural Image
Figure 4 for SinDiffusion: Learning a Diffusion Model from a Single Natural Image
Viaarxiv icon

PointCAT: Contrastive Adversarial Training for Robust Point Cloud Recognition

Sep 16, 2022
Qidong Huang, Xiaoyi Dong, Dongdong Chen, Hang Zhou, Weiming Zhang, Kui Zhang, Gang Hua, Nenghai Yu

Figure 1 for PointCAT: Contrastive Adversarial Training for Robust Point Cloud Recognition
Figure 2 for PointCAT: Contrastive Adversarial Training for Robust Point Cloud Recognition
Figure 3 for PointCAT: Contrastive Adversarial Training for Robust Point Cloud Recognition
Figure 4 for PointCAT: Contrastive Adversarial Training for Robust Point Cloud Recognition
Viaarxiv icon

OmniVL:One Foundation Model for Image-Language and Video-Language Tasks

Sep 15, 2022
Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Luowei Zhou, Yucheng Zhao, Yujia Xie, Ce Liu, Yu-Gang Jiang, Lu Yuan

Figure 1 for OmniVL:One Foundation Model for Image-Language and Video-Language Tasks
Figure 2 for OmniVL:One Foundation Model for Image-Language and Video-Language Tasks
Figure 3 for OmniVL:One Foundation Model for Image-Language and Video-Language Tasks
Figure 4 for OmniVL:One Foundation Model for Image-Language and Video-Language Tasks
Viaarxiv icon

Imaging with Equivariant Deep Learning

Sep 05, 2022
Dongdong Chen, Mike Davies, Matthias J. Ehrhardt, Carola-Bibiane Schönlieb, Ferdia Sherry, Julián Tachella

Figure 1 for Imaging with Equivariant Deep Learning
Figure 2 for Imaging with Equivariant Deep Learning
Figure 3 for Imaging with Equivariant Deep Learning
Figure 4 for Imaging with Equivariant Deep Learning
Viaarxiv icon

MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining

Aug 25, 2022
Xiaoyi Dong, Yinglin Zheng, Jianmin Bao, Ting Zhang, Dongdong Chen, Hao Yang, Ming Zeng, Weiming Zhang, Lu Yuan, Dong Chen, Fang Wen, Nenghai Yu

Figure 1 for MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining
Figure 2 for MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining
Figure 3 for MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining
Figure 4 for MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining
Viaarxiv icon

Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling

Aug 25, 2022
Rui Wang, Zuxuan Wu, Dongdong Chen, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Luowei Zhou, Lu Yuan, Yu-Gang Jiang

Figure 1 for Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling
Figure 2 for Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling
Figure 3 for Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling
Figure 4 for Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling
Viaarxiv icon

Bootstrapped Masked Autoencoders for Vision BERT Pretraining

Jul 14, 2022
Xiaoyi Dong, Jianmin Bao, Ting Zhang, Dongdong Chen, Weiming Zhang, Lu Yuan, Dong Chen, Fang Wen, Nenghai Yu

Figure 1 for Bootstrapped Masked Autoencoders for Vision BERT Pretraining
Figure 2 for Bootstrapped Masked Autoencoders for Vision BERT Pretraining
Figure 3 for Bootstrapped Masked Autoencoders for Vision BERT Pretraining
Figure 4 for Bootstrapped Masked Autoencoders for Vision BERT Pretraining
Viaarxiv icon

Should All Proposals be Treated Equally in Object Detection?

Jul 07, 2022
Yunsheng Li, Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Pei Yu, Jing Yin, Lu Yuan, Zicheng Liu, Nuno Vasconcelos

Figure 1 for Should All Proposals be Treated Equally in Object Detection?
Figure 2 for Should All Proposals be Treated Equally in Object Detection?
Figure 3 for Should All Proposals be Treated Equally in Object Detection?
Figure 4 for Should All Proposals be Treated Equally in Object Detection?
Viaarxiv icon