Alert button
Picture for Dongdong Chen

Dongdong Chen

Alert button

Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling

Add code
Bookmark button
Alert button
Aug 25, 2022
Rui Wang, Zuxuan Wu, Dongdong Chen, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Luowei Zhou, Lu Yuan, Yu-Gang Jiang

Figure 1 for Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling
Figure 2 for Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling
Figure 3 for Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling
Figure 4 for Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling
Viaarxiv icon

Bootstrapped Masked Autoencoders for Vision BERT Pretraining

Add code
Bookmark button
Alert button
Jul 14, 2022
Xiaoyi Dong, Jianmin Bao, Ting Zhang, Dongdong Chen, Weiming Zhang, Lu Yuan, Dong Chen, Fang Wen, Nenghai Yu

Figure 1 for Bootstrapped Masked Autoencoders for Vision BERT Pretraining
Figure 2 for Bootstrapped Masked Autoencoders for Vision BERT Pretraining
Figure 3 for Bootstrapped Masked Autoencoders for Vision BERT Pretraining
Figure 4 for Bootstrapped Masked Autoencoders for Vision BERT Pretraining
Viaarxiv icon

Should All Proposals be Treated Equally in Object Detection?

Add code
Bookmark button
Alert button
Jul 07, 2022
Yunsheng Li, Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Pei Yu, Jing Yin, Lu Yuan, Zicheng Liu, Nuno Vasconcelos

Figure 1 for Should All Proposals be Treated Equally in Object Detection?
Figure 2 for Should All Proposals be Treated Equally in Object Detection?
Figure 3 for Should All Proposals be Treated Equally in Object Detection?
Figure 4 for Should All Proposals be Treated Equally in Object Detection?
Viaarxiv icon

Semantic Image Synthesis via Diffusion Models

Add code
Bookmark button
Alert button
Jun 30, 2022
Weilun Wang, Jianmin Bao, Wengang Zhou, Dongdong Chen, Dong Chen, Lu Yuan, Houqiang Li

Figure 1 for Semantic Image Synthesis via Diffusion Models
Figure 2 for Semantic Image Synthesis via Diffusion Models
Figure 3 for Semantic Image Synthesis via Diffusion Models
Figure 4 for Semantic Image Synthesis via Diffusion Models
Viaarxiv icon

Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding

Add code
Bookmark button
Alert button
Jun 07, 2022
Lingchen Meng, Xiyang Dai, Yinpeng Chen, Pengchuan Zhang, Dongdong Chen, Mengchen Liu, Jianfeng Wang, Zuxuan Wu, Lu Yuan, Yu-Gang Jiang

Figure 1 for Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding
Figure 2 for Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding
Figure 3 for Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding
Figure 4 for Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding
Viaarxiv icon

REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering

Add code
Bookmark button
Alert button
Jun 02, 2022
Yuanze Lin, Yujia Xie, Dongdong Chen, Yichong Xu, Chenguang Zhu, Lu Yuan

Figure 1 for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering
Figure 2 for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering
Figure 3 for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering
Figure 4 for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering
Viaarxiv icon

Reduce Information Loss in Transformers for Pluralistic Image Inpainting

Add code
Bookmark button
Alert button
May 15, 2022
Qiankun Liu, Zhentao Tan, Dongdong Chen, Qi Chu, Xiyang Dai, Yinpeng Chen, Mengchen Liu, Lu Yuan, Nenghai Yu

Figure 1 for Reduce Information Loss in Transformers for Pluralistic Image Inpainting
Figure 2 for Reduce Information Loss in Transformers for Pluralistic Image Inpainting
Figure 3 for Reduce Information Loss in Transformers for Pluralistic Image Inpainting
Figure 4 for Reduce Information Loss in Transformers for Pluralistic Image Inpainting
Viaarxiv icon

i-Code: An Integrative and Composable Multimodal Learning Framework

Add code
Bookmark button
Alert button
May 05, 2022
Ziyi Yang, Yuwei Fang, Chenguang Zhu, Reid Pryzant, Dongdong Chen, Yu Shi, Yichong Xu, Yao Qian, Mei Gao, Yi-Ling Chen, Liyang Lu, Yujia Xie, Robert Gmyr, Noel Codella, Naoyuki Kanda, Bin Xiao, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang

Figure 1 for i-Code: An Integrative and Composable Multimodal Learning Framework
Figure 2 for i-Code: An Integrative and Composable Multimodal Learning Framework
Figure 3 for i-Code: An Integrative and Composable Multimodal Learning Framework
Figure 4 for i-Code: An Integrative and Composable Multimodal Learning Framework
Viaarxiv icon

Residual Mixture of Experts

Add code
Bookmark button
Alert button
Apr 20, 2022
Lemeng Wu, Mengchen Liu, Yinpeng Chen, Dongdong Chen, Xiyang Dai, Lu Yuan

Figure 1 for Residual Mixture of Experts
Figure 2 for Residual Mixture of Experts
Figure 3 for Residual Mixture of Experts
Figure 4 for Residual Mixture of Experts
Viaarxiv icon

Protecting Celebrities from DeepFake with Identity Consistency Transformer

Add code
Bookmark button
Alert button
Apr 05, 2022
Xiaoyi Dong, Jianmin Bao, Dongdong Chen, Ting Zhang, Weiming Zhang, Nenghai Yu, Dong Chen, Fang Wen, Baining Guo

Figure 1 for Protecting Celebrities from DeepFake with Identity Consistency Transformer
Figure 2 for Protecting Celebrities from DeepFake with Identity Consistency Transformer
Figure 3 for Protecting Celebrities from DeepFake with Identity Consistency Transformer
Figure 4 for Protecting Celebrities from DeepFake with Identity Consistency Transformer
Viaarxiv icon