Alert button
Picture for Jingdong Wang

Jingdong Wang

Alert button

Beyond Attentive Tokens: Incorporating Token Importance and Diversity for Efficient Vision Transformers

Nov 21, 2022
Sifan Long, Zhen Zhao, Jimin Pi, Shengsheng Wang, Jingdong Wang

Figure 1 for Beyond Attentive Tokens: Incorporating Token Importance and Diversity for Efficient Vision Transformers
Figure 2 for Beyond Attentive Tokens: Incorporating Token Importance and Diversity for Efficient Vision Transformers
Figure 3 for Beyond Attentive Tokens: Incorporating Token Importance and Diversity for Efficient Vision Transformers
Figure 4 for Beyond Attentive Tokens: Incorporating Token Importance and Diversity for Efficient Vision Transformers
Viaarxiv icon

CAE v2: Context Autoencoder with CLIP Target

Nov 17, 2022
Xinyu Zhang, Jiahui Chen, Junkun Yuan, Qiang Chen, Jian Wang, Xiaodi Wang, Shumin Han, Xiaokang Chen, Jimin Pi, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang

Figure 1 for CAE v2: Context Autoencoder with CLIP Target
Figure 2 for CAE v2: Context Autoencoder with CLIP Target
Figure 3 for CAE v2: Context Autoencoder with CLIP Target
Figure 4 for CAE v2: Context Autoencoder with CLIP Target
Viaarxiv icon

Group DETR v2: Strong Object Detector with Encoder-Decoder Pretraining

Nov 07, 2022
Qiang Chen, Jian Wang, Chuchu Han, Shan Zhang, Zexian Li, Xiaokang Chen, Jiahui Chen, Xiaodi Wang, Shuming Han, Gang Zhang, Haocheng Feng, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang

Figure 1 for Group DETR v2: Strong Object Detector with Encoder-Decoder Pretraining
Figure 2 for Group DETR v2: Strong Object Detector with Encoder-Decoder Pretraining
Viaarxiv icon

RTFormer: Efficient Design for Real-Time Semantic Segmentation with Transformer

Oct 13, 2022
Jian Wang, Chenhui Gou, Qiman Wu, Haocheng Feng, Junyu Han, Errui Ding, Jingdong Wang

Figure 1 for RTFormer: Efficient Design for Real-Time Semantic Segmentation with Transformer
Figure 2 for RTFormer: Efficient Design for Real-Time Semantic Segmentation with Transformer
Figure 3 for RTFormer: Efficient Design for Real-Time Semantic Segmentation with Transformer
Figure 4 for RTFormer: Efficient Design for Real-Time Semantic Segmentation with Transformer
Viaarxiv icon

It Takes Two: Masked Appearance-Motion Modeling for Self-supervised Video Transformer Pre-training

Oct 11, 2022
Yuxin Song, Min Yang, Wenhao Wu, Dongliang He, Fu Li, Jingdong Wang

Figure 1 for It Takes Two: Masked Appearance-Motion Modeling for Self-supervised Video Transformer Pre-training
Figure 2 for It Takes Two: Masked Appearance-Motion Modeling for Self-supervised Video Transformer Pre-training
Figure 3 for It Takes Two: Masked Appearance-Motion Modeling for Self-supervised Video Transformer Pre-training
Figure 4 for It Takes Two: Masked Appearance-Motion Modeling for Self-supervised Video Transformer Pre-training
Viaarxiv icon

StyleSwap: Style-Based Generator Empowers Robust Face Swapping

Sep 27, 2022
Zhiliang Xu, Hang Zhou, Zhibin Hong, Ziwei Liu, Jiaming Liu, Zhizhi Guo, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang

Figure 1 for StyleSwap: Style-Based Generator Empowers Robust Face Swapping
Figure 2 for StyleSwap: Style-Based Generator Empowers Robust Face Swapping
Figure 3 for StyleSwap: Style-Based Generator Empowers Robust Face Swapping
Figure 4 for StyleSwap: Style-Based Generator Empowers Robust Face Swapping
Viaarxiv icon

NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance Fields

Sep 24, 2022
Jiankai Sun, Yan Xu, Mingyu Ding, Hongwei Yi, Jingdong Wang, Liangjun Zhang, Mac Schwager

Figure 1 for NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance Fields
Figure 2 for NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance Fields
Figure 3 for NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance Fields
Figure 4 for NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance Fields
Viaarxiv icon

TRUST: An Accurate and End-to-End Table structure Recognizer Using Splitting-based Transformers

Aug 31, 2022
Zengyuan Guo, Yuechen Yu, Pengyuan Lv, Chengquan Zhang, Haojie Li, Zhihui Wang, Kun Yao, Jingtuo Liu, Jingdong Wang

Figure 1 for TRUST: An Accurate and End-to-End Table structure Recognizer Using Splitting-based Transformers
Figure 2 for TRUST: An Accurate and End-to-End Table structure Recognizer Using Splitting-based Transformers
Figure 3 for TRUST: An Accurate and End-to-End Table structure Recognizer Using Splitting-based Transformers
Figure 4 for TRUST: An Accurate and End-to-End Table structure Recognizer Using Splitting-based Transformers
Viaarxiv icon

CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval

Aug 21, 2022
Haoran Wang, Dongliang He, Wenhao Wu, Boyang Xia, Min Yang, Fu Li, Yunlong Yu, Zhong Ji, Errui Ding, Jingdong Wang

Figure 1 for CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval
Figure 2 for CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval
Figure 3 for CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval
Figure 4 for CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval
Viaarxiv icon

Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment

Aug 02, 2022
Qiang Chen, Xiaokang Chen, Jian Wang, Haocheng Feng, Junyu Han, Errui Ding, Gang Zeng, Jingdong Wang

Figure 1 for Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment
Figure 2 for Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment
Figure 3 for Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment
Figure 4 for Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment
Viaarxiv icon