Jingdong Wang

Ambiguity-Resistant Semi-Supervised Learning for Dense Object Detection

Mar 27, 2023
Chang Liu, Weiming Zhang, Xiangru Lin, Wei Zhang, Xiao Tan, Junyu Han, Xiaomao Li, Errui Ding, Jingdong Wang

Via arXiv

CAPE: Camera View Position Embedding for Multi-View 3D Object Detection

Mar 17, 2023
Kaixin Xiong, Shi Gong, Xiaoqing Ye, Xiao Tan, Ji Wan, Errui Ding, Jingdong Wang, Xiang Bai

Via arXiv

PSVT: End-to-End Multi-person 3D Pose and Shape Estimation with Progressive Video Transformers

Mar 16, 2023
Zhongwei Qiu, Yang Qiansheng, Jian Wang, Haocheng Feng, Junyu Han, Errui Ding, Chang Xu, Dongmei Fu, Jingdong Wang

Via arXiv

Delicate Textured Mesh Recovery from NeRF via Adaptive Surface Refinement

Mar 03, 2023
Jiaxiang Tang, Hang Zhou, Xiaokang Chen, Tianshu Hu, Errui Ding, Jingdong Wang, Gang Zeng

Via arXiv

StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training

Mar 01, 2023
Yuechen Yu, Yulin Li, Chengquan Zhang, Xiaoqiang Zhang, Zengyuan Guo, Xiameng Qin, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang

Via arXiv

Make Your Brief Stroke Real and Stereoscopic: 3D-Aware Simplified Sketch to Portrait Generation

Feb 14, 2023
Yasheng Sun, Qianyi Wu, Hang Zhou, Kaisiyuan Wang, Tianshu Hu, Chen-Chieh Liao, Dongliang He, Jingtuo Liu, Errui Ding, Jingdong Wang, Shio Miyafuji, Ziwei Liu, Hideki Koike

Via arXiv

Understanding Self-Supervised Pretraining with Part-Aware Representation Learning

Jan 27, 2023
Jie Zhu, Jiyang Qi, Mingyu Ding, Xiaokang Chen, Ping Luo, Xinggang Wang, Wenyu Liu, Leye Wang, Jingdong Wang

Via arXiv

Graph Contrastive Learning for Skeleton-based Action Recognition

Jan 26, 2023
Xiaohu Huang, Hao Zhou, Bin Feng, Xinggang Wang, Wenyu Liu, Jian Wang, Haocheng Feng, Junyu Han, Errui Ding, Jingdong Wang

Via arXiv

Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?

Dec 31, 2022
Wenhao Wu, Haipeng Luo, Bo Fang, Jingdong Wang, Wanli Ouyang

Via arXiv

Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models

Dec 31, 2022
Wenhao Wu, Xiaohan Wang, Haipeng Luo, Jingdong Wang, Yi Yang, Wanli Ouyang

Via arXiv