Alert button
Picture for Jingdong Wang

Jingdong Wang

Alert button

Semi-DETR: Semi-Supervised Object Detection with Detection Transformers

Add code
Bookmark button
Alert button
Jul 16, 2023
Jiacheng Zhang, Xiangru Lin, Wei Zhang, Kuo Wang, Xiao Tan, Junyu Han, Errui Ding, Jingdong Wang, Guanbin Li

Figure 1 for Semi-DETR: Semi-Supervised Object Detection with Detection Transformers
Figure 2 for Semi-DETR: Semi-Supervised Object Detection with Detection Transformers
Figure 3 for Semi-DETR: Semi-Supervised Object Detection with Detection Transformers
Figure 4 for Semi-DETR: Semi-Supervised Object Detection with Detection Transformers
Viaarxiv icon

Learning Structure-Guided Diffusion Model for 2D Human Pose Estimation

Add code
Bookmark button
Alert button
Jun 29, 2023
Zhongwei Qiu, Qiansheng Yang, Jian Wang, Xiyu Wang, Chang Xu, Dongmei Fu, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang

Figure 1 for Learning Structure-Guided Diffusion Model for 2D Human Pose Estimation
Figure 2 for Learning Structure-Guided Diffusion Model for 2D Human Pose Estimation
Figure 3 for Learning Structure-Guided Diffusion Model for 2D Human Pose Estimation
Figure 4 for Learning Structure-Guided Diffusion Model for 2D Human Pose Estimation
Viaarxiv icon

Vision Transformer with Attention Map Hallucination and FFN Compaction

Add code
Bookmark button
Alert button
Jun 19, 2023
Haiyang Xu, Zhichao Zhou, Dongliang He, Fu Li, Jingdong Wang

Viaarxiv icon

ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images

Add code
Bookmark button
Alert button
Jun 05, 2023
Wenwen Yu, Chengquan Zhang, Haoyu Cao, Wei Hua, Bohan Li, Huang Chen, Mingyu Liu, Mingrui Chen, Jianfeng Kuang, Mengjun Cheng, Yuning Du, Shikun Feng, Xiaoguang Hu, Pengyuan Lyu, Kun Yao, Yuechen Yu, Yuliang Liu, Wanxiang Che, Errui Ding, Cheng-Lin Liu, Jiebo Luo, Shuicheng Yan, Min Zhang, Dimosthenis Karatzas, Xing Sun, Jingdong Wang, Xiang Bai

Figure 1 for ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images
Figure 2 for ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images
Figure 3 for ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images
Figure 4 for ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images
Viaarxiv icon

Rethinking the Open-Loop Evaluation of End-to-End Autonomous Driving in nuScenes

Add code
Bookmark button
Alert button
May 17, 2023
Jiang-Tian Zhai, Ze Feng, Jinhao Du, Yongqiang Mao, Jiang-Jiang Liu, Zichang Tan, Yifu Zhang, Xiaoqing Ye, Jingdong Wang

Figure 1 for Rethinking the Open-Loop Evaluation of End-to-End Autonomous Driving in nuScenes
Figure 2 for Rethinking the Open-Loop Evaluation of End-to-End Autonomous Driving in nuScenes
Figure 3 for Rethinking the Open-Loop Evaluation of End-to-End Autonomous Driving in nuScenes
Figure 4 for Rethinking the Open-Loop Evaluation of End-to-End Autonomous Driving in nuScenes
Viaarxiv icon

Multi-Modal 3D Object Detection by Box Matching

Add code
Bookmark button
Alert button
May 12, 2023
Zhe Liu, Xiaoqing Ye, Zhikang Zou, Xinwei He, Xiao Tan, Errui Ding, Jingdong Wang, Xiang Bai

Figure 1 for Multi-Modal 3D Object Detection by Box Matching
Figure 2 for Multi-Modal 3D Object Detection by Box Matching
Figure 3 for Multi-Modal 3D Object Detection by Box Matching
Figure 4 for Multi-Modal 3D Object Detection by Box Matching
Viaarxiv icon

StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator

Add code
Bookmark button
Alert button
May 09, 2023
Jiazhi Guan, Zhanwang Zhang, Hang Zhou, Tianshu Hu, Kaisiyuan Wang, Dongliang He, Haocheng Feng, Jingtuo Liu, Errui Ding, Ziwei Liu, Jingdong Wang

Figure 1 for StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator
Figure 2 for StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator
Figure 3 for StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator
Figure 4 for StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator
Viaarxiv icon

Exploring Effective Factors for Improving Visual In-Context Learning

Add code
Bookmark button
Alert button
Apr 10, 2023
Yanpeng Sun, Qiang Chen, Jian Wang, Jingdong Wang, Zechao Li

Figure 1 for Exploring Effective Factors for Improving Visual In-Context Learning
Figure 2 for Exploring Effective Factors for Improving Visual In-Context Learning
Figure 3 for Exploring Effective Factors for Improving Visual In-Context Learning
Figure 4 for Exploring Effective Factors for Improving Visual In-Context Learning
Viaarxiv icon

Task-Oriented Multi-Modal Mutual Leaning for Vision-Language Models

Add code
Bookmark button
Alert button
Mar 30, 2023
Sifan Long, Zhen Zhao, Junkun Yuan, Zichang Tan, Jiangjiang Liu, Luping Zhou, Shengsheng Wang, Jingdong Wang

Figure 1 for Task-Oriented Multi-Modal Mutual Leaning for Vision-Language Models
Figure 2 for Task-Oriented Multi-Modal Mutual Leaning for Vision-Language Models
Figure 3 for Task-Oriented Multi-Modal Mutual Leaning for Vision-Language Models
Figure 4 for Task-Oriented Multi-Modal Mutual Leaning for Vision-Language Models
Viaarxiv icon

ByteTrackV2: 2D and 3D Multi-Object Tracking by Associating Every Detection Box

Add code
Bookmark button
Alert button
Mar 27, 2023
Yifu Zhang, Xinggang Wang, Xiaoqing Ye, Wei Zhang, Jincheng Lu, Xiao Tan, Errui Ding, Peize Sun, Jingdong Wang

Figure 1 for ByteTrackV2: 2D and 3D Multi-Object Tracking by Associating Every Detection Box
Figure 2 for ByteTrackV2: 2D and 3D Multi-Object Tracking by Associating Every Detection Box
Figure 3 for ByteTrackV2: 2D and 3D Multi-Object Tracking by Associating Every Detection Box
Figure 4 for ByteTrackV2: 2D and 3D Multi-Object Tracking by Associating Every Detection Box
Viaarxiv icon