Jingdong Wang

GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding

Nov 20, 2023
Hao Li, Dingwen Zhang, Yalun Dai, Nian Liu, Lechao Cheng, Jingfeng Li, Jingdong Wang, Junwei Han

Via arXiv

Disentangled Representation Learning with Transmitted Information Bottleneck

Nov 03, 2023
Zhuohang Dang, Minnan Luo, Chengyou Jia, Guang Dai, Jihong Wang, Xiaojun Chang, Jingdong Wang, Qinghua Zheng

Via arXiv

HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception

Oct 31, 2023
Junkun Yuan, Xinyu Zhang, Hao Zhou, Jian Wang, Zhongwei Qiu, Zhiyin Shao, Shaofeng Zhang, Sifan Long, Kun Kuang, Kun Yao, Junyu Han, Errui Ding, Lanfen Lin, Fei Wu, Jingdong Wang

Via arXiv

Leveraging Vision-Centric Multi-Modal Expertise for 3D Object Detection

Oct 24, 2023
Linyan Huang, Zhiqi Li, Chonghao Sima, Wenhai Wang, Jingdong Wang, Yu Qiao, Hongyang Li

Via arXiv

Accelerating Vision Transformers Based on Heterogeneous Attention Patterns

Oct 11, 2023
Deli Yu, Teng Xi, Jianwei Li, Baopu Li, Gang Zhang, Haocheng Feng, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang

Via arXiv

Forward Flow for Novel View Synthesis of Dynamic Scenes

Sep 29, 2023
Xiang Guo, Jiadai Sun, Yuchao Dai, Guanying Chen, Xiaoqing Ye, Xiao Tan, Errui Ding, Yumeng Zhang, Jingdong Wang

Via arXiv

GridFormer: Towards Accurate Table Structure Recognition via Grid Prediction

Sep 26, 2023
Pengyuan Lyu, Weihong Ma, Hongyi Wang, Yuechen Yu, Chengquan Zhang, Kun Yao, Yang Xue, Jingdong Wang

Via arXiv

PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement

Sep 20, 2023
Chengyou Jia, Minnan Luo, Zhuohang Dang, Guang Dai, Xiaojun Chang, Jingdong Wang, Qinghua Zheng

Via arXiv

Unified Frequency-Assisted Transformer Framework for Detecting and Grounding Multi-Modal Manipulation

Sep 18, 2023
Huan Liu, Zichang Tan, Qiang Chen, Yunchao Wei, Yao Zhao, Jingdong Wang

Via arXiv

VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation

Sep 07, 2023
Xin Li, Wenqing Chu, Ye Wu, Weihang Yuan, Fanglong Liu, Qi Zhang, Fu Li, Haocheng Feng, Errui Ding, Jingdong Wang

Via arXiv