Xizhou Zhu

BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision

Nov 18, 2022
Chenyu Yang, Yuntao Chen, Hao Tian, Chenxin Tao, Xizhou Zhu, Zhaoxiang Zhang, Gao Huang, Hongyang Li, Yu Qiao, Lewei Lu, Jie Zhou, Jifeng Dai

Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks

Nov 17, 2022
Hao Li, Jinguo Zhu, Xiaohu Jiang, Xizhou Zhu, Hongsheng Li, Chun Yuan, Xiaohua Wang, Yu Qiao, Xiaogang Wang, Wenhai Wang, Jifeng Dai

InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Nov 13, 2022
Wenhai Wang, Jifeng Dai, Zhe Chen, Zhenhang Huang, Zhiqi Li, Xizhou Zhu, Xiaowei Hu, Tong Lu, Lewei Lu, Hongsheng Li, Xiaogang Wang, Yu Qiao

Demystify Transformers & Convolutions in Modern Image Deep Networks

Nov 10, 2022
Jifeng Dai, Min Shi, Weiyun Wang, Sitong Wu, Linjie Xing, Wenhai Wang, Xizhou Zhu, Lewei Lu, Jie Zhou, Xiaogang Wang, Yu Qiao, Xiaowei Hu

Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe

Sep 12, 2022
Hongyang Li, Chonghao Sima, Jifeng Dai, Wenhai Wang, Lewei Lu, Huijie Wang, Enze Xie, Zhiqi Li, Hanming Deng, Hao Tian, Xizhou Zhu, Li Chen, Yulu Gao, Xiangwei Geng, Jia Zeng, Yang Li, Jiazhi Yang, Xiaosong Jia, Bohan Yu, Yu Qiao, Dahua Lin, Si Liu, Junchi Yan, Jianping Shi, Ping Luo

Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEs

Jun 09, 2022
Jinguo Zhu, Xizhou Zhu, Wenhai Wang, Xiaohua Wang, Hongsheng Li, Xiaogang Wang, Jifeng Dai

Siamese Image Modeling for Self-Supervised Vision Representation Learning

Jun 02, 2022
Chenxin Tao, Xizhou Zhu, Gao Huang, Yu Qiao, Xiaogang Wang, Jifeng Dai

DeciWatch: A Simple Baseline for 10x Efficient 2D and 3D Pose Estimation

Mar 16, 2022
Ailing Zeng, Xuan Ju, Lei Yang, Ruiyuan Gao, Xizhou Zhu, Bo Dai, Qiang Xu

Exploring the Equivalence of Siamese Self-Supervised Learning via A Unified Gradient Framework

Dec 09, 2021
Chenxin Tao, Honghui Wang, Xizhou Zhu, Jiahua Dong, Shiji Song, Gao Huang, Jifeng Dai
