
Wanli Ouyang

Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models

Dec 31, 2022
Wenhao Wu, Xiaohan Wang, Haipeng Luo, Jingdong Wang, Yi Yang, Wanli Ouyang

Via arXiv

Ponder: Point Cloud Pre-training via Neural Rendering

Dec 31, 2022
Di Huang, Sida Peng, Tong He, Xiaowei Zhou, Wanli Ouyang

Via arXiv

MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling with Informative-Preserved Reconstruction and Self-Distilled Consistency

Dec 20, 2022
Mingye Xu, Mutian Xu, Tong He, Wanli Ouyang, Yali Wang, Xiaoguang Han, Yu Qiao

Via arXiv

3D Point Cloud Pre-training with Knowledge Distillation from 2D Images

Dec 17, 2022
Yuan Yao, Yuanhan Zhang, Zhenfei Yin, Jiebo Luo, Wanli Ouyang, Xiaoshui Huang

Via arXiv

Frozen CLIP Model is An Efficient Point Cloud Backbone

Dec 09, 2022
Xiaoshui Huang, Sheng Li, Wentao Qu, Tong He, Yifan Zuo, Wanli Ouyang

Via arXiv

GD-MAE: Generative Decoder for MAE Pre-training on LiDAR Point Clouds

Dec 07, 2022
Honghui Yang, Tong He, Jiaheng Liu, Hua Chen, Boxi Wu, Binbin Lin, Xiaofei He, Wanli Ouyang

Via arXiv

ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency

Dec 02, 2022
Chuming Li, Jie Liu, Yinmin Zhang, Yuhong Wei, Yazhe Niu, Yaodong Yang, Yu Liu, Wanli Ouyang

Via arXiv

Reconstructing Hand-Held Objects from Monocular Video

Nov 30, 2022
Di Huang, Xiaopeng Ji, Xingyi He, Jiaming Sun, Tong He, Qing Shuai, Wanli Ouyang, Xiaowei Zhou

Via arXiv

3D-QueryIS: A Query-based Framework for 3D Instance Segmentation

Nov 17, 2022
Jiaheng Liu, Tong He, Honghui Yang, Rui Su, Jiayi Tian, Junran Wu, Hongcheng Guo, Ke Xu, Wanli Ouyang

Via arXiv

Boosting Semi-Supervised 3D Object Detection with Semi-Sampling

Nov 15, 2022
Xiaopei Wu, Yang Zhao, Liang Peng, Hua Chen, Xiaoshui Huang, Binbin Lin, Haifeng Liu, Deng Cai, Wanli Ouyang

Via arXiv