Picture for Wanli Ouyang

Wanli Ouyang

School of Electrical and Information Engineering, The University of Sydney, Australia

Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models

Add code
Dec 31, 2022
Viaarxiv icon

Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?

Add code
Dec 31, 2022
Viaarxiv icon

Ponder: Point Cloud Pre-training via Neural Rendering

Add code
Dec 31, 2022
Viaarxiv icon

MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling with Informative-Preserved Reconstruction and Self-Distilled Consistency

Add code
Dec 20, 2022
Viaarxiv icon

3D Point Cloud Pre-training with Knowledge Distillation from 2D Images

Add code
Dec 17, 2022
Viaarxiv icon

Frozen CLIP Model is An Efficient Point Cloud Backbone

Add code
Dec 09, 2022
Viaarxiv icon

GD-MAE: Generative Decoder for MAE Pre-training on LiDAR Point Clouds

Add code
Dec 07, 2022
Viaarxiv icon

ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency

Add code
Dec 02, 2022
Figure 1 for ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency
Figure 2 for ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency
Figure 3 for ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency
Figure 4 for ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency
Viaarxiv icon

Reconstructing Hand-Held Objects from Monocular Video

Add code
Nov 30, 2022
Viaarxiv icon

3D-QueryIS: A Query-based Framework for 3D Instance Segmentation

Add code
Nov 17, 2022
Viaarxiv icon