Picture for Rynson W. H. Lau

Rynson W. H. Lau

Hierarchical Cross-Modal Alignment for Open-Vocabulary 3D Object Detection

Add code
Mar 10, 2025
Viaarxiv icon

Do Multimodal Large Language Models See Like Humans?

Add code
Dec 12, 2024
Figure 1 for Do Multimodal Large Language Models See Like Humans?
Figure 2 for Do Multimodal Large Language Models See Like Humans?
Figure 3 for Do Multimodal Large Language Models See Like Humans?
Figure 4 for Do Multimodal Large Language Models See Like Humans?
Viaarxiv icon

Revisiting the Integration of Convolution and Attention for Vision Backbone

Add code
Nov 21, 2024
Viaarxiv icon

LuSh-NeRF: Lighting up and Sharpening NeRFs for Low-light Scenes

Add code
Nov 11, 2024
Figure 1 for LuSh-NeRF: Lighting up and Sharpening NeRFs for Low-light Scenes
Figure 2 for LuSh-NeRF: Lighting up and Sharpening NeRFs for Low-light Scenes
Figure 3 for LuSh-NeRF: Lighting up and Sharpening NeRFs for Low-light Scenes
Figure 4 for LuSh-NeRF: Lighting up and Sharpening NeRFs for Low-light Scenes
Viaarxiv icon

Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension

Add code
Oct 02, 2024
Figure 1 for Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension
Figure 2 for Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension
Figure 3 for Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension
Figure 4 for Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension
Viaarxiv icon

Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

Add code
Sep 17, 2024
Viaarxiv icon

OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding

Add code
Aug 20, 2024
Figure 1 for OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding
Figure 2 for OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding
Figure 3 for OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding
Figure 4 for OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding
Viaarxiv icon

MDeRainNet: An Efficient Neural Network for Rain Streak Removal from Macro-pixel Images

Add code
Jun 15, 2024
Figure 1 for MDeRainNet: An Efficient Neural Network for Rain Streak Removal from Macro-pixel Images
Figure 2 for MDeRainNet: An Efficient Neural Network for Rain Streak Removal from Macro-pixel Images
Figure 3 for MDeRainNet: An Efficient Neural Network for Rain Streak Removal from Macro-pixel Images
Figure 4 for MDeRainNet: An Efficient Neural Network for Rain Streak Removal from Macro-pixel Images
Viaarxiv icon

DreamPhysics: Learning Physical Properties of Dynamic 3D Gaussians with Video Diffusion Priors

Add code
Jun 03, 2024
Viaarxiv icon

Color Shift Estimation-and-Correction for Image Enhancement

Add code
May 29, 2024
Figure 1 for Color Shift Estimation-and-Correction for Image Enhancement
Figure 2 for Color Shift Estimation-and-Correction for Image Enhancement
Figure 3 for Color Shift Estimation-and-Correction for Image Enhancement
Figure 4 for Color Shift Estimation-and-Correction for Image Enhancement
Viaarxiv icon