Picture for Shaofei Huang

Shaofei Huang

DOMR: Establishing Cross-View Segmentation via Dense Object Matching

Add code
Aug 06, 2025
Viaarxiv icon

LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding

Add code
Jan 14, 2025
Viaarxiv icon

Anchor3DLane++: 3D Lane Detection via Sample-Adaptive Sparse 3D Anchor Regression

Add code
Dec 22, 2024
Viaarxiv icon

FreeEdit: Mask-free Reference-based Image Editing with Multi-modal Instruction

Add code
Sep 26, 2024
Viaarxiv icon

Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation

Add code
Aug 28, 2024
Figure 1 for Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
Figure 2 for Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
Figure 3 for Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
Figure 4 for Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
Viaarxiv icon

Mask-Enhanced Segment Anything Model for Tumor Lesion Semantic Segmentation

Add code
Mar 09, 2024
Viaarxiv icon

Transferring CLIP's Knowledge into Zero-Shot Point Cloud Semantic Segmentation

Add code
Dec 12, 2023
Figure 1 for Transferring CLIP's Knowledge into Zero-Shot Point Cloud Semantic Segmentation
Figure 2 for Transferring CLIP's Knowledge into Zero-Shot Point Cloud Semantic Segmentation
Figure 3 for Transferring CLIP's Knowledge into Zero-Shot Point Cloud Semantic Segmentation
Figure 4 for Transferring CLIP's Knowledge into Zero-Shot Point Cloud Semantic Segmentation
Viaarxiv icon

Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training

Add code
Dec 04, 2023
Figure 1 for Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training
Figure 2 for Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training
Figure 3 for Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training
Figure 4 for Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training
Viaarxiv icon

Discovering Sounding Objects by Audio Queries for Audio Visual Segmentation

Add code
Sep 18, 2023
Figure 1 for Discovering Sounding Objects by Audio Queries for Audio Visual Segmentation
Figure 2 for Discovering Sounding Objects by Audio Queries for Audio Visual Segmentation
Figure 3 for Discovering Sounding Objects by Audio Queries for Audio Visual Segmentation
Figure 4 for Discovering Sounding Objects by Audio Queries for Audio Visual Segmentation
Viaarxiv icon

Anchor3DLane: Learning to Regress 3D Anchors for Monocular 3D Lane Detection

Add code
Jan 06, 2023
Viaarxiv icon