Picture for Shaofei Huang

Shaofei Huang

From Incomplete Architecture to Quantified Risk: Multimodal LLM-Driven Security Assessment for Cyber-Physical Systems

Add code
Apr 07, 2026
Viaarxiv icon

Pretrain-then-Adapt: Uncertainty-Aware Test-Time Adaptation for Text-based Person Search

Add code
Apr 07, 2026
Viaarxiv icon

From Instruction to Event: Sound-Triggered Mobile Manipulation

Add code
Jan 29, 2026
Viaarxiv icon

DOMR: Establishing Cross-View Segmentation via Dense Object Matching

Add code
Aug 06, 2025
Viaarxiv icon

LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding

Add code
Jan 14, 2025
Viaarxiv icon

Anchor3DLane++: 3D Lane Detection via Sample-Adaptive Sparse 3D Anchor Regression

Add code
Dec 22, 2024
Viaarxiv icon

FreeEdit: Mask-free Reference-based Image Editing with Multi-modal Instruction

Add code
Sep 26, 2024
Viaarxiv icon

Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation

Add code
Aug 28, 2024
Figure 1 for Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
Figure 2 for Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
Figure 3 for Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
Figure 4 for Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
Viaarxiv icon

Mask-Enhanced Segment Anything Model for Tumor Lesion Semantic Segmentation

Add code
Mar 09, 2024
Viaarxiv icon

Transferring CLIP's Knowledge into Zero-Shot Point Cloud Semantic Segmentation

Add code
Dec 12, 2023
Figure 1 for Transferring CLIP's Knowledge into Zero-Shot Point Cloud Semantic Segmentation
Figure 2 for Transferring CLIP's Knowledge into Zero-Shot Point Cloud Semantic Segmentation
Figure 3 for Transferring CLIP's Knowledge into Zero-Shot Point Cloud Semantic Segmentation
Figure 4 for Transferring CLIP's Knowledge into Zero-Shot Point Cloud Semantic Segmentation
Viaarxiv icon