Picture for Guolei Sun

Guolei Sun

equal contribution

HiM2SAM: Enhancing SAM2 with Hierarchical Motion Estimation and Memory Optimization towards Long-term Tracking

Add code
Jul 10, 2025
Figure 1 for HiM2SAM: Enhancing SAM2 with Hierarchical Motion Estimation and Memory Optimization towards Long-term Tracking
Figure 2 for HiM2SAM: Enhancing SAM2 with Hierarchical Motion Estimation and Memory Optimization towards Long-term Tracking
Figure 3 for HiM2SAM: Enhancing SAM2 with Hierarchical Motion Estimation and Memory Optimization towards Long-term Tracking
Figure 4 for HiM2SAM: Enhancing SAM2 with Hierarchical Motion Estimation and Memory Optimization towards Long-term Tracking
Viaarxiv icon

A Comprehensive Survey on Video Scene Parsing:Advances, Challenges, and Prospects

Add code
Jun 16, 2025
Viaarxiv icon

CamSAM2: Segment Anything Accurately in Camouflaged Videos

Add code
Mar 26, 2025
Viaarxiv icon

Exploiting Temporal State Space Sharing for Video Semantic Segmentation

Add code
Mar 26, 2025
Viaarxiv icon

Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model

Add code
Mar 20, 2025
Figure 1 for Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model
Figure 2 for Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model
Figure 3 for Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model
Figure 4 for Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model
Viaarxiv icon

SAM-Aware Graph Prompt Reasoning Network for Cross-Domain Few-Shot Segmentation

Add code
Dec 31, 2024
Figure 1 for SAM-Aware Graph Prompt Reasoning Network for Cross-Domain Few-Shot Segmentation
Figure 2 for SAM-Aware Graph Prompt Reasoning Network for Cross-Domain Few-Shot Segmentation
Figure 3 for SAM-Aware Graph Prompt Reasoning Network for Cross-Domain Few-Shot Segmentation
Figure 4 for SAM-Aware Graph Prompt Reasoning Network for Cross-Domain Few-Shot Segmentation
Viaarxiv icon

Towards Open-Vocabulary Video Semantic Segmentation

Add code
Dec 12, 2024
Figure 1 for Towards Open-Vocabulary Video Semantic Segmentation
Figure 2 for Towards Open-Vocabulary Video Semantic Segmentation
Figure 3 for Towards Open-Vocabulary Video Semantic Segmentation
Figure 4 for Towards Open-Vocabulary Video Semantic Segmentation
Viaarxiv icon

Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation

Add code
Oct 29, 2024
Figure 1 for Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation
Figure 2 for Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation
Figure 3 for Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation
Figure 4 for Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation
Viaarxiv icon

When SAM2 Meets Video Camouflaged Object Segmentation: A Comprehensive Evaluation and Adaptation

Add code
Sep 27, 2024
Figure 1 for When SAM2 Meets Video Camouflaged Object Segmentation: A Comprehensive Evaluation and Adaptation
Figure 2 for When SAM2 Meets Video Camouflaged Object Segmentation: A Comprehensive Evaluation and Adaptation
Figure 3 for When SAM2 Meets Video Camouflaged Object Segmentation: A Comprehensive Evaluation and Adaptation
Figure 4 for When SAM2 Meets Video Camouflaged Object Segmentation: A Comprehensive Evaluation and Adaptation
Viaarxiv icon

Towards a Generalist and Blind RGB-X Tracker

Add code
May 28, 2024
Viaarxiv icon