3d Semantic Segmentation


3D Semantic Segmentation is a computer vision task that involves dividing a 3D point cloud or 3D mesh into semantically meaningful parts or regions. The goal of 3D semantic segmentation is to identify and label different objects and parts within a 3D scene, which can be used for applications such as robotics, autonomous driving, and augmented reality.

Technical Report for ICRA 2025 GOOSE 3D Semantic Segmentation Challenge: Adaptive Point Cloud Understanding for Heterogeneous Robotic Systems

Add code
Jun 08, 2025
Viaarxiv icon

LogoSP: Local-global Grouping of Superpoints for Unsupervised Semantic Segmentation of 3D Point Clouds

Add code
Jun 09, 2025
Viaarxiv icon

Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion

Add code
Jul 08, 2025
Figure 1 for Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion
Figure 2 for Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion
Figure 3 for Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion
Figure 4 for Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion
Viaarxiv icon

TSDASeg: A Two-Stage Model with Direct Alignment for Interactive Point Cloud Segmentation

Add code
Jun 26, 2025
Viaarxiv icon

A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects

Add code
Jun 24, 2025
Viaarxiv icon

SURPRISE3D: A Dataset for Spatial Understanding and Reasoning in Complex 3D Scenes

Add code
Jul 10, 2025
Viaarxiv icon

seg_3D_by_PC2D: Multi-View Projection for Domain Generalization and Adaptation in 3D Semantic Segmentation

Add code
May 21, 2025
Viaarxiv icon

LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion

Add code
Jul 03, 2025
Figure 1 for LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion
Figure 2 for LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion
Figure 3 for LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion
Figure 4 for LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion
Viaarxiv icon

GS4: Generalizable Sparse Splatting Semantic SLAM

Add code
Jun 06, 2025
Viaarxiv icon

Enhancing Human-Robot Collaboration: A Sim2Real Domain Adaptation Algorithm for Point Cloud Segmentation in Industrial Environments

Add code
Jun 11, 2025
Viaarxiv icon