Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Topic:Point Cloud Registration

SGOR: Outlier Removal by Leveraging Semantic and Geometric Information for Robust Point Cloud Registration

Jul 08, 2024

Guiyu Zhao, Zhentao Guo, Hongbin Ma

Abstract:In this paper, we introduce a new outlier removal method that fully leverages geometric and semantic information, to achieve robust registration. Current semantic-based registration methods only use semantics for point-to-point or instance semantic correspondence generation, which has two problems. First, these methods are highly dependent on the correctness of semantics. They perform poorly in scenarios with incorrect semantics and sparse semantics. Second, the use of semantics is limited only to the correspondence generation, resulting in bad performance in the weak geometry scene. To solve these problems, on the one hand, we propose secondary ground segmentation and loose semantic consistency based on regional voting. It improves the robustness to semantic correctness by reducing the dependence on single-point semantics. On the other hand, we propose semantic-geometric consistency for outlier removal, which makes full use of semantic information and significantly improves the quality of correspondences. In addition, a two-stage hypothesis verification is proposed, which solves the problem of incorrect transformation selection in the weak geometry scene. In the outdoor dataset, our method demonstrates superior performance, boosting a 22.5 percentage points improvement in registration recall and achieving better robustness under various conditions. Our code is available.

* Accepted by IROS 2024

Via

Access Paper or Ask Questions

Deep Learning-based Point Cloud Registration for Augmented Reality-guided Surgery

May 06, 2024

Maximilian Weber, Daniel Wild, Jens Kleesiek, Jan Egger, Christina Gsaxner

Abstract:Point cloud registration aligns 3D point clouds using spatial transformations. It is an important task in computer vision, with applications in areas such as augmented reality (AR) and medical imaging. This work explores the intersection of two research trends: the integration of AR into image-guided surgery and the use of deep learning for point cloud registration. The main objective is to evaluate the feasibility of applying deep learning-based point cloud registration methods for image-to-patient registration in augmented reality-guided surgery. We created a dataset of point clouds from medical imaging and corresponding point clouds captured with a popular AR device, the HoloLens 2. We evaluate three well-established deep learning models in registering these data pairs. While we find that some deep learning methods show promise, we show that a conventional registration pipeline still outperforms them on our challenging dataset.

* 5 pages, 4 figures; accepted at IEEE ISBI 2024

Via

Access Paper or Ask Questions

RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration

May 13, 2024

Congjia Chen, Xiaoyu Jia, Yanhong Zheng, Yufu Qu

Figure 1 for RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration

Figure 2 for RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration

Figure 3 for RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration

Figure 4 for RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration

Abstract:Point cloud registration is a fundamental task for estimating rigid transformations between point clouds. Previous studies have used geometric information for extracting features, matching and estimating transformation. Recently, owing to the advancement of RGB-D sensors, researchers have attempted to utilize visual information to improve registration performance. However, these studies focused on extracting distinctive features by deep feature fusion, which cannot effectively solve the negative effects of each feature's weakness, and cannot sufficiently leverage the valid information. In this paper, we propose a new feature combination framework, which applies a looser but more effective fusion and can achieve better performance. An explicit filter based on transformation consistency is designed for the combination framework, which can overcome each feature's weakness. And an adaptive threshold determined by the error distribution is proposed to extract more valid information from the two types of features. Owing to the distinctive design, our proposed framework can estimate more accurate correspondences and is applicable to both hand-crafted and learning-based feature descriptors. Experiments on ScanNet show that our method achieves a state-of-the-art performance and the rotation accuracy of 99.1%.

Via

Access Paper or Ask Questions

GaussReg: Fast 3D Registration with Gaussian Splatting

Jul 07, 2024

Jiahao Chang, Yinglin Xu, Yihao Li, Yuantao Chen, Xiaoguang Han

Figure 1 for GaussReg: Fast 3D Registration with Gaussian Splatting

Figure 2 for GaussReg: Fast 3D Registration with Gaussian Splatting

Figure 3 for GaussReg: Fast 3D Registration with Gaussian Splatting

Figure 4 for GaussReg: Fast 3D Registration with Gaussian Splatting

Abstract:Point cloud registration is a fundamental problem for large-scale 3D scene scanning and reconstruction. With the help of deep learning, registration methods have evolved significantly, reaching a nearly-mature stage. As the introduction of Neural Radiance Fields (NeRF), it has become the most popular 3D scene representation as its powerful view synthesis capabilities. Regarding NeRF representation, its registration is also required for large-scale scene reconstruction. However, this topic extremly lacks exploration. This is due to the inherent challenge to model the geometric relationship among two scenes with implicit representations. The existing methods usually convert the implicit representation to explicit representation for further registration. Most recently, Gaussian Splatting (GS) is introduced, employing explicit 3D Gaussian. This method significantly enhances rendering speed while maintaining high rendering quality. Given two scenes with explicit GS representations, in this work, we explore the 3D registration task between them. To this end, we propose GaussReg, a novel coarse-to-fine framework, both fast and accurate. The coarse stage follows existing point cloud registration methods and estimates a rough alignment for point clouds from GS. We further newly present an image-guided fine registration approach, which renders images from GS to provide more detailed geometric information for precise alignment. To support comprehensive evaluation, we carefully build a scene-level dataset called ScanNet-GSReg with 1379 scenes obtained from the ScanNet dataset and collect an in-the-wild dataset called GSReg. Experimental results demonstrate our method achieves state-of-the-art performance on multiple datasets. Our GaussReg is 44 times faster than HLoc (SuperPoint as the feature extractor and SuperGlue as the matcher) with comparable accuracy.

* ECCV 2024

Via

Access Paper or Ask Questions

GeoNLF: Geometry guided Pose-Free Neural LiDAR Fields

Jul 08, 2024

Weiyi Xue, Zehan Zheng, Fan Lu, Haiyun Wei, Guang Chen, Changjun Jiang

Abstract:Although recent efforts have extended Neural Radiance Fields (NeRF) into LiDAR point cloud synthesis, the majority of existing works exhibit a strong dependence on precomputed poses. However, point cloud registration methods struggle to achieve precise global pose estimation, whereas previous pose-free NeRFs overlook geometric consistency in global reconstruction. In light of this, we explore the geometric insights of point clouds, which provide explicit registration priors for reconstruction. Based on this, we propose Geometry guided Neural LiDAR Fields(GeoNLF), a hybrid framework performing alternately global neural reconstruction and pure geometric pose optimization. Furthermore, NeRFs tend to overfit individual frames and easily get stuck in local minima under sparse-view inputs. To tackle this issue, we develop a selective-reweighting strategy and introduce geometric constraints for robust optimization. Extensive experiments on NuScenes and KITTI-360 datasets demonstrate the superiority of GeoNLF in both novel view synthesis and multi-view registration of low-frequency large-scale point clouds.

Via

Access Paper or Ask Questions

Deep-PE: A Learning-Based Pose Evaluator for Point Cloud Registration

May 25, 2024

Junjie Gao, Chongjian Wang, Zhongjun Ding, Shuangmin Chen, Shiqing Xin, Changhe Tu, Wenping Wang

Figure 1 for Deep-PE: A Learning-Based Pose Evaluator for Point Cloud Registration

Figure 2 for Deep-PE: A Learning-Based Pose Evaluator for Point Cloud Registration

Figure 3 for Deep-PE: A Learning-Based Pose Evaluator for Point Cloud Registration

Figure 4 for Deep-PE: A Learning-Based Pose Evaluator for Point Cloud Registration

Abstract:In the realm of point cloud registration, the most prevalent pose evaluation approaches are statistics-based, identifying the optimal transformation by maximizing the number of consistent correspondences. However, registration recall decreases significantly when point clouds exhibit a low overlap rate, despite efforts in designing feature descriptors and establishing correspondences. In this paper, we introduce Deep-PE, a lightweight, learning-based pose evaluator designed to enhance the accuracy of pose selection, especially in challenging point cloud scenarios with low overlap. Our network incorporates a Pose-Aware Attention (PAA) module to simulate and learn the alignment status of point clouds under various candidate poses, alongside a Pose Confidence Prediction (PCP) module that predicts the likelihood of successful registration. These two modules facilitate the learning of both local and global alignment priors. Extensive tests across multiple benchmarks confirm the effectiveness of Deep-PE. Notably, on 3DLoMatch with a low overlap rate, Deep-PE significantly outperforms state-of-the-art methods by at least 8% and 11% in registration recall under handcrafted FPFH and learning-based FCGF descriptors, respectively. To the best of our knowledge, this is the first study to utilize deep learning to select the optimal pose without the explicit need for input correspondences.

* 22 pages, 16 figures

Via

Access Paper or Ask Questions

GlobalPointer: Large-Scale Plane Adjustment with Bi-Convex Relaxation

Jul 19, 2024

Bangyan Liao, Zhenjun Zhao, Lu Chen, Haoang Li, Daniel Cremers, Peidong Liu

Figure 1 for GlobalPointer: Large-Scale Plane Adjustment with Bi-Convex Relaxation

Figure 2 for GlobalPointer: Large-Scale Plane Adjustment with Bi-Convex Relaxation

Figure 3 for GlobalPointer: Large-Scale Plane Adjustment with Bi-Convex Relaxation

Figure 4 for GlobalPointer: Large-Scale Plane Adjustment with Bi-Convex Relaxation

Abstract:Plane adjustment (PA) is crucial for many 3D applications, involving simultaneous pose estimation and plane recovery. Despite recent advancements, it remains a challenging problem in the realm of multi-view point cloud registration. Current state-of-the-art methods can achieve globally optimal convergence only with good initialization. Furthermore, their high time complexity renders them impractical for large-scale problems. To address these challenges, we first exploit a novel optimization strategy termed \textit{Bi-Convex Relaxation}, which decouples the original problem into two simpler sub-problems, reformulates each sub-problem using a convex relaxation technique, and alternately solves each one until the original problem converges. Building on this strategy, we propose two algorithmic variants for solving the plane adjustment problem, namely \textit{GlobalPointer} and \textit{GlobalPointer++}, based on point-to-plane and plane-to-plane errors, respectively. Extensive experiments on both synthetic and real datasets demonstrate that our method can perform large-scale plane adjustment with linear time complexity, larger convergence region, and robustness to poor initialization, while achieving similar accuracy as prior methods. The code is available at https://github.com/wu-cvgl/GlobalPointer.

* Accepted to ECCV 2024. The first two authors contributed equally to this work. Code: https://github.com/wu-cvgl/GlobalPointer

Via

Access Paper or Ask Questions

Benchmarking Classical and Learning-Based Multibeam Point Cloud Registration

May 10, 2024

Li Ling, Jun Zhang, Nils Bore, John Folkesson, Anna Wåhlin

Figure 1 for Benchmarking Classical and Learning-Based Multibeam Point Cloud Registration

Figure 2 for Benchmarking Classical and Learning-Based Multibeam Point Cloud Registration

Figure 3 for Benchmarking Classical and Learning-Based Multibeam Point Cloud Registration

Figure 4 for Benchmarking Classical and Learning-Based Multibeam Point Cloud Registration

Abstract:Deep learning has shown promising results for multiple 3D point cloud registration datasets. However, in the underwater domain, most registration of multibeam echo-sounder (MBES) point cloud data are still performed using classical methods in the iterative closest point (ICP) family. In this work, we curate and release DotsonEast Dataset, a semi-synthetic MBES registration dataset constructed from an autonomous underwater vehicle in West Antarctica. Using this dataset, we systematically benchmark the performance of 2 classical and 4 learning-based methods. The experimental results show that the learning-based methods work well for coarse alignment, and are better at recovering rough transforms consistently at high overlap (20-50%). In comparison, GICP (a variant of ICP) performs well for fine alignment and is better across all metrics at extremely low overlap (10%). To the best of our knowledge, this is the first work to benchmark both learning-based and classical registration methods on an AUV-based MBES dataset. To facilitate future research, both the code and data are made available online.

* Accepted at ICRA 2024 (IEEE International Conference on Robotics and Automation 2024)

Via

Access Paper or Ask Questions

SRIF: Semantic Shape Registration Empowered by Diffusion-based Image Morphing and Flow Estimation

Sep 18, 2024

Mingze Sun, Chen Guo, Puhua Jiang, Shiwei Mao, Yurun Chen, Ruqi Huang

Figure 1 for SRIF: Semantic Shape Registration Empowered by Diffusion-based Image Morphing and Flow Estimation

Figure 2 for SRIF: Semantic Shape Registration Empowered by Diffusion-based Image Morphing and Flow Estimation

Figure 3 for SRIF: Semantic Shape Registration Empowered by Diffusion-based Image Morphing and Flow Estimation

Figure 4 for SRIF: Semantic Shape Registration Empowered by Diffusion-based Image Morphing and Flow Estimation

Abstract:In this paper, we propose SRIF, a novel Semantic shape Registration framework based on diffusion-based Image morphing and Flow estimation. More concretely, given a pair of extrinsically aligned shapes, we first render them from multi-views, and then utilize an image interpolation framework based on diffusion models to generate sequences of intermediate images between them. The images are later fed into a dynamic 3D Gaussian splatting framework, with which we reconstruct and post-process for intermediate point clouds respecting the image morphing processing. In the end, tailored for the above, we propose a novel registration module to estimate continuous normalizing flow, which deforms source shape consistently towards the target, with intermediate point clouds as weak guidance. Our key insight is to leverage large vision models (LVMs) to associate shapes and therefore obtain much richer semantic information on the relationship between shapes than the ad-hoc feature extraction and alignment. As a consequence, SRIF achieves high-quality dense correspondences on challenging shape pairs, but also delivers smooth, semantically meaningful interpolation in between. Empirical evidence justifies the effectiveness and superiority of our method as well as specific design choices. The code is released at https://github.com/rqhuang88/SRIF.

Via

Access Paper or Ask Questions

CS-Net:Contribution-based Sampling Network for Point Cloud Simplification

Jan 18, 2025

Tian Guo, Chen Chen, Hui Yuan, Xiaolong Mao, Raouf Hamzaoui, Junhui Hou

Figure 1 for CS-Net:Contribution-based Sampling Network for Point Cloud Simplification

Figure 2 for CS-Net:Contribution-based Sampling Network for Point Cloud Simplification

Figure 3 for CS-Net:Contribution-based Sampling Network for Point Cloud Simplification

Figure 4 for CS-Net:Contribution-based Sampling Network for Point Cloud Simplification

Abstract:Point cloud sampling plays a crucial role in reducing computation costs and storage requirements for various vision tasks. Traditional sampling methods, such as farthest point sampling, lack task-specific information and, as a result, cannot guarantee optimal performance in specific applications. Learning-based methods train a network to sample the point cloud for the targeted downstream task. However, they do not guarantee that the sampled points are the most relevant ones. Moreover, they may result in duplicate sampled points, which requires completion of the sampled point cloud through post-processing techniques. To address these limitations, we propose a contribution-based sampling network (CS-Net), where the sampling operation is formulated as a Top-k operation. To ensure that the network can be trained in an end-to-end way using gradient descent algorithms, we use a differentiable approximation to the Top-k operation via entropy regularization of an optimal transport problem. Our network consists of a feature embedding module, a cascade attention module, and a contribution scoring module. The feature embedding module includes a specifically designed spatial pooling layer to reduce parameters while preserving important features. The cascade attention module combines the outputs of three skip connected offset attention layers to emphasize the attractive features and suppress less important ones. The contribution scoring module generates a contribution score for each point and guides the sampling process to prioritize the most important ones. Experiments on the ModelNet40 and PU147 showed that CS-Net achieved state-of-the-art performance in two semantic-based downstream tasks (classification and registration) and two reconstruction-based tasks (compression and surface reconstruction).

Via

Access Paper or Ask Questions

Topic:Point Cloud Registration

Papers and Code