Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yongtian Wang

Robust Point Cloud Registration via Geometric Overlapping Guided Rotation Search

Aug 24, 2025

Zhao Zheng, Jingfan Fan, Long Shao, Hong Song, Danni Ai, Tianyu Fu, Deqiang Xiao, Yongtian Wang, Jian Yang

Abstract:Point cloud registration based on correspondences computes the rigid transformation that maximizes the number of inliers constrained within the noise threshold. Current state-of-the-art (SOTA) methods employing spatial compatibility graphs or branch-and-bound (BnB) search mainly focus on registration under high outlier ratios. However, graph-based methods require at least quadratic space and time complexity for graph construction, while multi-stage BnB search methods often suffer from inaccuracy due to local optima between decomposed stages. This paper proposes a geometric maximum overlapping registration framework via rotation-only BnB search. The rigid transformation is decomposed using Chasles' theorem into a translation along rotation axis and a 2D rigid transformation. The optimal rotation axis and angle are searched via BnB, with residual parameters formulated as range maximum query (RMQ) problems. Firstly, the top-k candidate rotation axes are searched within a hemisphere parameterized by cube mapping, and the translation along each axis is estimated through interval stabbing of the correspondences projected onto that axis. Secondly, the 2D registration is relaxed to 1D rotation angle search with 2D RMQ of geometric overlapping for axis-aligned rectangles, which is solved deterministically in polynomial time using sweep line algorithm with segment tree. Experimental results on 3DMatch, 3DLoMatch, and KITTI datasets demonstrate superior accuracy and efficiency over SOTA methods, while the time complexity is polynomial and the space complexity increases linearly with the number of points, even in the worst case.

Via

Access Paper or Ask Questions

Combining deep learning and crowdsourcing geo-images to predict housing quality in rural China

Aug 15, 2022

Weipan Xu, Yu Gu, Yifan Chen, Yongtian Wang, Weihuan Deng, Xun Li

Figure 1 for Combining deep learning and crowdsourcing geo-images to predict housing quality in rural China

Figure 2 for Combining deep learning and crowdsourcing geo-images to predict housing quality in rural China

Figure 3 for Combining deep learning and crowdsourcing geo-images to predict housing quality in rural China

Figure 4 for Combining deep learning and crowdsourcing geo-images to predict housing quality in rural China

Abstract:Housing quality is an essential proxy for regional wealth, security and health. Understanding the distribution of housing quality is crucial for unveiling rural development status and providing political proposals. However,present rural house quality data highly depends on a top-down, time-consuming survey at the national or provincial level but fails to unpack the housing quality at the village level. To fill the gap between accurately depicting rural housing quality conditions and deficient data,we collect massive rural images and invite users to assess their housing quality at scale. Furthermore, a deep learning framework is proposed to automatically and efficiently predict housing quality based on crowd-sourcing rural images.

Via

Access Paper or Ask Questions

Deep Surface Normal Estimation with Hierarchical RGB-D Fusion

Apr 06, 2019

Jin Zeng, Yanfeng Tong, Yunmu Huang, Qiong Yan, Wenxiu Sun, Jing Chen, Yongtian Wang

Figure 1 for Deep Surface Normal Estimation with Hierarchical RGB-D Fusion

Figure 2 for Deep Surface Normal Estimation with Hierarchical RGB-D Fusion

Figure 3 for Deep Surface Normal Estimation with Hierarchical RGB-D Fusion

Figure 4 for Deep Surface Normal Estimation with Hierarchical RGB-D Fusion

Abstract:The growing availability of commodity RGB-D cameras has boosted the applications in the field of scene understanding. However, as a fundamental scene understanding task, surface normal estimation from RGB-D data lacks thorough investigation. In this paper, a hierarchical fusion network with adaptive feature re-weighting is proposed for surface normal estimation from a single RGB-D image. Specifically, the features from color image and depth are successively integrated at multiple scales to ensure global surface smoothness while preserving visually salient details. Meanwhile, the depth features are re-weighted with a confidence map estimated from depth before merging into the color branch to avoid artifacts caused by input depth corruption. Additionally, a hybrid multi-scale loss function is designed to learn accurate normal estimation given noisy ground-truth dataset. Extensive experimental results validate the effectiveness of the fusion strategy and the loss design, outperforming state-of-the-art normal estimation schemes.

Via

Access Paper or Ask Questions

Exploring Stereovision-Based 3-D Scene Reconstruction for Augmented Reality

Feb 17, 2019

Guang-Yu Nie, Yun Liu, Cong Wang, Yue Liu, Yongtian Wang

Figure 1 for Exploring Stereovision-Based 3-D Scene Reconstruction for Augmented Reality

Figure 2 for Exploring Stereovision-Based 3-D Scene Reconstruction for Augmented Reality

Figure 3 for Exploring Stereovision-Based 3-D Scene Reconstruction for Augmented Reality

Figure 4 for Exploring Stereovision-Based 3-D Scene Reconstruction for Augmented Reality

Abstract:Three-dimensional (3-D) scene reconstruction is one of the key techniques in Augmented Reality (AR), which is related to the integration of image processing and display systems of complex information. Stereo matching is a computer vision based approach for 3-D scene reconstruction. In this paper, we explore an improved stereo matching network, SLED-Net, in which a Single Long Encoder-Decoder is proposed to replace the stacked hourglass network in PSM-Net for better contextual information learning. We compare SLED-Net to state-of-the-art methods recently published, and demonstrate its superior performance on Scene Flow and KITTI2015 test sets.

* To be published in IEEE VR2019 Conference as a Poster

Via

Access Paper or Ask Questions

Greedy Graph Searching for Vascular Tracking in Angiographic Image Sequences

May 25, 2018

Huihui Fang, Jian Yang, Jianjun Zhu, Danni Ai, Yong Huang, Yurong Jiang, Hong Song, Yongtian Wang

Figure 1 for Greedy Graph Searching for Vascular Tracking in Angiographic Image Sequences

Figure 2 for Greedy Graph Searching for Vascular Tracking in Angiographic Image Sequences

Figure 3 for Greedy Graph Searching for Vascular Tracking in Angiographic Image Sequences

Figure 4 for Greedy Graph Searching for Vascular Tracking in Angiographic Image Sequences

Abstract:Vascular tracking of angiographic image sequences is one of the most clinically important tasks in the diagnostic assessment and interventional guidance of cardiac disease. However, this task can be challenging to accomplish because of unsatisfactory angiography image quality and complex vascular structures. Thus, this study proposed a new greedy graph search-based method for vascular tracking. Each vascular branch is separated from the vasculature and is tracked independently. Then, all branches are combined using topology optimization, thereby resulting in complete vasculature tracking. A gray-based image registration method was applied to determine the tracking range, and the deformation field between two consecutive frames was calculated. The vascular branch was described using a vascular centerline extraction method with multi-probability fusion-based topology optimization. We introduce an undirected acyclic graph establishment technique. A greedy search method was proposed to acquire all possible paths in the graph that might match the tracked vascular branch. The final tracking result was selected by branch matching using dynamic time warping with a DAISY descriptor. The solution to the problem reflected both the spatial and textural information between successive frames. Experimental results demonstrated that the proposed method was effective and robust for vascular tracking, attaining a F1 score of 0.89 on a single branch dataset and 0.88 on a vessel tree dataset. This approach provided a universal solution to address the problem of filamentary structure tracking.

* Submitted to Medical Physics; 30 pages, 11 figures

Via

Access Paper or Ask Questions