Michael Kaess

TartanCalib: Iterative Wide-Angle Lens Calibration using Adaptive SubPixel Refinement of AprilTags

Oct 05, 2022
Bardienus P Duisterhof, Yaoyu Hu, Si Heng Teng, Michael Kaess, Sebastian Scherer

Wide-angle cameras are uniquely positioned for mobile robots, by virtue of the rich information they provide in a small, light, and cost-effective form factor. An accurate calibration of the intrinsics and extrinsics is a critical prerequisite for using the edge of a wide-angle lens for depth perception and odometry. Calibrating wide-angle lenses with current state-of-the-art techniques yields poor results due to extreme distortion at the edge, as most algorithms assume a lens with low to medium distortion closer to a pinhole projection. In this work we present our methodology for accurate wide-angle calibration. Our pipeline generates an intermediate model, and leverages it to iteratively improve feature detection and eventually the camera parameters. We test three key methods to utilize intermediate camera models: (1) undistorting the image into virtual pinhole cameras, (2) reprojecting the target into the image frame, and (3) adaptive subpixel refinement. Combining adaptive subpixel refinement and feature reprojection reduces reprojection errors by up to 26.59%, helps us detect up to 42.01% more features, and improves performance in the downstream task of dense depth mapping. Finally, TartanCalib is open-source and implemented in an easy-to-use calibration toolbox. We also provide a translation layer with other state-of-the-art works, which allows for regressing generic models with thousands of parameters or using a more robust solver. To this end, TartanCalib is the tool of choice for wide-angle calibration. Project website and code: http://tartancalib.com.

Acoustic Localization and Communication Using a MEMS Microphone for Low-cost and Low-power Bio-inspired Underwater Robots

Oct 03, 2022
Akshay Hinduja, Yunsik Ohm, Jiahe Liao, Carmel Majidi, Michael Kaess

Having accurate localization capabilities is one of the fundamental requirements of autonomous robots. For underwater vehicles, the choices for effective localization are limited due to limitations of GPS use in water and poor environmental visibility that makes camera-based methods ineffective. Popular inertial navigation methods for underwater localization using Doppler velocity log sensors, sonar, high-end inertial navigation systems, or acoustic positioning systems require bulky, expensive hardware that is incompatible with low-cost, bio-inspired underwater robots. In this paper, we introduce an approach for underwater robot localization inspired by GPS methods known as acoustic pseudoranging. Our method allows us to potentially localize multiple bio-inspired robots equipped with commonly available micro-electro-mechanical systems (MEMS) microphones. This is achieved through estimating the time difference of arrival of acoustic signals sent simultaneously through four speakers with a known constellation geometry. We also leverage the same acoustic framework to perform one-way communication with the robot to execute some primitive motions. To our knowledge, this is the first application of the approach for the on-board localization of small bio-inspired robots in water. Hardware schematics and the accompanying code are released to aid further development in the field.
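
The pseudoranging idea in the abstract above can be illustrated with a minimal sketch. Four speakers at known positions emit at the same instant, the microphone timestamps each arrival on its own unsynchronized clock, and the receiver solves for its position together with the unknown clock offset. The speaker layout, the nominal speed of sound, the function names, and the plain Gauss-Newton solver are all assumptions made for illustration; this is not the released code from the paper.

```python
import math

SPEED_OF_SOUND = 1480.0  # m/s, assumed nominal value for water


def solve_linear(a, b):
    """Solve the small dense system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def locate(speakers, arrival_times, guess, iterations=20):
    """Gauss-Newton on the pseudorange model rho_i = ||p - s_i|| + b.

    b is the clock offset expressed as a distance (offset times speed of sound).
    Returns the estimated position and the clock offset in seconds.
    """
    x, y, z = guess
    b = 0.0
    rho = [SPEED_OF_SOUND * t for t in arrival_times]
    for _ in range(iterations):
        jac, res = [], []
        for (sx, sy, sz), r in zip(speakers, rho):
            d = math.dist((x, y, z), (sx, sy, sz))
            # Partial derivatives of the predicted pseudorange w.r.t. (x, y, z, b)
            jac.append([(x - sx) / d, (y - sy) / d, (z - sz) / d, 1.0])
            res.append(r - (d + b))
        dx, dy, dz, db = solve_linear(jac, res)
        x, y, z, b = x + dx, y + dy, z + dz, b + db
    return (x, y, z), b / SPEED_OF_SOUND


# Hypothetical, noise-free setup: four non-coplanar speakers and a receiver
# whose clock is 2 ms off from the emission time.
speakers = [(0.0, 0.0, 0.0), (4.0, 0.0, 0.0), (0.0, 4.0, 0.0), (0.0, 0.0, 4.0)]
true_pos, clock_offset = (1.0, 1.5, 0.5), 0.002
times = [math.dist(true_pos, s) / SPEED_OF_SOUND + clock_offset for s in speakers]
pos, offset = locate(speakers, times, guess=(1.0, 1.0, 1.0))
```

Because only differences between arrival times carry geometric information, the clock offset is treated as a fourth unknown alongside the three position coordinates, which is why four speakers are the minimum in this formulation. With real, noisy audio the arrival times would first have to be estimated from the recorded signal, which this sketch does not attempt.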

Robust Incremental Smoothing and Mapping (riSAM)

Sep 28, 2022
Daniel McGann, John G. Rogers III, Michael Kaess

This paper presents a method for robust optimization for online incremental Simultaneous Localization and Mapping (SLAM). Due to the NP-Hardness of data association in the presence of perceptual aliasing, tractable (approximate) approaches to data association will produce erroneous measurements. We require SLAM back-ends that can converge to accurate solutions in the presence of outlier measurements while meeting online efficiency constraints. Existing robust SLAM methods either remain sensitive to outliers, become increasingly sensitive to initialization, or fail to provide online efficiency. We present the robust incremental Smoothing and Mapping (riSAM) algorithm, a robust back-end optimizer for incremental SLAM based on Graduated Non-Convexity. We demonstrate on benchmarking datasets that our algorithm achieves online efficiency, outperforms existing online approaches, and matches or improves the performance of existing offline methods.

* Under review for ICRA 2023
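
Graduated Non-Convexity, the idea riSAM builds on, can be sketched on a toy problem: robustly estimating a scalar from measurements that contain outliers. The sketch below is a batch formulation with a Geman-McClure style weight update, not the incremental riSAM algorithm; the inlier threshold `c`, the shrink factor, and the function name are assumptions chosen for illustration.

```python
def gnc_mean(measurements, c=1.0, shrink=1.4):
    """Robust scalar estimate via graduated non-convexity (batch toy version).

    mu starts large, which makes the robust cost close to plain least squares,
    and is shrunk toward 1, where the cost becomes the Geman-McClure kernel.
    """
    n = len(measurements)
    x = sum(measurements) / n  # non-robust least-squares initialization
    r_max_sq = max((z - x) ** 2 for z in measurements)
    mu = max(1.0, 2.0 * r_max_sq / c ** 2)
    while True:
        # Weight update for the current level of non-convexity
        weights = [(mu * c ** 2 / ((z - x) ** 2 + mu * c ** 2)) ** 2
                   for z in measurements]
        # Variable update: weighted least squares, closed form for a mean
        x = sum(w * z for w, z in zip(weights, measurements)) / sum(weights)
        if mu <= 1.0:
            break
        mu = max(1.0, mu / shrink)
    return x, weights


# Five inliers around 10 and two gross outliers (hypothetical data)
data = [9.8, 10.1, 10.0, 9.9, 10.2, 25.0, 30.0]
estimate, final_weights = gnc_mean(data)
```

In this toy run the plain mean starts at 15, and the estimate drifts toward the inlier cluster as `mu` shrinks and the outlier weights collapse. In a SLAM back-end the closed-form mean would be replaced by a nonlinear least-squares solve over the factor graph.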

Conditional GANs for Sonar Image Filtering with Applications to Underwater Occupancy Mapping

Sep 23, 2022
Tianxiang Lin, Akshay Hinduja, Mohamad Qadri, Michael Kaess

Underwater robots typically rely on acoustic sensors like sonar to perceive their surroundings. However, these sensors are often inundated with multiple sources and types of noise, which makes using raw data for any meaningful inference with features, objects, or boundary returns very difficult. While several conventional methods of dealing with noise exist, their success rates are unsatisfactory. This paper presents a novel application of conditional Generative Adversarial Networks (cGANs) to train a model to produce noise-free sonar images, outperforming several conventional filtering methods. Estimating free space is crucial for autonomous robots performing active exploration and mapping. Thus, we apply our approach to the task of underwater occupancy mapping and show superior free and occupied space inference when compared to conventional methods.

* 7 pages, 13 figures. This paper is under review

Neural Implicit Surface Reconstruction using Imaging Sonar

Sep 17, 2022
Mohamad Qadri, Michael Kaess, Ioannis Gkioulekas

We present a technique for dense 3D reconstruction of objects using an imaging sonar, also known as forward-looking sonar (FLS). Compared to previous methods that model the scene geometry as point clouds or volumetric grids, we represent the geometry as a neural implicit function. Additionally, given such a representation, we use a differentiable volumetric renderer that models the propagation of acoustic waves to synthesize imaging sonar measurements. We perform experiments on real and synthetic datasets and show that our algorithm reconstructs high-fidelity surface geometry from multi-view FLS images at much higher quality than was possible with previous techniques and without suffering from their associated memory overhead.

* 8 pages, 8 figures. This paper is under review

Group-$k$ Consistent Measurement Set Maximization for Robust Outlier Detection

Sep 06, 2022
Brendon Forsgren, Ram Vasudevan, Michael Kaess, Timothy W. McLain, Joshua G. Mangelson

This paper presents a method for the robust selection of measurements in a simultaneous localization and mapping (SLAM) framework. Existing methods check consistency or compatibility on a pairwise basis; however, many measurement types are not sufficiently constrained in a pairwise scenario to determine if either measurement is inconsistent with the other. This paper presents group-$k$ consistency maximization (G$k$CM), which estimates the largest set of measurements that is internally group-$k$ consistent. Solving for the largest set of group-$k$ consistent measurements can be formulated as an instance of the maximum clique problem on generalized graphs and can be solved by adapting current methods. This paper evaluates the performance of G$k$CM using simulated data and compares it to pairwise consistency maximization (PCM) presented in previous work.
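
A brute-force sketch can make the group-$k$ idea concrete. The toy setting here is an assumption made for illustration: measurements are 2D points that should lie on one common line. Any two points are trivially consistent with some line, so a pairwise check is uninformative, while a check over groups of $k = 3$ is not. The function names and the collinearity test are hypothetical, and the exhaustive search stands in for the adapted maximum clique solvers mentioned in the abstract.

```python
from itertools import combinations


def largest_group_k_consistent(measurements, k, is_consistent):
    """Return indices of the largest subset in which every group of k is consistent.

    Exhaustive search from the largest subset size downward; exponential cost,
    so only suitable for tiny examples.
    """
    n = len(measurements)
    for size in range(n, k - 1, -1):
        for subset in combinations(range(n), size):
            if all(is_consistent(*(measurements[i] for i in group))
                   for group in combinations(subset, k)):
                return list(subset)
    return []


def collinear(p, q, r, tol=1e-6):
    """Group-3 check: twice the signed triangle area must be near zero."""
    area2 = (q[0] - p[0]) * (r[1] - p[1]) - (q[1] - p[1]) * (r[0] - p[0])
    return abs(area2) <= tol


# Four points on the line y = x plus two outliers (hypothetical data)
points = [(0.0, 0.0), (1.0, 1.0), (2.0, 2.0), (3.0, 3.0), (1.0, 4.0), (5.0, 0.0)]
inliers = largest_group_k_consistent(points, 3, collinear)
# inliers == [0, 1, 2, 3]
```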

Long-term Visual Map Sparsification with Heterogeneous GNN

Mar 29, 2022
Ming-Fang Chang, Yipu Zhao, Rajvi Shah, Jakob J. Engel, Michael Kaess, Simon Lucey

We address the problem of map sparsification for long-term visual localization. For map sparsification, a commonly employed assumption is that the pre-built map and the later captured localization query are consistent. However, this assumption can be easily violated in the dynamic world. Additionally, the map size grows as new data accumulate through time, causing large data overhead in the long term. In this paper, we aim to overcome the environmental changes and reduce the map size at the same time by selecting points that are valuable to future localization. Inspired by the recent progress in Graph Neural Networks (GNNs), we propose the first work that models SfM maps as heterogeneous graphs and predicts 3D point importance scores with a GNN, which enables us to directly exploit the rich information in the SfM map graph. Two novel supervisions are proposed: 1) a data-fitting term for selecting points valuable for future localization based on training queries; 2) a K-Cover term for selecting sparse points with full map coverage. The experiments show that our method selected map points on stable and widely visible structures and outperformed baselines in localization performance.

* Accepted by CVPR 2022

Real-time Registration and Reconstruction with Cylindrical LiDAR Images

Dec 06, 2021
Wei Dong, Kwonyoung Ryu, Michael Kaess, Jaesik Park

Spinning LiDAR data are prevalent for 3D perception tasks, yet their cylindrical image form is less studied. Conventional approaches regard scans as point clouds, and they either rely on expensive Euclidean 3D nearest neighbor search for data association or depend on projected range images for further processing. We revisit the LiDAR scan formation and present a cylindrical range image representation for data from raw scans, equipped with an efficient calibrated spherical projective model. With our formulation, we 1) collect a large dataset of LiDAR data consisting of both indoor and outdoor sequences accompanied by pseudo-ground truth poses; 2) evaluate the projective and conventional registration approaches on the sequences with both synthetic and real-world transformations; 3) transfer state-of-the-art RGB-D algorithms to LiDAR that run at up to 180 Hz for registration and 150 Hz for dense reconstruction. The dataset and tools will be released.

* 6 pages, 7 figures. This paper is under review
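
To illustrate what projecting a scan into a cylindrical range image involves, here is a minimal sketch that maps one 3D point to a pixel. It assumes evenly spaced azimuth columns and evenly spaced elevation rows; real sensors generally need per-beam calibration, and the calibrated model from the paper is not reproduced here. The image size, the vertical field of view, and the function name are hypothetical.

```python
import math


def project(point, width=1024, height=64, elev_min_deg=-22.5, elev_max_deg=22.5):
    """Map a 3D point in the sensor frame to (column, row, range).

    Returns None for the origin or for points outside the vertical field of view.
    """
    x, y, z = point
    rng = math.sqrt(x * x + y * y + z * z)
    if rng == 0.0:
        return None
    azimuth = math.atan2(y, x)        # in [-pi, pi]
    elevation = math.asin(z / rng)    # in [-pi/2, pi/2]
    lo, hi = math.radians(elev_min_deg), math.radians(elev_max_deg)
    if not lo <= elevation <= hi:
        return None
    # Columns wrap around the full 360 degrees
    u = int((azimuth + math.pi) / (2.0 * math.pi) * width) % width
    # Row 0 is the top beam; clamp the lower boundary into the last row
    v = min(height - 1, int((hi - elevation) / (hi - lo) * height))
    return u, v, rng


pixel = project((10.0, 0.0, 0.0))   # point straight ahead at 10 m: (512, 32, 10.0)
outside = project((0.0, 0.0, 5.0))  # straight up, outside the vertical FOV: None
```

Once points are indexed this way, neighbors in the image are neighbors on the sensor's scan pattern, so data association can use a pixel lookup instead of a 3D nearest-neighbor search.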

PatchGraph: In-hand tactile tracking with learned surface normals

Nov 15, 2021
Paloma Sodhi, Michael Kaess, Mustafa Mukadam, Stuart Anderson

We address the problem of tracking 3D object poses from touch during in-hand manipulations. Specifically, we look at tracking small objects using vision-based tactile sensors that provide high-dimensional tactile image measurements at the point of contact. While prior work has relied on a priori information about the object being localized, we remove this requirement. Our key insight is that an object is composed of several local surface patches, each informative enough to achieve reliable object tracking. Moreover, we can recover the geometry of this local patch online by extracting local surface normal information embedded in each tactile image. We propose a novel two-stage approach. First, we learn a mapping from tactile images to surface normals using an image translation network. Second, we use these surface normals within a factor graph to both reconstruct a local patch map and use it to infer 3D object poses. We demonstrate reliable object tracking for over 100 contact sequences across unique shapes with four objects in simulation and two objects in the real world. Supplementary video: https://youtu.be/JwNTC9_nh8M

* 7 pages, 8 figures

ASH: A Modern Framework for Parallel Spatial Hashing in 3D Perception

Oct 01, 2021
Wei Dong, Yixing Lao, Michael Kaess, Vladlen Koltun

We present ASH, a modern and high-performance framework for parallel spatial hashing on GPU. Compared to existing GPU hash map implementations, ASH achieves higher performance, supports richer functionality, and requires fewer lines of code (LoC) when used for implementing spatially varying operations from volumetric geometry reconstruction to differentiable appearance reconstruction. Unlike existing GPU hash maps, the ASH framework provides a versatile tensor interface, hiding low-level details from the users. In addition, by decoupling the internal hashing data structures and key-value data in buffers, we offer direct access to spatially varying data via indices, enabling seamless integration with modern libraries such as PyTorch. To achieve this, we 1) detach stored key-value data from the low-level hash map implementation; 2) bridge the pointer-first low-level data structures to index-first high-level tensor interfaces via an index heap; 3) adapt both generic and non-generic integer-only hash map implementations as backends to operate on multi-dimensional keys. We first profile our hash map against state-of-the-art hash maps on synthetic data to show the performance gain from this architecture. We then show that ASH can consistently achieve higher performance on various large-scale 3D perception tasks with fewer LoC by showcasing several applications, including 1) point cloud voxelization, 2) dense volumetric SLAM, 3) non-rigid point cloud registration and volumetric deformation, and 4) spatially varying geometry and appearance refinement. ASH and its example applications are open-sourced in Open3D (http://www.open3d.org).

* 18 pages, 18 figures
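
Point cloud voxelization, the first application listed above, reduces to mapping each point to an integer voxel coordinate and aggregating per key in a hash map. The sketch below uses a plain Python dictionary on the CPU to show the idea with multi-dimensional integer keys; it does not use the ASH or Open3D API, and the function name and data are hypothetical.

```python
import math
from collections import defaultdict


def voxelize(points, voxel_size):
    """Group points by integer voxel coordinate and return one centroid per voxel."""
    sums = defaultdict(lambda: [0.0, 0.0, 0.0, 0])
    for x, y, z in points:
        # The 3D integer key is what the hash map is indexed by
        key = (math.floor(x / voxel_size),
               math.floor(y / voxel_size),
               math.floor(z / voxel_size))
        acc = sums[key]
        acc[0] += x
        acc[1] += y
        acc[2] += z
        acc[3] += 1
    return {key: (sx / n, sy / n, sz / n) for key, (sx, sy, sz, n) in sums.items()}


cloud = [(0.1, 0.1, 0.1), (0.3, 0.1, 0.1), (1.2, 0.0, 0.0), (-0.1, 0.0, 0.0)]
voxels = voxelize(cloud, voxel_size=0.5)
# Three occupied voxels: (0, 0, 0), (2, 0, 0) and (-1, 0, 0)
```

Using `math.floor` rather than `int` keeps voxel boundaries consistent for negative coordinates, since `int` truncates toward zero.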
