CAOR
Abstract: Within a perception framework for autonomous mobile and robotic systems, semantic analysis of 3D point clouds, typically generated by LiDARs, is key to numerous applications such as object detection and recognition, and scene reconstruction. Scene semantic segmentation can be achieved by directly integrating 3D spatial data with specialized deep neural networks. Although this type of data provides rich geometric information about the surrounding environment, it also presents numerous challenges: its unstructured and sparse nature, its unpredictable size, and its demanding computational requirements. These characteristics hinder real-time semantic analysis, particularly on the resource-constrained hardware architectures that constitute the main computational components of numerous robotic applications. Therefore, in this paper, we investigate various 3D semantic segmentation methodologies and analyze their performance and capabilities for resource-constrained inference on embedded NVIDIA Jetson platforms. For a fair comparison, we evaluate them under a standardized training protocol and data augmentation scheme, providing benchmark results on the Jetson AGX Orin and AGX Xavier series for two large-scale outdoor datasets: SemanticKITTI and nuScenes.
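A minimal sketch of the kind of latency measurement such an embedded benchmark relies on, assuming a PyTorch model running on a CUDA-capable Jetson; the model, input, and iteration counts are illustrative placeholders, not the exact protocol of the paper:

```python
import time
import torch

def measure_latency(model, example_input, warmup=10, iters=100):
    """Average single-scan inference time on a CUDA device (e.g. a Jetson)."""
    model.eval().cuda()
    example_input = example_input.cuda()
    with torch.no_grad():
        for _ in range(warmup):           # warm-up to stabilize clocks and caches
            model(example_input)
        torch.cuda.synchronize()          # wait for queued kernels before timing
        start = time.perf_counter()
        for _ in range(iters):
            model(example_input)
        torch.cuda.synchronize()
    return (time.perf_counter() - start) / iters
```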
Abstract: Differentiable volumetric rendering-based methods have made significant progress in novel view synthesis. On one hand, innovative methods have replaced the Neural Radiance Fields (NeRF) network with locally parameterized structures, enabling high-quality renderings in a reasonable time. On the other hand, approaches have used differentiable splatting instead of NeRF's ray casting to optimize radiance fields rapidly using Gaussian kernels, allowing for fine adaptation to the scene. However, differentiable ray casting of irregularly spaced kernels has been scarcely explored, while splatting, despite enabling fast rendering times, is susceptible to clearly visible artifacts. Our work closes this gap by providing a physically consistent formulation of the emitted radiance c and density σ, decomposed with Gaussian functions associated with Spherical Gaussians/Harmonics for all-frequency colorimetric representation. We also introduce a method enabling differentiable ray casting of irregularly distributed Gaussians using an algorithm that integrates radiance fields slab by slab and leverages a BVH structure. This allows our approach to finely adapt to the scene while avoiding splatting artifacts. As a result, we achieve superior rendering quality compared to the state-of-the-art while maintaining reasonable training times and achieving inference speeds of 25 FPS on the Blender dataset. Project page with videos and code: https://raygauss.github.io/
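For context, the emission-absorption volume rendering model that such ray casting integrates along a ray r(t) = o + t d is recalled below, together with one plausible generic form of a Gaussian decomposition of σ and c; this is a sketch of the idea, not the paper's exact parameterization:

```latex
% Classical emission-absorption volume rendering along a ray r(t) = o + t d
C(\mathbf{o},\mathbf{d}) = \int_{t_n}^{t_f} T(t)\,\sigma\!\big(\mathbf{r}(t)\big)\,
      c\!\big(\mathbf{r}(t),\mathbf{d}\big)\,\mathrm{d}t,
\qquad
T(t) = \exp\!\left(-\int_{t_n}^{t} \sigma\!\big(\mathbf{r}(s)\big)\,\mathrm{d}s\right)

% Generic decomposition over Gaussian kernels G_k with per-kernel directional
% colors c_k(d) (e.g. Spherical Harmonics/Gaussians); illustrative form only
\sigma(\mathbf{x}) = \sum_k \sigma_k\, G_k(\mathbf{x}),
\qquad
c(\mathbf{x},\mathbf{d}) =
  \frac{\sum_k \sigma_k\, G_k(\mathbf{x})\, c_k(\mathbf{d})}
       {\sum_k \sigma_k\, G_k(\mathbf{x})}
```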
Abstract: LiDAR semantic segmentation for autonomous driving has been a growing field of interest in the past few years. Datasets and methods have appeared and expanded very quickly, but methods have not been updated to exploit this new availability of data and continue to rely on the same classical datasets. The different ways of performing LiDAR semantic segmentation training and inference can be divided into several subfields, which include the following: domain generalization, the ability to segment data coming from unseen domains; source-to-source segmentation, the ability to segment data coming from the training domain; and pre-training, the ability to create reusable geometric primitives. In this work, we aim to improve results in all of these subfields with the novel approach of multi-source training. Multi-source training relies on the availability of various datasets at training time and uses them together rather than relying on only one dataset. To overcome the common obstacles encountered in multi-source training, we introduce coarse labels and call the newly created multi-source dataset COLA. We propose three applications of this new dataset that display systematic improvement over single-source strategies: COLA-DG for domain generalization (up to +10%), COLA-S2S for source-to-source segmentation (up to +5.3%), and COLA-PT for pre-training (up to +12%).
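A minimal sketch of the fine-to-coarse label remapping that such a shared label set enables, with hypothetical mappings for illustration (the actual COLA taxonomy is defined in the paper):

```python
# Hypothetical fine-to-coarse mappings; the real COLA coarse classes differ.
COARSE_MAP = {
    "semantickitti": {"car": "vehicle", "truck": "vehicle",
                      "road": "ground", "sidewalk": "ground", "person": "human"},
    "nuscenes": {"vehicle.car": "vehicle", "vehicle.truck": "vehicle",
                 "flat.driveable_surface": "ground",
                 "human.pedestrian.adult": "human"},
}

def to_coarse(dataset_name, fine_labels):
    """Remap per-point fine labels of one dataset to the shared coarse label set."""
    mapping = COARSE_MAP[dataset_name]
    return [mapping.get(label, "ignore") for label in fine_labels]
```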
Abstract: LiDAR is a sensor system that supports autonomous driving by gathering precise geometric information about the scene. Exploiting this information for perception is interesting as the amount of available data increases. As the quantitative performance of various perception tasks has improved, the focus has shifted from source-to-source perception to domain adaptation and domain generalization for perception. These new goals require access to a large variety of domains for evaluation. Unfortunately, the various annotation strategies of data providers complicate the computation of cross-domain performance based on the available data. This paper provides a novel dataset, specifically designed for cross-domain evaluation, to make it easier to evaluate the performance of various source datasets. Alongside the dataset, a flexible online benchmark is provided to ensure a fair comparison across methods.
Abstract: Supervised 3D object detection models have shown increasingly strong performance in single-domain cases where the training data comes from the same environment and sensor as the testing data. However, in real-world scenarios, data from the target domain may not be available for fine-tuning or for domain adaptation methods. Indeed, 3D object detection models trained on a source dataset with a specific point distribution have shown difficulties in generalizing to unseen datasets. Therefore, we leverage the information available from several annotated source datasets with our Multi-Dataset Training for 3D Object Detection (MDT3D) method to increase the robustness of 3D object detection models when tested in a new environment with a different sensor configuration. To tackle the labelling gap between datasets, we use a new label mapping based on coarse labels. Furthermore, we show how we manage the mix of datasets during training and introduce a new cross-dataset augmentation method: cross-dataset object injection. We demonstrate that this training paradigm yields improvements for different types of 3D object detection models. The source code and additional results for this research project will be publicly available on GitHub: https://github.com/LouisSF/MDT3D
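An illustrative sketch of what cross-dataset object injection could look like, pasting object point clusters cropped from annotated boxes of one dataset into scans of another; placement is random here, and the collision checks or ground alignment needed in practice are omitted:

```python
import numpy as np

def inject_objects(scene_points, object_bank, rng=None):
    """Paste object point clusters (centered at the origin) into a target scan.

    scene_points : (N, 3) array, the target LiDAR scan.
    object_bank  : list of (M_i, 3) arrays cropped from source-dataset boxes.
    """
    rng = rng or np.random.default_rng()
    augmented = [scene_points]
    for obj in object_bank:
        yaw = rng.uniform(0.0, 2.0 * np.pi)
        rot = np.array([[np.cos(yaw), -np.sin(yaw), 0.0],
                        [np.sin(yaw),  np.cos(yaw), 0.0],
                        [0.0, 0.0, 1.0]])
        offset = np.array([rng.uniform(-30, 30), rng.uniform(-30, 30), 0.0])
        augmented.append(obj @ rot.T + offset)   # rotate, then translate
    return np.concatenate(augmented, axis=0)
```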
Abstract: Algorithms for state estimation of humanoid robots usually assume that the feet remain flat and in a constant position while in contact with the ground. However, this hypothesis is easily violated while walking, especially for human-like gaits with heel-toe motion. This reduces the time during which the contact assumption can be used, or requires higher variances to account for errors. In this paper, we present a novel state estimator based on the extended Kalman filter that can properly handle any contact configuration. We consider multiple inertial measurement units (IMUs) distributed throughout the robot's structure, including on both feet, which are used to track multiple bodies of the robot. This multi-IMU instrumentation setup also has the advantage of allowing the deformations in the robot's structure to be estimated, improving the kinematic model used in the filter. The proposed approach is validated experimentally on the exoskeleton Atalante and is shown to present low drift, performing better than similar single-IMU filters. The obtained trajectory estimates are accurate enough to construct elevation maps that have little distortion with respect to the ground truth.
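For reference, the generic extended Kalman filter recursion that underlies such estimators, as a minimal numpy sketch; the state, process, and measurement models of the actual multi-IMU, multi-contact filter are far richer than this placeholder:

```python
import numpy as np

def ekf_step(x, P, u, z, f, h, F, H, Q, R):
    """One generic EKF iteration (illustrative, not the paper's filter).

    f, h : process and measurement models; F, H : their Jacobians.
    Q, R : process and measurement noise covariances.
    """
    # Prediction
    x_pred = f(x, u)
    F_k = F(x, u)
    P_pred = F_k @ P @ F_k.T + Q
    # Update
    H_k = H(x_pred)
    y = z - h(x_pred)                        # innovation
    S = H_k @ P_pred @ H_k.T + R             # innovation covariance
    K = P_pred @ H_k.T @ np.linalg.inv(S)    # Kalman gain
    x_new = x_pred + K @ y
    P_new = (np.eye(len(x)) - K @ H_k) @ P_pred
    return x_new, P_new
```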
Abstract: 3D autonomous driving semantic segmentation using deep learning has become a well-studied subject, providing methods that can reach very high performance. Nonetheless, because of the limited size of the training datasets, these models cannot see every type of object and scene found in real-world applications. The ability to be reliable in these various unknown environments is called domain generalization. Despite its importance, domain generalization is relatively unexplored in the case of 3D autonomous driving semantic segmentation. To fill this gap, this paper presents the first benchmark for this application by testing state-of-the-art methods and discussing the difficulty of tackling LiDAR domain shifts. We also propose the first method designed to address this domain generalization, which we call 3DLabelProp. This method relies on leveraging the geometry and sequentiality of the LiDAR data to enhance its generalization performance by working on partially accumulated point clouds. It reaches a mIoU of 52.6% on SemanticPOSS while being trained only on SemanticKITTI, making it the state-of-the-art method for generalization (+7.4% better than the second-best method). The code for this method will be available on GitHub.
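A minimal sketch of the accumulation step on which such a method can operate, expressing a few consecutive scans in the frame of a reference scan using known sensor poses; this only illustrates partial accumulation, not 3DLabelProp itself:

```python
import numpy as np

def accumulate_scans(scans, poses, ref_idx=-1):
    """Express consecutive LiDAR scans in the frame of a reference scan.

    scans : list of (N_i, 3) point arrays in their own sensor frames.
    poses : list of 4x4 world-from-sensor homogeneous transforms.
    """
    ref_from_world = np.linalg.inv(poses[ref_idx])
    accumulated = []
    for pts, world_from_sensor in zip(scans, poses):
        ref_from_sensor = ref_from_world @ world_from_sensor
        homog = np.hstack([pts, np.ones((pts.shape[0], 1))])
        accumulated.append((homog @ ref_from_sensor.T)[:, :3])
    return np.concatenate(accumulated, axis=0)
```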
Abstract: Scene Completion is the task of completing missing geometry from a partial scan of a scene. The majority of previous methods compute an implicit representation from range data using a Truncated Signed Distance Function (TSDF) on a 3D grid as input to neural networks. The truncation limits but does not remove the ambiguous cases introduced by the sign for non-closed surfaces. As an alternative, we present an Unsigned Distance Function (UDF) called Unsigned Weighted Euclidean Distance (UWED) as input to the scene completion neural networks. UWED is simple and efficient as a surface representation, and can be computed on any noisy point cloud without normals. To obtain the explicit geometry, we present a method for extracting a point cloud from discretized UDF values on a regular grid. We compare different SDFs and UDFs for the scene completion task on indoor and outdoor point clouds collected from RGB-D and LiDAR sensors and show improved completion using the proposed UWED function.
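To fix ideas, a minimal sketch of a plain nearest-neighbor unsigned distance field sampled on a regular grid; UWED additionally weights the contributions of neighboring points, which is not reproduced here:

```python
import numpy as np
from scipy.spatial import cKDTree

def unsigned_distance_grid(points, grid_min, grid_max, resolution):
    """Plain nearest-neighbor UDF on a regular 3D grid (not UWED itself)."""
    axes = [np.arange(lo, hi, resolution) for lo, hi in zip(grid_min, grid_max)]
    gx, gy, gz = np.meshgrid(*axes, indexing="ij")
    queries = np.stack([gx.ravel(), gy.ravel(), gz.ravel()], axis=1)
    dists, _ = cKDTree(points).query(queries)   # distance to the closest point
    return dists.reshape(gx.shape)
```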
Abstract: LiDAR sensors provide rich 3D information about surrounding scenes and are becoming increasingly important for autonomous vehicle tasks such as semantic segmentation, object detection, and tracking. Being able to simulate a LiDAR sensor will accelerate the testing, validation, and deployment of autonomous vehicles while reducing the cost and eliminating the risks of testing in real-world scenarios. To tackle the issue of simulating LiDAR data with high fidelity, we present a pipeline that leverages real-world point clouds acquired by mobile mapping systems. Point-based geometry representations, more specifically splats, have proven their ability to accurately model the underlying surface in very large point clouds. Showing the limits of basic splatting, we introduce an adaptive splat generation method that accurately models the underlying 3D geometry, especially for thin structures. We have also developed a LiDAR simulation that runs 200 times faster than real time by ray casting on the GPU while efficiently handling large point clouds. We test our LiDAR simulation in real-world conditions, showing qualitative and quantitative results against basic splatting and meshing, and demonstrating the superiority of our modeling technique.
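The core geometric primitive of ray casting on splats is the ray-splat intersection; a minimal sketch treating each splat as an oriented disc is given below (the adaptive splat generation and GPU ray casting of the paper go well beyond this):

```python
import numpy as np

def ray_splat_hit(origin, direction, center, normal, radius, eps=1e-9):
    """Intersect a ray with one circular splat (oriented disc).

    Returns the hit distance t along the ray, or None if the ray misses.
    """
    denom = np.dot(direction, normal)
    if abs(denom) < eps:            # ray is parallel to the splat plane
        return None
    t = np.dot(center - origin, normal) / denom
    if t < 0.0:                     # splat lies behind the ray origin
        return None
    hit = origin + t * direction
    if np.linalg.norm(hit - center) > radius:
        return None                 # intersection falls outside the disc
    return t
```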
Abstract: Transfer learning is a proven technique in 2D computer vision to leverage the large amount of data available and achieve high performance with datasets limited in size due to the cost of acquisition or annotation. In 3D, annotation is known to be a costly task; nevertheless, transfer learning methods have only recently been investigated. Unsupervised pre-training has been heavily favored, as no very large annotated datasets are available. In this work, we tackle the case of real-time 3D semantic segmentation of sparse outdoor LiDAR scans. Such datasets have been on the rise, but with different label sets even for the same task. We propose an intermediate-level label set called the coarse labels, which allows all the available data to be leveraged without any manual labeling. This way, we have access to a larger dataset, alongside a simpler semantic segmentation task. With it, we introduce a new pre-training task: coarse label pre-training, also called COLA. We thoroughly analyze the impact of COLA on various datasets and architectures and show that it yields a noticeable performance improvement, especially when the fine-tuning task has access only to a small dataset.
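A minimal sketch of the pre-train-then-fine-tune pattern this enables, assuming a PyTorch model that exposes a linear classification head named `head` (an illustrative assumption, not the paper's architecture):

```python
import torch

def prepare_for_finetuning(model, checkpoint_path, num_fine_classes):
    """Reuse pre-trained backbone weights and reset the head
    for the fine label set of the target dataset."""
    state = torch.load(checkpoint_path, map_location="cpu")
    # keep backbone weights only; drop the coarse-label classification head
    backbone_state = {k: v for k, v in state.items() if not k.startswith("head")}
    model.load_state_dict(backbone_state, strict=False)
    # assumes `model.head` is a torch.nn.Linear layer (hypothetical)
    model.head = torch.nn.Linear(model.head.in_features, num_fine_classes)
    return model
```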