Loic Landrieu

FLAIR: a Country-Scale Land Cover Semantic Segmentation Dataset From Multi-Source Optical Imagery

Oct 20, 2023

Anatol Garioud, Nicolas Gonthier, Loic Landrieu, Apolline De Wit, Marion Valette, Marc Poupée, Sébastien Giordano, Boris Wattrelos

Abstract: We introduce the French Land cover from Aerospace ImageRy (FLAIR), an extensive dataset from the French National Institute of Geographical and Forest Information (IGN) that provides a unique and rich resource for large-scale geospatial analysis. FLAIR contains high-resolution aerial imagery with a ground sample distance of 20 cm and over 20 billion individually labeled pixels for precise land-cover classification. The dataset also integrates temporal and spectral data from optical satellite time series. FLAIR thus combines data with varying spatial, spectral, and temporal resolutions across more than 817 km² of acquisitions representing the full landscape diversity of France. This diversity makes FLAIR a valuable resource for the development and evaluation of novel methods for large-scale land-cover semantic segmentation and raises significant challenges in terms of computer vision, data fusion, and geospatial analysis. We also provide powerful uni- and multi-sensor baseline models that can be employed to assess an algorithm's performance and for downstream applications. Through its extent and the quality of its annotation, FLAIR aims to spur improvements in monitoring and understanding key anthropogenic development indicators such as urban growth, deforestation, and soil artificialization. The dataset and code can be accessed at https://ignf.github.io/FLAIR/

* NeurIPS 2023 - Datasets & Benchmarks Track
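
The two headline figures in the FLAIR abstract can be cross-checked with simple arithmetic: at a 20 cm ground sample distance each pixel covers 0.04 m², so 817 km² of imagery corresponds to roughly 20 billion pixels. The helper below is a hypothetical illustration (the function name is not from the paper) and assumes square, non-overlapping pixels.

```python
def pixel_count(area_km2: float, gsd_m: float) -> float:
    """Number of square pixels of side `gsd_m` (metres) needed to tile `area_km2`."""
    area_m2 = area_km2 * 1_000_000  # 1 km^2 = 1e6 m^2
    return area_m2 / (gsd_m ** 2)   # each pixel covers gsd_m * gsd_m square metres


n_pixels = pixel_count(817, 0.20)
print(f"{n_pixels / 1e9:.1f} billion pixels")  # prints "20.4 billion pixels"
```

The result agrees with the "over 20 billion individually labeled pixels" stated in the abstract.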

Efficient 3D Semantic Segmentation with Superpoint Transformer

Jun 13, 2023

Damien Robert, Hugo Raguet, Loic Landrieu

Abstract: We introduce a novel superpoint-based transformer architecture for efficient semantic segmentation of large-scale 3D scenes. Our method incorporates a fast algorithm to partition point clouds into a hierarchical superpoint structure, which makes our preprocessing 7 times faster than existing superpoint-based approaches. Additionally, we leverage a self-attention mechanism to capture the relationships between superpoints at multiple scales, leading to state-of-the-art performance on three challenging benchmark datasets: S3DIS (76.0% mIoU 6-fold validation), KITTI-360 (63.5% on Val), and DALES (79.6%). With only 212k parameters, our approach is up to 200 times more compact than other state-of-the-art models while maintaining similar performance. Furthermore, our model can be trained on a single GPU in 3 hours for a fold of the S3DIS dataset, which is 7x to 70x fewer GPU-hours than the best-performing methods. Our code and models are accessible at github.com/drprojects/superpoint_transformer.

* Code available at github.com/drprojects/superpoint_transformer
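
Several abstracts on this page report results as mIoU (mean intersection over union). As a reminder of what that number measures, here is a minimal sketch that computes it from a confusion matrix in plain Python. The function names are illustrative, and benchmark-specific conventions (ignored labels, how the six S3DIS folds are combined) are assumptions left out of this sketch.

```python
from typing import List, Sequence


def confusion_matrix(y_true: Sequence[int], y_pred: Sequence[int],
                     num_classes: int) -> List[List[int]]:
    """cm[t][p] counts points with true class t that were predicted as class p."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def mean_iou(cm: List[List[int]]) -> float:
    """Average of per-class IoU = TP / (TP + FP + FN), skipping absent classes."""
    n = len(cm)
    ious = []
    for c in range(n):
        tp = cm[c][c]
        fn = sum(cm[c]) - tp                       # true c, predicted otherwise
        fp = sum(cm[r][c] for r in range(n)) - tp  # predicted c, true otherwise
        denom = tp + fp + fn
        if denom > 0:  # class appears in neither labels nor predictions: skip it
            ious.append(tp / denom)
    return sum(ious) / len(ious) if ious else 0.0


cm = confusion_matrix([0, 0, 1, 1, 2, 2], [0, 1, 1, 1, 2, 0], num_classes=3)
print(mean_iou(cm))
```

In this toy example the per-class IoUs are 1/3, 2/3, and 1/2, so the mean is 0.5 up to floating-point rounding.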

Learnable Earth Parser: Discovering 3D Prototypes in Aerial Scans

Apr 19, 2023

Romain Loiseau, Elliot Vincent, Mathieu Aubry, Loic Landrieu

Abstract: We propose an unsupervised method for parsing large 3D scans of real-world scenes into interpretable parts. Our goal is to provide a practical tool for analyzing 3D scenes with unique characteristics in the context of aerial surveying and mapping, without relying on application-specific user annotations. Our approach is based on a probabilistic reconstruction model that decomposes an input 3D point cloud into a small set of learned prototypical shapes. Our model provides an interpretable reconstruction of complex scenes and leads to relevant instance and semantic segmentations. To demonstrate the usefulness of our results, we introduce a novel dataset of seven diverse aerial LiDAR scans. We show that our method outperforms state-of-the-art unsupervised methods in terms of decomposition accuracy while remaining visually interpretable. Our method offers a significant advantage over existing approaches, as it does not require any manual annotations, making it a practical and efficient tool for 3D scene analysis. Our code and dataset are available at https://imagine.enpc.fr/~loiseaur/learnable-earth-parser

A Survey and Benchmark of Automatic Surface Reconstruction from Point Clouds

Jan 31, 2023

Raphael Sulzer, Loic Landrieu, Renaud Marlet, Bruno Vallet

Abstract: We survey and benchmark traditional and novel learning-based algorithms that address the problem of surface reconstruction from point clouds. Surface reconstruction from point clouds is particularly challenging when applied to real-world acquisitions, due to noise, outliers, non-uniform sampling and missing data. Traditionally, different handcrafted priors of the input points or the output surface have been proposed to make the problem more tractable. However, hyperparameter tuning for adjusting priors to different acquisition defects can be a tedious task. To this end, the deep learning community has recently addressed the surface reconstruction problem. In contrast to traditional approaches, deep surface reconstruction methods can learn priors directly from a training set of point clouds and corresponding true surfaces. In our survey, we detail how different handcrafted and learned priors affect the robustness of methods to defect-laden input and their capability to generate geometrically and topologically accurate reconstructions. In our benchmark, we evaluate the reconstructions of several traditional and learning-based methods on the same grounds. We show that learning-based methods can generalize to unseen shape categories, but their training and test sets must share the same point cloud characteristics. We also provide the code and data to compete in our benchmark and to further stimulate the development of learning-based surface reconstruction at https://github.com/raphaelsulzer/dsr-benchmark.

A Model You Can Hear: Audio Identification with Playable Prototypes

Aug 05, 2022

Romain Loiseau, Baptiste Bouvier, Yann Teytaut, Elliot Vincent, Mathieu Aubry, Loic Landrieu

Abstract: Machine learning techniques have proved useful for classifying and analyzing audio content. However, recent methods typically rely on abstract and high-dimensional representations that are difficult to interpret. Inspired by transformation-invariant approaches developed for image and 3D data, we propose an audio identification model based on learnable spectral prototypes. Equipped with dedicated transformation networks, these prototypes can be used to cluster and classify input audio samples from large collections of sounds. Our model can be trained with or without supervision and reaches state-of-the-art results for speaker and instrument identification, while remaining easily interpretable. The code is available at: https://github.com/romainloiseau/a-model-you-can-hear

Multi-Layer Modeling of Dense Vegetation from Aerial LiDAR Scans

Apr 25, 2022

Ekaterina Kalinicheva, Loic Landrieu, Clément Mallet, Nesrine Chehata

Abstract: The analysis of the multi-layer structure of wild forests is an important challenge of automated large-scale forestry. While modern aerial LiDARs offer geometric information across all vegetation layers, most datasets and methods focus only on the segmentation and reconstruction of the top of the canopy. We release WildForest3D, which consists of 29 study plots and over 2000 individual trees across 47,000 m² with dense 3D annotation, along with occupancy and height maps for 3 vegetation layers: ground vegetation, understory, and overstory. We propose a 3D deep network architecture predicting for the first time both 3D point-wise labels and high-resolution layer occupancy rasters simultaneously. This allows us to produce a precise estimation of the thickness of each vegetation layer as well as the corresponding watertight meshes, therefore meeting most forestry purposes. Both the dataset and the model are released in open access: https://github.com/ekalinicheva/multi_layer_vegetation.

* Earth Vision Workshop, CVPR 2022

Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic Segmentation

Apr 15, 2022

Damien Robert, Bruno Vallet, Loic Landrieu

Abstract: Recent works on 3D semantic segmentation propose to exploit the synergy between images and point clouds by processing each modality with a dedicated network and projecting learned 2D features onto 3D points. Merging large-scale point clouds and images raises several challenges, such as constructing a mapping between points and pixels, and aggregating features between multiple views. Current methods require mesh reconstruction or specialized sensors to recover occlusions, and use heuristics to select and aggregate available images. In contrast, we propose an end-to-end trainable multi-view aggregation model leveraging the viewing conditions of 3D points to merge features from images taken at arbitrary positions. Our method can combine standard 2D and 3D networks and outperforms both 3D models operating on colorized point clouds and hybrid 2D/3D networks without requiring colorization, meshing, or true depth maps. We set a new state-of-the-art for large-scale indoor/outdoor semantic segmentation on S3DIS (74.7 mIoU 6-Fold) and on KITTI-360 (58.3 mIoU). Our full pipeline is accessible at https://github.com/drprojects/DeepViewAgg, and only requires raw 3D scans and a set of images and poses.

* Accepted to CVPR 2022 with an Oral presentation; camera ready version. 17 pages, 11 figures. Code and data available at https://github.com/drprojects/DeepViewAgg

Deep Surface Reconstruction from Point Clouds with Visibility Information

Feb 03, 2022

Raphael Sulzer, Loic Landrieu, Alexandre Boulch, Renaud Marlet, Bruno Vallet

Abstract: Most current neural networks for reconstructing surfaces from point clouds ignore sensor poses and only operate on raw point locations. Sensor visibility, however, holds meaningful information regarding space occupancy and surface orientation. In this paper, we present two simple ways to augment raw point clouds with visibility information, so it can directly be leveraged by surface reconstruction networks with minimal adaptation. Our proposed modifications consistently improve the accuracy of generated surfaces as well as the generalization ability of the networks to unseen shape domains. Our code and data are available at https://github.com/raphaelsulzer/dsrv-data.

* 13 pages
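
The abstract above does not spell out its two augmentations. To make the general idea concrete, the sketch below shows one plausible form of visibility information: appending to each point the unit vector pointing toward the sensor that observed it. This is an illustrative assumption, not necessarily one of the authors' two methods, and `add_sensor_direction` is a hypothetical name.

```python
import math
from typing import List, Sequence, Tuple

Vec3 = Tuple[float, float, float]


def add_sensor_direction(points: Sequence[Vec3],
                         sensors: Sequence[Vec3]) -> List[Tuple[float, ...]]:
    """Return (x, y, z, dx, dy, dz) per point, where (dx, dy, dz) is the unit
    vector from the point toward the sensor position that observed it.

    Assumes points[i] was acquired from sensors[i].
    """
    augmented = []
    for (px, py, pz), (sx, sy, sz) in zip(points, sensors):
        dx, dy, dz = sx - px, sy - py, sz - pz
        norm = math.sqrt(dx * dx + dy * dy + dz * dz)
        if norm == 0.0:
            direction = (0.0, 0.0, 0.0)  # degenerate case: point at sensor origin
        else:
            direction = (dx / norm, dy / norm, dz / norm)
        augmented.append((px, py, pz) + direction)
    return augmented


# A point on the ground seen by a sensor 2 m directly above it.
print(add_sensor_direction([(0.0, 0.0, 0.0)], [(0.0, 0.0, 2.0)]))
```

A network that consumes per-point features can take the three extra channels as additional input, which fits the abstract's claim that only minimal adaptation is needed.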

Predicting Vegetation Stratum Occupancy from Airborne LiDAR Data with Deep Learning

Jan 20, 2022

Ekaterina Kalinicheva, Loic Landrieu, Clément Mallet, Nesrine Chehata

Abstract: We propose a new deep learning-based method for estimating the occupancy of vegetation strata from airborne 3D LiDAR point clouds. Our model predicts rasterized occupancy maps for three vegetation strata corresponding to lower, medium, and higher cover. Our weakly-supervised training scheme allows our network to only be supervised with vegetation occupancy values aggregated over cylindrical plots containing thousands of points. Such ground truth is easier to produce than pixel-wise or point-wise annotations. Our method outperforms handcrafted and deep learning baselines in terms of precision by up to 30%, while simultaneously providing visual and interpretable predictions. We provide an open-source implementation along with a dataset of 199 agricultural plots to train and evaluate weakly supervised occupancy regression algorithms.
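
The weak supervision described above can be sketched as follows: the network outputs a per-pixel occupancy raster, but the only label is a single occupancy value for the whole plot, so the loss compares an aggregate of the predicted pixels with that value. The mean aggregation, the absolute-error loss, and the name `plot_level_loss` are assumptions made for illustration; the abstract does not specify them.

```python
from typing import Sequence


def plot_level_loss(pred_raster: Sequence[Sequence[float]],
                    plot_mask: Sequence[Sequence[bool]],
                    target_occupancy: float) -> float:
    """Absolute error between the mean predicted occupancy inside the plot
    and the single plot-level ground-truth value (for one stratum)."""
    inside = [
        p
        for row_pred, row_mask in zip(pred_raster, plot_mask)
        for p, m in zip(row_pred, row_mask)
        if m  # keep only pixels that fall inside the plot footprint
    ]
    if not inside:
        raise ValueError("plot mask selects no pixels")
    aggregated = sum(inside) / len(inside)
    return abs(aggregated - target_occupancy)


pred = [[0.2, 0.8],
        [0.6, 0.4]]
mask = [[True, True],
        [True, False]]  # bottom-right pixel lies outside the plot
print(plot_level_loss(pred, mask, target_occupancy=0.5))
```

Because the loss only constrains the aggregate, individual pixels remain free to vary, which is how per-pixel maps can emerge from plot-level labels.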

Vegetation Stratum Occupancy Prediction from Airborne LiDAR 3D Point Clouds

Dec 27, 2021

Ekaterina Kalinicheva, Loic Landrieu, Clément Mallet, Nesrine Chehata

Abstract: We propose a new deep learning-based method for estimating the occupancy of vegetation strata from 3D point clouds captured from an aerial platform. Our model predicts rasterized occupancy maps for three vegetation strata: lower, medium, and higher strata. Our training scheme allows our network to be supervised only with values aggregated over cylindrical plots, which are easier to produce than pixel-wise or point-wise annotations. Our method outperforms handcrafted and deep learning baselines in terms of precision while simultaneously providing visual and interpretable predictions. We provide an open-source implementation of our method along with a dataset of 199 agricultural plots to train and evaluate occupancy regression algorithms.

* SilviLaser 2021 Conference
