Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Qian Du

A3CLNN: Spatial, Spectral and Multiscale Attention ConvLSTM Neural Network for Multisource Remote Sensing Data Classification

Apr 09, 2022

Heng-Chao Li, Wen-Shuai Hu, Wei Li, Jun Li, Qian Du, Antonio Plaza

Figure 1 for A3CLNN: Spatial, Spectral and Multiscale Attention ConvLSTM Neural Network for Multisource Remote Sensing Data Classification

Figure 2 for A3CLNN: Spatial, Spectral and Multiscale Attention ConvLSTM Neural Network for Multisource Remote Sensing Data Classification

Figure 3 for A3CLNN: Spatial, Spectral and Multiscale Attention ConvLSTM Neural Network for Multisource Remote Sensing Data Classification

Figure 4 for A3CLNN: Spatial, Spectral and Multiscale Attention ConvLSTM Neural Network for Multisource Remote Sensing Data Classification

Abstract:The problem of effectively exploiting the information multiple data sources has become a relevant but challenging research topic in remote sensing. In this paper, we propose a new approach to exploit the complementarity of two data sources: hyperspectral images (HSIs) and light detection and ranging (LiDAR) data. Specifically, we develop a new dual-channel spatial, spectral and multiscale attention convolutional long short-term memory neural network (called dual-channel A3CLNN) for feature extraction and classification of multisource remote sensing data. Spatial, spectral and multiscale attention mechanisms are first designed for HSI and LiDAR data in order to learn spectral- and spatial-enhanced feature representations, and to represent multiscale information for different classes. In the designed fusion network, a novel composite attention learning mechanism (combined with a three-level fusion strategy) is used to fully integrate the features in these two data sources. Finally, inspired by the idea of transfer learning, a novel stepwise training strategy is designed to yield a final classification result. Our experimental results, conducted on several multisource remote sensing data sets, demonstrate that the newly proposed dual-channel A3CLNN exhibits better feature representation ability (leading to more competitive classification performance) than other state-of-the-art methods.

* IEEE Transactions on Neural Networks and Learning Systems, vol. 33, no. 2, pp. 747-761, Feb. 2022
* 16 pages, 10 figures

Via

Access Paper or Ask Questions

MS-HLMO: Multi-scale Histogram of Local Main Orientation for Remote Sensing Image Registration

Apr 01, 2022

Chenzhong Gao, Wei Li, Ran Tao, Qian Du

Figure 1 for MS-HLMO: Multi-scale Histogram of Local Main Orientation for Remote Sensing Image Registration

Figure 2 for MS-HLMO: Multi-scale Histogram of Local Main Orientation for Remote Sensing Image Registration

Figure 3 for MS-HLMO: Multi-scale Histogram of Local Main Orientation for Remote Sensing Image Registration

Figure 4 for MS-HLMO: Multi-scale Histogram of Local Main Orientation for Remote Sensing Image Registration

Abstract:Multi-source image registration is challenging due to intensity, rotation, and scale differences among the images. Considering the characteristics and differences of multi-source remote sensing images, a feature-based registration algorithm named Multi-scale Histogram of Local Main Orientation (MS-HLMO) is proposed. Harris corner detection is first adopted to generate feature points. The HLMO feature of each Harris feature point is extracted on a Partial Main Orientation Map (PMOM) with a Generalized Gradient Location and Orientation Histogram-like (GGLOH) feature descriptor, which provides high intensity, rotation, and scale invariance. The feature points are matched through a multi-scale matching strategy. Comprehensive experiments on 17 multi-source remote sensing scenes demonstrate that the proposed MS-HLMO and its simplified version MS-HLMO$^+$ outperform other competitive registration algorithms in terms of effectiveness and generalization.

Via

Access Paper or Ask Questions

Change Detection from Synthetic Aperture Radar Images via Dual Path Denoising Network

Mar 13, 2022

Junjie Wang, Feng Gao, Junyu Dong, Qian Du, Heng-Chao Li

Figure 1 for Change Detection from Synthetic Aperture Radar Images via Dual Path Denoising Network

Figure 2 for Change Detection from Synthetic Aperture Radar Images via Dual Path Denoising Network

Figure 3 for Change Detection from Synthetic Aperture Radar Images via Dual Path Denoising Network

Figure 4 for Change Detection from Synthetic Aperture Radar Images via Dual Path Denoising Network

Abstract:Benefited from the rapid and sustainable development of synthetic aperture radar (SAR) sensors, change detection from SAR images has received increasing attentions over the past few years. Existing unsupervised deep learning-based methods have made great efforts to exploit robust feature representations, but they consume much time to optimize parameters. Besides, these methods use clustering to obtain pseudo-labels for training, and the pseudo-labeled samples often involve errors, which can be considered as "label noise". To address these issues, we propose a Dual Path Denoising Network (DPDNet) for SAR image change detection. In particular, we introduce the random label propagation to clean the label noise involved in preclassification. We also propose the distinctive patch convolution for feature representation learning to reduce the time consumption. Specifically, the attention mechanism is used to select distinctive pixels in the feature maps, and patches around these pixels are selected as convolution kernels. Consequently, the DPDNet does not require a great number of training samples for parameter optimization, and its computational efficiency is greatly enhanced. Extensive experiments have been conducted on five SAR datasets to verify the proposed DPDNet. The experimental results demonstrate that our method outperforms several state-of-the-art methods in change detection results.

* Accepted by IEEE JSTARS

Via

Access Paper or Ask Questions

SSCU-Net: Spatial-Spectral Collaborative Unmixing Network for Hyperspectral Images

Mar 12, 2022

Qi Li, Feng Gao, Junyu Dong, Xinbo Gao, Qian Du

Figure 1 for SSCU-Net: Spatial-Spectral Collaborative Unmixing Network for Hyperspectral Images

Figure 2 for SSCU-Net: Spatial-Spectral Collaborative Unmixing Network for Hyperspectral Images

Figure 3 for SSCU-Net: Spatial-Spectral Collaborative Unmixing Network for Hyperspectral Images

Figure 4 for SSCU-Net: Spatial-Spectral Collaborative Unmixing Network for Hyperspectral Images

Abstract:Linear spectral unmixing is an essential technique in hyperspectral image processing and interpretation. In recent years, deep learning-based approaches have shown great promise in hyperspectral unmixing, in particular, unsupervised unmixing methods based on autoencoder networks are a recent trend. The autoencoder model, which automatically learns low-dimensional representations (abundances) and reconstructs data with their corresponding bases (endmembers), has achieved superior performance in hyperspectral unmixing. In this article, we explore the effective utilization of spatial and spectral information in autoencoder-based unmixing networks. Important findings on the use of spatial and spectral information in the autoencoder framework are discussed. Inspired by these findings, we propose a spatial-spectral collaborative unmixing network, called SSCU-Net, which learns a spatial autoencoder network and a spectral autoencoder network in an end-to-end manner to more effectively improve the unmixing performance. SSCU-Net is a two-stream deep network and shares an alternating architecture, where the two autoencoder networks are efficiently trained in a collaborative way for estimation of endmembers and abundances. Meanwhile, we propose a new spatial autoencoder network by introducing a superpixel segmentation method based on abundance information, which greatly facilitates the employment of spatial information and improves the accuracy of unmixing network. Moreover, extensive ablation studies are carried out to investigate the performance gain of SSCU-Net. Experimental results on both synthetic and real hyperspectral data sets illustrate the effectiveness and competitiveness of the proposed SSCU-Net compared with several state-of-the-art hyperspectral unmixing methods.

* IEEE TGRS 2022

Via

Access Paper or Ask Questions

Change Detection from Synthetic Aperture Radar Images via Graph-Based Knowledge Supplement Network

Feb 09, 2022

Junjie Wang, Feng Gao, Junyu Dong, Shan Zhang, Qian Du

Figure 1 for Change Detection from Synthetic Aperture Radar Images via Graph-Based Knowledge Supplement Network

Figure 2 for Change Detection from Synthetic Aperture Radar Images via Graph-Based Knowledge Supplement Network

Figure 3 for Change Detection from Synthetic Aperture Radar Images via Graph-Based Knowledge Supplement Network

Figure 4 for Change Detection from Synthetic Aperture Radar Images via Graph-Based Knowledge Supplement Network

Abstract:Synthetic aperture radar (SAR) image change detection is a vital yet challenging task in the field of remote sensing image analysis. Most previous works adopt a self-supervised method which uses pseudo-labeled samples to guide subsequent training and testing. However, deep networks commonly require many high-quality samples for parameter optimization. The noise in pseudo-labels inevitably affects the final change detection performance. To solve the problem, we propose a Graph-based Knowledge Supplement Network (GKSNet). To be more specific, we extract discriminative information from the existing labeled dataset as additional knowledge, to suppress the adverse effects of noisy samples to some extent. Afterwards, we design a graph transfer module to distill contextual information attentively from the labeled dataset to the target dataset, which bridges feature correlation between datasets. To validate the proposed method, we conducted extensive experiments on four SAR datasets, which demonstrated the superiority of the proposed GKSNet as compared to several state-of-the-art baselines. Our codes are available at https://github.com/summitgao/SAR_CD_GKSNet.

* Accepted by IEEE JSTARS

Via

Access Paper or Ask Questions

Adaptive DropBlock Enhanced Generative Adversarial Networks for Hyperspectral Image Classification

Jan 22, 2022

Junjie Wang, Feng Gao, Junyu Dong, Qian Du

Figure 1 for Adaptive DropBlock Enhanced Generative Adversarial Networks for Hyperspectral Image Classification

Figure 2 for Adaptive DropBlock Enhanced Generative Adversarial Networks for Hyperspectral Image Classification

Figure 3 for Adaptive DropBlock Enhanced Generative Adversarial Networks for Hyperspectral Image Classification

Figure 4 for Adaptive DropBlock Enhanced Generative Adversarial Networks for Hyperspectral Image Classification

Abstract:In recent years, hyperspectral image (HSI) classification based on generative adversarial networks (GAN) has achieved great progress. GAN-based classification methods can mitigate the limited training sample dilemma to some extent. However, several studies have pointed out that existing GAN-based HSI classification methods are heavily affected by the imbalanced training data problem. The discriminator in GAN always contradicts itself and tries to associate fake labels to the minority-class samples, and thus impair the classification performance. Another critical issue is the mode collapse in GAN-based methods. The generator is only capable of producing samples within a narrow scope of the data space, which severely hinders the advancement of GAN-based HSI classification methods. In this paper, we proposed an Adaptive DropBlock-enhanced Generative Adversarial Networks (ADGAN) for HSI classification. First, to solve the imbalanced training data problem, we adjust the discriminator to be a single classifier, and it will not contradict itself. Second, an adaptive DropBlock (AdapDrop) is proposed as a regularization method employed in the generator and discriminator to alleviate the mode collapse issue. The AdapDrop generated drop masks with adaptive shapes instead of a fixed size region, and it alleviates the limitations of DropBlock in dealing with ground objects with various shapes. Experimental results on three HSI datasets demonstrated that the proposed ADGAN achieved superior performance over state-of-the-art GAN-based methods. Our codes are available at https://github.com/summitgao/HC_ADGAN

* in IEEE Transactions on Geoscience and Remote Sensing, vol. 59, no. 6, pp. 5040-5053, June 2021

Via

Access Paper or Ask Questions

HPRN: Holistic Prior-embedded Relation Network for Spectral Super-Resolution

Dec 29, 2021

Chaoxiong Wu, Jiaojiao Li, Rui Song, Yunsong Li, Qian Du

Figure 1 for HPRN: Holistic Prior-embedded Relation Network for Spectral Super-Resolution

Figure 2 for HPRN: Holistic Prior-embedded Relation Network for Spectral Super-Resolution

Figure 3 for HPRN: Holistic Prior-embedded Relation Network for Spectral Super-Resolution

Figure 4 for HPRN: Holistic Prior-embedded Relation Network for Spectral Super-Resolution

Abstract:Spectral super-resolution (SSR) refers to the hyperspectral image (HSI) recovery from an RGB counterpart. Due to the one-to-many nature of the SSR problem, a single RGB image can be reprojected to many HSIs. The key to tackle this illposed problem is to plug into multi-source prior information such as the natural RGB spatial context-prior, deep feature-prior or inherent HSI statistical-prior, etc., so as to improve the confidence and fidelity of reconstructed spectra. However, most current approaches only consider the general and limited priors in their designing the customized convolutional neural networks (CNNs), which leads to the inability to effectively alleviate the degree of ill-posedness. To address the problematic issues, we propose a novel holistic prior-embedded relation network (HPRN) for SSR. Basically, the core framework is delicately assembled by several multi-residual relation blocks (MRBs) that fully facilitate the transmission and utilization of the low-frequency content prior of RGB signals. Innovatively, the semantic prior of RGB input is introduced to identify category attributes and a semantic-driven spatial relation module (SSRM) is put forward to perform the feature aggregation among the clustered similar characteristics using a semantic-embedded relation matrix. Additionally, we develop a transformer-based channel relation module (TCRM), which breaks the habit of employing scalars as the descriptors of channel-wise relations in the previous deep feature-prior and replaces them with certain vectors, together with Transformerstyle feature interactions, supporting the representations to be more discriminative. In order to maintain the mathematical correlation and spectral consistency between hyperspectral bands, the second-order prior constraints (SOPC) are incorporated into the loss function to guide the HSI reconstruction process.

Via

Access Paper or Ask Questions

Deep Learning for UAV-based Object Detection and Tracking: A Survey

Oct 25, 2021

Xin Wu, Wei Li, Danfeng Hong, Ran Tao, Qian Du

Figure 1 for Deep Learning for UAV-based Object Detection and Tracking: A Survey

Figure 2 for Deep Learning for UAV-based Object Detection and Tracking: A Survey

Figure 3 for Deep Learning for UAV-based Object Detection and Tracking: A Survey

Figure 4 for Deep Learning for UAV-based Object Detection and Tracking: A Survey

Abstract:Owing to effective and flexible data acquisition, unmanned aerial vehicle (UAV) has recently become a hotspot across the fields of computer vision (CV) and remote sensing (RS). Inspired by recent success of deep learning (DL), many advanced object detection and tracking approaches have been widely applied to various UAV-related tasks, such as environmental monitoring, precision agriculture, traffic management. This paper provides a comprehensive survey on the research progress and prospects of DL-based UAV object detection and tracking methods. More specifically, we first outline the challenges, statistics of existing methods, and provide solutions from the perspectives of DL-based models in three research topics: object detection from the image, object detection from the video, and object tracking from the video. Open datasets related to UAV-dominated object detection and tracking are exhausted, and four benchmark datasets are employed for performance evaluation using some state-of-the-art methods. Finally, prospects and considerations for the future work are discussed and summarized. It is expected that this survey can facilitate those researchers who come from remote sensing field with an overview of DL-based UAV object detection and tracking methods, along with some thoughts on their further developments.

* IEEE Geoscience and Remote Sensing Magazine,2021

Via

Access Paper or Ask Questions

Spectral Variability Augmented Sparse Unmixing of Hyperspectral Images

Oct 21, 2021

Ge Zhang, Shaohui Mei, Mingyang Ma, Yan Feng, Qian Du

Figure 1 for Spectral Variability Augmented Sparse Unmixing of Hyperspectral Images

Figure 2 for Spectral Variability Augmented Sparse Unmixing of Hyperspectral Images

Figure 3 for Spectral Variability Augmented Sparse Unmixing of Hyperspectral Images

Figure 4 for Spectral Variability Augmented Sparse Unmixing of Hyperspectral Images

Abstract:Spectral unmixing (SU) expresses the mixed pixels existed in hyperspectral images as the product of endmember and abundance, which has been widely used in hyperspectral imagery analysis. However, the influence of light, acquisition conditions and the inherent properties of materials, results in that the identified endmembers can vary spectrally within a given image (construed as spectral variability). To address this issue, recent methods usually use a priori obtained spectral library to represent multiple characteristic spectra of the same object, but few of them extracted the spectral variability explicitly. In this paper, a spectral variability augmented sparse unmixing model (SVASU) is proposed, in which the spectral variability is extracted for the first time. The variable spectra are divided into two parts of intrinsic spectrum and spectral variability for spectral reconstruction, and modeled synchronously in the SU model adding the regular terms restricting the sparsity of abundance and the generalization of the variability coefficient. It is noted that the spectral variability library and the intrinsic spectral library are all constructed from the In-situ observed image. Experimental results over both synthetic and real-world data sets demonstrate that the augmented decomposition by spectral variability significantly improves the unmixing performance than the decomposition only by spectral library, as well as compared to state-of-the-art algorithms.

Via

Access Paper or Ask Questions

Synthetic Aperture Radar Image Change Detection via Siamese Adaptive Fusion Network

Oct 18, 2021

Yunhao Gao, Feng Gao, Junyu Dong, Qian Du, Heng-Chao Li

Figure 1 for Synthetic Aperture Radar Image Change Detection via Siamese Adaptive Fusion Network

Figure 2 for Synthetic Aperture Radar Image Change Detection via Siamese Adaptive Fusion Network

Figure 3 for Synthetic Aperture Radar Image Change Detection via Siamese Adaptive Fusion Network

Figure 4 for Synthetic Aperture Radar Image Change Detection via Siamese Adaptive Fusion Network

Abstract:Synthetic aperture radar (SAR) image change detection is a critical yet challenging task in the field of remote sensing image analysis. The task is non-trivial due to the following challenges: Firstly, intrinsic speckle noise of SAR images inevitably degrades the neural network because of error gradient accumulation. Furthermore, the correlation among various levels or scales of feature maps is difficult to be achieved through summation or concatenation. Toward this end, we proposed a siamese adaptive fusion network for SAR image change detection. To be more specific, two-branch CNN is utilized to extract high-level semantic features of multitemporal SAR images. Besides, an adaptive fusion module is designed to adaptively combine multiscale responses in convolutional layers. Therefore, the complementary information is exploited, and feature learning in change detection is further improved. Moreover, a correlation layer is designed to further explore the correlation between multitemporal images. Thereafter, robust feature representation is utilized for classification through a fully-connected layer with softmax. Experimental results on four real SAR datasets demonstrate that the proposed method exhibits superior performance against several state-of-the-art methods. Our codes are available at https://github.com/summitgao/SAR_CD_SAFNet.

* This work has been accepted by IEEE JSTARS for publication. Our codes are available at https://github.com/summitgao/SAR_CD_SAFNet

Via

Access Paper or Ask Questions