Millimeter-wave (mmW) radar is widely applied to advanced autopilot assistance systems. However, its small antenna aperture causes a low imaging resolution. In this paper, a new distributed mmW radar system is designed to solve this problem. It forms a large sparse virtual planar array to enlarge the aperture, using multiple-input and multiple-output (MIMO) processing. However, in this system, traditional imaging methods cannot apply to the sparse array. Therefore, we also propose a 3D super-resolution imaging method specifically for this system in this paper. The proposed method consists of three steps: (1) using range FFT to get range imaging, (2) using 2D adaptive diagonal loading iterative adaptive approach (ADL-IAA) to acquire 2D super-resolution imaging, which can satisfy this sparsity under single-measurement, (3) using constant false alarm (CFAR) processing to gain final 3D super-resolution imaging. The simulation results show the proposed method can significantly improve imaging resolution under the sparse array and single-measurement.
Inevitable interferences exist for the SAR system, adversely affecting the imaging quality. However, current analysis and suppression methods mainly focus on the far-field situation. Due to different sources and characteristics of interferences, they are not applicable in the near field. To bridge this gap, in the first time, analysis and the suppression method of interferences in near-field SAR are presented in this work. We find that echoes from both the nadir points and the antenna coupling are the main causes, which have the constant-time-delay feature. To characterize this, we further establish an analytical model. It reveals that their patterns in 1D, 2D and 3D imaging results are all comb-like, while those of targets are point-like. Utilizing these features, a suppression method in image domain is proposed based on low-rank reconstruction. Measured data are used to validate the correctness of our analysis and the effectiveness of the suppression method.
Images of near-field SAR contains spatial-variant sidelobes and clutter, subduing the image quality. Current image restoration methods are only suitable for small observation angle, due to their assumption of 2D spatial-invariant degradation operation. This limits its potential for large-scale objects imaging, like the aircraft. To ease this restriction, in this work an image restoration method based on the 2D spatial-variant deconvolution is proposed. First, the image degradation is seen as a complex convolution process with 2D spatial-variant operations. Then, to restore the image, the process of deconvolution is performed by cyclic coordinate descent algorithm. Experiments on simulation and measured data validate the effectiveness and superiority of the proposed method. Compared with current methods, higher precision estimation of the targets' amplitude and position is obtained.
With the booming of Convolutional Neural Networks (CNNs), CNNs such as VGG-16 and ResNet-50 widely serve as backbone in SAR ship detection. However, CNN based backbone is hard to model long-range dependencies, and causes the lack of enough high-quality semantic information in feature maps of shallow layers, which leads to poor detection performance in complicated background and small-sized ships cases. To address these problems, we propose a SAR ship detection method based on Swin Transformer and Feature Enhancement Feature Pyramid Network (FEFPN). Swin Transformer serves as backbone to model long-range dependencies and generates hierarchical features maps. FEFPN is proposed to further improve the quality of feature maps by gradually enhancing the semantic information of feature maps at all levels, especially feature maps in shallow layers. Experiments conducted on SAR ship detection dataset (SSDD) reveal the advantage of our proposed methods.
Interferometric Synthetic Aperture Radar (InSAR) Imaging methods are usually based on algorithms of match-filtering type, without considering the scene's characteristic, which causes limited imaging quality. Besides, post-processing steps are inevitable, like image registration, flat-earth phase removing and phase noise filtering. To solve these problems, we propose a new InSAR imaging method. First, to enhance the imaging quality, we propose a new imaging framework base on 2D sparse regularization, where the characteristic of scene is embedded. Second, to avoid the post processing steps, we establish a new forward observation process, where the back-projection imaging method is embedded. Third, a forward and backward iterative solution method is proposed based on proximal gradient descent algorithm. Experiments on simulated and measured data reveal the effectiveness of the proposed method. Compared with the conventional method, higher quality interferogram can be obtained directly from raw echoes without post-processing. Besides, in the under-sampling situation, it's also applicable.
Single image desnowing is a common yet challenging task. The complex snow degradations and diverse degradation scales demand strong representation ability. In order for the desnowing network to see various snow degradations and model the context interaction of local details and global information, we propose a powerful architecture dubbed as SnowFormer. First, it performs Scale-aware Feature Aggregation in the encoder to capture rich snow information of various degradations. Second, in order to tackle with large-scale degradation, it uses a novel Context Interaction Transformer Block in the decoder, which conducts context interaction of local details and global information from previous scale-aware feature aggregation in global context interaction. And the introduction of local context interaction improves recovery of scene details. Third, we devise a Heterogeneous Feature Projection Head which progressively fuse features from both the encoder and decoder and project the refined feature into the clean image. Extensive experiments demonstrate that the proposed SnowFormer achieves significant improvements over other SOTA methods. Compared with SOTA single image desnowing method HDCW-Net, it boosts the PSNR metric by 9.2dB on the CSD testset. Moreover, it also achieves a 5.13dB increase in PSNR compared with general image restoration architecture NAFNet, which verifies the strong representation ability of our SnowFormer for snow removal task. The code is released in \url{https://github.com/Ephemeral182/SnowFormer}.
Transformer has been widely used in histopathology whole slide image (WSI) classification for the purpose of tumor grading, prognosis analysis, etc. However, the design of token-wise self-attention and positional embedding strategy in the common Transformer limits the effectiveness and efficiency in the application to gigapixel histopathology images. In this paper, we propose a kernel attention Transformer (KAT) for histopathology WSI classification. The information transmission of the tokens is achieved by cross-attention between the tokens and a set of kernels related to a set of positional anchors on the WSI. Compared to the common Transformer structure, the proposed KAT can better describe the hierarchical context information of the local regions of the WSI and meanwhile maintains a lower computational complexity. The proposed method was evaluated on a gastric dataset with 2040 WSIs and an endometrial dataset with 2560 WSIs, and was compared with 6 state-of-the-art methods. The experimental results have demonstrated the proposed KAT is effective and efficient in the task of histopathology WSI classification and is superior to the state-of-the-art methods. The code is available at https://github.com/zhengyushan/kat.
Local representation learning has been a key challenge to promote the performance of the histopathological whole slide images analysis. The previous representation learning methods followed the supervised learning paradigm. However, manual annotation for large-scale WSIs is time-consuming and labor-intensive. Hence, the self-supervised contrastive learning has recently attracted intensive attention. The present contrastive learning methods treat each sample as a single class, which suffers from class collision problems, especially in the domain of histopathology image analysis. In this paper, we proposed a novel contrastive representation learning framework named Lesion-Aware Contrastive Learning (LACL) for histopathology whole slide image analysis. We built a lesion queue based on the memory bank structure to store the representations of different classes of WSIs, which allowed the contrastive model to selectively define the negative pairs during the training. Moreover, We designed a queue refinement strategy to purify the representations stored in the lesion queue. The experimental results demonstrate that LACL achieves the best performance in histopathology image representation learning on different datasets, and outperforms state-of-the-art methods under different WSI classification benchmarks. The code is available at https://github.com/junl21/lacl.
Computer-aided diagnosis (CAD) can help pathologists improve diagnostic accuracy together with consistency and repeatability for cancers. However, the CAD models trained with the histopathological images only from a single center (hospital) generally suffer from the generalization problem due to the straining inconsistencies among different centers. In this work, we propose a pseudo-data based self-supervised federated learning (FL) framework, named SSL-FT-BT, to improve both the diagnostic accuracy and generalization of CAD models. Specifically, the pseudo histopathological images are generated from each center, which contains inherent and specific properties corresponding to the real images in this center, but does not include the privacy information. These pseudo images are then shared in the central server for self-supervised learning (SSL). A multi-task SSL is then designed to fully learn both the center-specific information and common inherent representation according to the data characteristics. Moreover, a novel Barlow Twins based FL (FL-BT) algorithm is proposed to improve the local training for the CAD model in each center by conducting contrastive learning, which benefits the optimization of the global model in the FL procedure. The experimental results on three public histopathological image datasets indicate the effectiveness of the proposed SSL-FL-BT on both diagnostic accuracy and generalization.
Convolutional neural networks (CNNs) and their variants have been successfully applied to the electroencephalogram (EEG) based motor imagery (MI) decoding task. However, these CNN-based algorithms generally have limitations in perceiving global temporal dependencies of EEG signals. Besides, they also ignore the diverse contributions of different EEG channels to the classification task. To address such issues, a novel channel attention based MLP-Mixer network (CAMLP-Net) is proposed for EEG-based MI decoding. Specifically, the MLP-based architecture is applied in this network to capture the temporal and spatial information. The attention mechanism is further embedded into MLP-Mixer to adaptively exploit the importance of different EEG channels. Therefore, the proposed CAMLP-Net can effectively learn more global temporal and spatial information. The experimental results on the newly built MI-2 dataset indicate that our proposed CAMLP-Net achieves superior classification performance over all the compared algorithms.