Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Deyu Meng

InDuDoNet+: A Model-Driven Interpretable Dual Domain Network for Metal Artifact Reduction in CT Images

Dec 23, 2021
Hong Wang, Yuexiang Li, Haimiao Zhang, Deyu Meng, Yefeng Zheng

Figure 1 for InDuDoNet+: A Model-Driven Interpretable Dual Domain Network for Metal Artifact Reduction in CT Images

Figure 2 for InDuDoNet+: A Model-Driven Interpretable Dual Domain Network for Metal Artifact Reduction in CT Images

Figure 3 for InDuDoNet+: A Model-Driven Interpretable Dual Domain Network for Metal Artifact Reduction in CT Images

Figure 4 for InDuDoNet+: A Model-Driven Interpretable Dual Domain Network for Metal Artifact Reduction in CT Images

During the computed tomography (CT) imaging process, metallic implants within patients always cause harmful artifacts, which adversely degrade the visual quality of reconstructed CT images and negatively affect the subsequent clinical diagnosis. For the metal artifact reduction (MAR) task, current deep learning based methods have achieved promising performance. However, most of them share two main common limitations: 1) the CT physical imaging geometry constraint is not comprehensively incorporated into deep network structures; 2) the entire framework has weak interpretability for the specific MAR task; hence, the role of every network module is difficult to be evaluated. To alleviate these issues, in the paper, we construct a novel interpretable dual domain network, termed InDuDoNet+, into which CT imaging process is finely embedded. Concretely, we derive a joint spatial and Radon domain reconstruction model and propose an optimization algorithm with only simple operators for solving it. By unfolding the iterative steps involved in the proposed algorithm into the corresponding network modules, we easily build the InDuDoNet+ with clear interpretability. Furthermore, we analyze the CT values among different tissues, and merge the prior observations into a prior network for our InDuDoNet+, which significantly improve its generalization performance. Comprehensive experiments on synthesized data and clinical data substantiate the superiority of the proposed methods as well as the superior generalization performance beyond the current state-of-the-art (SOTA) MAR methods. Code is available at \url{https://github.com/hongwang01/InDuDoNet_plus}.

Via

Access Paper or Ask Questions

Label Hierarchy Transition: Modeling Class Hierarchies to Enhance Deep Classifiers

Dec 04, 2021
Renzhen Wang, De cai, Kaiwen Xiao, Xixi Jia, Xiao Han, Deyu Meng

Figure 1 for Label Hierarchy Transition: Modeling Class Hierarchies to Enhance Deep Classifiers

Figure 2 for Label Hierarchy Transition: Modeling Class Hierarchies to Enhance Deep Classifiers

Figure 3 for Label Hierarchy Transition: Modeling Class Hierarchies to Enhance Deep Classifiers

Figure 4 for Label Hierarchy Transition: Modeling Class Hierarchies to Enhance Deep Classifiers

Hierarchical classification aims to sort the object into a hierarchy of categories. For example, a bird can be categorized according to a three-level hierarchy of order, family, and species. Existing methods commonly address hierarchical classification by decoupling it into several multi-class classification tasks. However, such a multi-task learning strategy fails to fully exploit the correlation among various categories across different hierarchies. In this paper, we propose Label Hierarchy Transition, a unified probabilistic framework based on deep learning, to address hierarchical classification. Specifically, we explicitly learn the label hierarchy transition matrices, whose column vectors represent the conditional label distributions of classes between two adjacent hierarchies and could be capable of encoding the correlation embedded in class hierarchies. We further propose a confusion loss, which encourages the classification network to learn the correlation across different label hierarchies during training. The proposed framework can be adapted to any existing deep network with only minor modifications. We experiment with three public benchmark datasets with various class hierarchies, and the results demonstrate the superiority of our approach beyond the prior arts. Source code will be made publicly available.

Via

Access Paper or Ask Questions

Infrared Small-Dim Target Detection with Transformer under Complex Backgrounds

Sep 29, 2021
Fangcen Liu, Chenqiang Gao, Fang Chen, Deyu Meng, Wangmeng Zuo, Xinbo Gao

Figure 1 for Infrared Small-Dim Target Detection with Transformer under Complex Backgrounds

Figure 2 for Infrared Small-Dim Target Detection with Transformer under Complex Backgrounds

Figure 3 for Infrared Small-Dim Target Detection with Transformer under Complex Backgrounds

Figure 4 for Infrared Small-Dim Target Detection with Transformer under Complex Backgrounds

The infrared small-dim target detection is one of the key techniques in the infrared search and tracking system. Since the local regions which similar to infrared small-dim targets spread over the whole background, exploring the interaction information amongst image features in large-range dependencies to mine the difference between the target and background is crucial for robust detection. However, existing deep learning-based methods are limited by the locality of convolutional neural networks, which impairs the ability to capture large-range dependencies. To this end, we propose a new infrared small-dim target detection method with the transformer. We adopt the self-attention mechanism of the transformer to learn the interaction information of image features in a larger range. Additionally, we design a feature enhancement module to learn more features of small-dim targets. After that, we adopt a decoder with the U-Net-like skip connection operation to get the detection result. Extensive experiments on two public datasets show the obvious superiority of the proposed method over state-of-the-art methods.

Via

Access Paper or Ask Questions

InDuDoNet: An Interpretable Dual Domain Network for CT Metal Artifact Reduction

Sep 11, 2021
Hong Wang, Yuexiang Li, Haimiao Zhang, Jiawei Chen, Kai Ma, Deyu Meng, Yefeng Zheng

Figure 1 for InDuDoNet: An Interpretable Dual Domain Network for CT Metal Artifact Reduction

Figure 2 for InDuDoNet: An Interpretable Dual Domain Network for CT Metal Artifact Reduction

Figure 3 for InDuDoNet: An Interpretable Dual Domain Network for CT Metal Artifact Reduction

Figure 4 for InDuDoNet: An Interpretable Dual Domain Network for CT Metal Artifact Reduction

For the task of metal artifact reduction (MAR), although deep learning (DL)-based methods have achieved promising performances, most of them suffer from two problems: 1) the CT imaging geometry constraint is not fully embedded into the network during training, leaving room for further performance improvement; 2) the model interpretability is lack of sufficient consideration. Against these issues, we propose a novel interpretable dual domain network, termed as InDuDoNet, which combines the advantages of model-driven and data-driven methodologies. Specifically, we build a joint spatial and Radon domain reconstruction model and utilize the proximal gradient technique to design an iterative algorithm for solving it. The optimization algorithm only consists of simple computational operators, which facilitate us to correspondingly unfold iterative steps into network modules and thus improve the interpretablility of the framework. Extensive experiments on synthesized and clinical data show the superiority of our InDuDoNet. Code is available in \url{https://github.com/hongwang01/InDuDoNet}.%method on the tasks of MAR and downstream multi-class pelvic fracture segmentation.

* International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) 2021

Via

Access Paper or Ask Questions

Unsupervised Local Discrimination for Medical Images

Aug 21, 2021
Huai Chen, Renzhen Wang, Jieyu Li, Qing Peng, Deyu Meng, Lisheng Wang

Figure 1 for Unsupervised Local Discrimination for Medical Images

Figure 2 for Unsupervised Local Discrimination for Medical Images

Figure 3 for Unsupervised Local Discrimination for Medical Images

Figure 4 for Unsupervised Local Discrimination for Medical Images

Contrastive representation learning is an effective unsupervised method to alleviate the demand for expensive annotated data in medical image processing. Recent work mainly based on instance-wise discrimination to learn global features, while neglect local details, which limit their application in processing tiny anatomical structures, tissues and lesions. Therefore, we aim to propose a universal local discrmination framework to learn local discriminative features to effectively initialize medical models, meanwhile, we systematacially investigate its practical medical applications. Specifically, based on the common property of intra-modality structure similarity, i.e. similar structures are shared among the same modality images, a systematic local feature learning framework is proposed. Instead of making instance-wise comparisons based on global embedding, our method makes pixel-wise embedding and focuses on measuring similarity among patches and regions. The finer contrastive rule makes the learnt representation more generalized for segmentation tasks and outperform extensive state-of-the-art methods by wining 11 out of all 12 downstream tasks in color fundus and chest X-ray. Furthermore, based on the property of inter-modality shape similarity, i.e. structures may share similar shape although in different medical modalities, we joint across-modality shape prior into region discrimination to realize unsupervised segmentation. It shows the feaibility of segmenting target only based on shape description from other modalities and inner pattern similarity provided by region discrimination. Finally, we enhance the center-sensitive ability of patch discrimination by introducing center-sensitive averaging to realize one-shot landmark localization, this is an effective application for patch discrimination.

* 16 pages, 9 figures

Via

Access Paper or Ask Questions

Infrared Small Target Detection Using Multi-patch Attention Network

Aug 13, 2021
Fang Chen, Chenqiang Gao, Fangcen Liu, Yue Zhao, Yuxi Zhou, Deyu Meng, Wangmeng Zuo

Figure 1 for Infrared Small Target Detection Using Multi-patch Attention Network

Figure 2 for Infrared Small Target Detection Using Multi-patch Attention Network

Figure 3 for Infrared Small Target Detection Using Multi-patch Attention Network

Figure 4 for Infrared Small Target Detection Using Multi-patch Attention Network

Infrared small target detection plays an important role in the infrared search and tracking applications. In recent years, deep learning techniques were introduced to this task and achieved noteworthy effects. Following general object segmentation methods, existing deep learning methods usually processed the image from the global view. However, the imaging locality of small targets and extreme class-imbalance between the target and background pixels were not well-considered by these deep learning methods, which causes the low-efficiency on training and high-dependence on numerous data. A multi-patch attention network (MANet) is proposed in this paper to detect small targets by jointly considering the global and local properties of infrared small target images. From the global view, a supervised attention module trained by the small target spread map is proposed to suppress most background pixels irrelevant with small target features. From the local view, local patches are split from global features and share the same convolution weights with each other in a patch net. By synthesizing the global and local properties, the data-driven framework proposed in this paper has fused multi-scale features for small target detection. Extensive synthetic and real data experiments show that the proposed method achieves the state-of-the-art performance compared with existing both conventional and deep learning methods.

* 11 pages, 7 figures

Via

Access Paper or Ask Questions

Fourier Series Expansion Based Filter Parametrization for Equivariant Convolutions

Jul 30, 2021
Qi Xie, Qian Zhao, Zongben Xu, Deyu Meng

Figure 1 for Fourier Series Expansion Based Filter Parametrization for Equivariant Convolutions

Figure 2 for Fourier Series Expansion Based Filter Parametrization for Equivariant Convolutions

Figure 3 for Fourier Series Expansion Based Filter Parametrization for Equivariant Convolutions

Figure 4 for Fourier Series Expansion Based Filter Parametrization for Equivariant Convolutions

It has been shown that equivariant convolution is very helpful for many types of computer vision tasks. Recently, the 2D filter parametrization technique plays an important role when designing equivariant convolutions. However, the current filter parametrization method still has its evident drawbacks, where the most critical one lies in the accuracy problem of filter representation. Against this issue, in this paper we modify the classical Fourier series expansion for 2D filters, and propose a new set of atomic basis functions for filter parametrization. The proposed filter parametrization method not only finely represents 2D filters with zero error when the filter is not rotated, but also substantially alleviates the fence-effect-caused quality degradation when the filter is rotated. Accordingly, we construct a new equivariant convolution method based on the proposed filter parametrization method, named F-Conv. We prove that the equivariance of the proposed F-Conv is exact in the continuous domain, which becomes approximate only after discretization. Extensive experiments show the superiority of the proposed method. Particularly, we adopt rotation equivariant convolution methods to image super-resolution task, and F-Conv evidently outperforms previous filter parametrization based method in this task, reflecting its intrinsic capability of faithfully preserving rotation symmetries in local image features.

* 27 pages, 19 figures

Via

Access Paper or Ask Questions

RCDNet: An Interpretable Rain Convolutional Dictionary Network for Single Image Deraining

Jul 14, 2021
Hong Wang, Qi Xie, Qian Zhao, Yong Liang, Deyu Meng

Figure 1 for RCDNet: An Interpretable Rain Convolutional Dictionary Network for Single Image Deraining

Figure 2 for RCDNet: An Interpretable Rain Convolutional Dictionary Network for Single Image Deraining

Figure 3 for RCDNet: An Interpretable Rain Convolutional Dictionary Network for Single Image Deraining

Figure 4 for RCDNet: An Interpretable Rain Convolutional Dictionary Network for Single Image Deraining

As a common weather, rain streaks adversely degrade the image quality. Hence, removing rains from an image has become an important issue in the field. To handle such an ill-posed single image deraining task, in this paper, we specifically build a novel deep architecture, called rain convolutional dictionary network (RCDNet), which embeds the intrinsic priors of rain streaks and has clear interpretability. In specific, we first establish a RCD model for representing rain streaks and utilize the proximal gradient descent technique to design an iterative algorithm only containing simple operators for solving the model. By unfolding it, we then build the RCDNet in which every network module has clear physical meanings and corresponds to each operation involved in the algorithm. This good interpretability greatly facilitates an easy visualization and analysis on what happens inside the network and why it works well in inference process. Moreover, taking into account the domain gap issue in real scenarios, we further design a novel dynamic RCDNet, where the rain kernels can be dynamically inferred corresponding to input rainy images and then help shrink the space for rain layer estimation with few rain maps so as to ensure a fine generalization performance in the inconsistent scenarios of rain types between training and testing data. By end-to-end training such an interpretable network, all involved rain kernels and proximal operators can be automatically extracted, faithfully characterizing the features of both rain and clean background layers, and thus naturally lead to better deraining performance. Comprehensive experiments substantiate the superiority of our method, especially on its well generality to diverse testing scenarios and good interpretability for all its modules. Code is available in \emph{\url{https://github.com/hongwang01/DRCDNet}}.

Via

Access Paper or Ask Questions

Learning an Explicit Hyperparameter Prediction Policy Conditioned on Tasks

Jul 06, 2021
Jun Shu, Deyu Meng, Zongben Xu

Figure 1 for Learning an Explicit Hyperparameter Prediction Policy Conditioned on Tasks

Figure 2 for Learning an Explicit Hyperparameter Prediction Policy Conditioned on Tasks

Figure 3 for Learning an Explicit Hyperparameter Prediction Policy Conditioned on Tasks

Figure 4 for Learning an Explicit Hyperparameter Prediction Policy Conditioned on Tasks

Meta learning has attracted much attention recently in machine learning community. Contrary to conventional machine learning aiming to learn inherent prediction rules to predict labels for new query data, meta learning aims to learn the learning methodology for machine learning from observed tasks, so as to generalize to new query tasks by leveraging the meta-learned learning methodology. In this study, we interpret such learning methodology as learning an explicit hyperparameter prediction policy shared by all training tasks. Specifically, this policy is represented as a parameterized function called meta-learner, mapping from a training/test task to its suitable hyperparameter setting, extracted from a pre-specified function set called meta learning machine. Such setting guarantees that the meta-learned learning methodology is able to flexibly fit diverse query tasks, instead of only obtaining fixed hyperparameters by many current meta learning methods, with less adaptability to query task's variations. Such understanding of meta learning also makes it easily succeed from traditional learning theory for analyzing its generalization bounds with general losses/tasks/models. The theory naturally leads to some feasible controlling strategies for ameliorating the quality of the extracted meta-learner, verified to be able to finely ameliorate its generalization capability in some typical meta learning applications, including few-shot regression, few-shot classification and domain generalization.

* 59 pages. arXiv admin note: text overlap with arXiv:1904.03758 by other authors

Via

Access Paper or Ask Questions

Unsupervised Single Image Super-resolution Under Complex Noise

Jul 02, 2021
Zongsheng Yue, Qian Zhao, Jianwen Xie, Lei Zhang, Deyu Meng

Figure 1 for Unsupervised Single Image Super-resolution Under Complex Noise

Figure 2 for Unsupervised Single Image Super-resolution Under Complex Noise

Figure 3 for Unsupervised Single Image Super-resolution Under Complex Noise

Figure 4 for Unsupervised Single Image Super-resolution Under Complex Noise

While the researches on single image super-resolution (SISR), especially equipped with deep neural networks (DNNs), have achieved tremendous successes recently, they still suffer from two major limitations. Firstly, the real image degradation is usually unknown and highly variant from one to another, making it extremely hard to train a single model to handle the general SISR task. Secondly, most of current methods mainly focus on the downsampling process of the degradation, but ignore or underestimate the inevitable noise contamination. For example, the commonly-used independent and identically distributed (i.i.d.) Gaussian noise distribution always largely deviates from the real image noise (e.g., camera sensor noise), which limits their performance in real scenarios. To address these issues, this paper proposes a model-based unsupervised SISR method to deal with the general SISR task with unknown degradations. Instead of the traditional i.i.d. Gaussian noise assumption, a novel patch-based non-i.i.d. noise modeling method is proposed to fit the complex real noise. Besides, a deep generator parameterized by a DNN is used to map the latent variable to the high-resolution image, and the conventional hyper-Laplacian prior is also elaborately embedded into such generator to further constrain the image gradients. Finally, a Monte Carlo EM algorithm is designed to solve our model, which provides a general inference framework to update the image generator both w.r.t. the latent variable and the network parameters. Comprehensive experiments demonstrate that the proposed method can evidently surpass the current state of the art (SotA) method (about 1dB PSNR) not only with a slighter model (0.34M vs. 2.40M) but also faster speed.

Via

Access Paper or Ask Questions