Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Murray Loew

CaraNet: Context Axial Reverse Attention Network for Segmentation of Small Medical Objects

Jan 31, 2023

Ange Lou, Shuyue Guan, Murray Loew

Abstract:Segmenting medical images accurately and reliably is important for disease diagnosis and treatment. It is a challenging task because of the wide variety of objects' sizes, shapes, and scanning modalities. Recently, many convolutional neural networks (CNN) have been designed for segmentation tasks and achieved great success. Few studies, however, have fully considered the sizes of objects, and thus most demonstrate poor performance for small objects segmentation. This can have a significant impact on the early detection of diseases. This paper proposes a Context Axial Reverse Attention Network (CaraNet) to improve the segmentation performance on small objects compared with several recent state-of-the-art models. CaraNet applies axial reserve attention (ARA) and channel-wise feature pyramid (CFP) module to dig feature information of small medical object. And we evaluate our model by six different measurement metrics. We test our CaraNet on brain tumor (BraTS 2018) and polyp (Kvasir-SEG, CVC-ColonDB, CVC-ClinicDB, CVC-300, and ETIS-LaribPolypDB) segmentation datasets. Our CaraNet achieves the top-rank mean Dice segmentation accuracy, and results show a distinct advantage of CaraNet in the segmentation of small medical objects.

* arXiv admin note: text overlap with arXiv:2108.07368

Via

Access Paper or Ask Questions

A Sneak Attack on Segmentation of Medical Images Using Deep Neural Network Classifiers

Jan 28, 2022

Shuyue Guan, Murray Loew

Figure 1 for A Sneak Attack on Segmentation of Medical Images Using Deep Neural Network Classifiers

Figure 2 for A Sneak Attack on Segmentation of Medical Images Using Deep Neural Network Classifiers

Figure 3 for A Sneak Attack on Segmentation of Medical Images Using Deep Neural Network Classifiers

Figure 4 for A Sneak Attack on Segmentation of Medical Images Using Deep Neural Network Classifiers

Abstract:Instead of using current deep-learning segmentation models (like the UNet and variants), we approach the segmentation problem using trained Convolutional Neural Network (CNN) classifiers, which automatically extract important features from images for classification. Those extracted features can be visualized and formed into heatmaps using Gradient-weighted Class Activation Mapping (Grad-CAM). This study tested whether the heatmaps could be used to segment the classified targets. We also proposed an evaluation method for the heatmaps; that is, to re-train the CNN classifier using images filtered by heatmaps and examine its performance. We used the mean-Dice coefficient to evaluate segmentation results. Results from our experiments show that heatmaps can locate and segment partial tumor areas. But use of only the heatmaps from CNN classifiers may not be an optimal approach for segmentation. We have verified that the predictions of CNN classifiers mainly depend on tumor areas, and dark regions in Grad-CAM's heatmaps also contribute to classification.

* 8 pages, 10 figures. Accepted by IEEE AIPR 2021 (Oral)

Via

Access Paper or Ask Questions

A Teacher-Student Framework with Fourier Augmentation for COVID-19 Infection Segmentation in CT Images

Oct 13, 2021

Han Chen, Yifan Jiang, Hanseok Ko, Murray Loew

Figure 1 for A Teacher-Student Framework with Fourier Augmentation for COVID-19 Infection Segmentation in CT Images

Figure 2 for A Teacher-Student Framework with Fourier Augmentation for COVID-19 Infection Segmentation in CT Images

Figure 3 for A Teacher-Student Framework with Fourier Augmentation for COVID-19 Infection Segmentation in CT Images

Figure 4 for A Teacher-Student Framework with Fourier Augmentation for COVID-19 Infection Segmentation in CT Images

Abstract:Automatic segmentation of infected regions in computed tomography (CT) images is necessary for the initial diagnosis of COVID-19. Deep-learning-based methods have the potential to automate this task but require a large amount of data with pixel-level annotations. Training a deep network with annotated lung cancer CT images, which are easier to obtain, can alleviate this problem to some extent. However, this approach may suffer from a reduction in performance when applied to unseen COVID-19 images during the testing phase due to the domain shift. In this paper, we propose a novel unsupervised method for COVID-19 infection segmentation that aims to learn the domain-invariant features from lung cancer and COVID-19 images to improve the generalization ability of the segmentation network for use with COVID-19 CT images. To overcome the intensity shift, our method first transforms annotated lung cancer data into the style of unlabeled COVID-19 data using an effective augmentation approach via a Fourier transform. Furthermore, to reduce the distribution shift, we design a teacher-student network to learn rotation-invariant features for segmentation. Experiments demonstrate that even without getting access to the annotations of COVID-19 CT during training, the proposed network can achieve a state-of-the-art segmentation performance on COVID-19 images.

Via

Access Paper or Ask Questions

A Novel Intrinsic Measure of Data Separability

Sep 11, 2021

Shuyue Guan, Murray Loew

Figure 1 for A Novel Intrinsic Measure of Data Separability

Figure 2 for A Novel Intrinsic Measure of Data Separability

Figure 3 for A Novel Intrinsic Measure of Data Separability

Figure 4 for A Novel Intrinsic Measure of Data Separability

Abstract:In machine learning, the performance of a classifier depends on both the classifier model and the separability/complexity of datasets. To quantitatively measure the separability of datasets, we create an intrinsic measure -- the Distance-based Separability Index (DSI), which is independent of the classifier model. We consider the situation in which different classes of data are mixed in the same distribution to be the most difficult for classifiers to separate. We then formally show that the DSI can indicate whether the distributions of datasets are identical for any dimensionality. And we verify the DSI to be an effective separability measure by comparing to several state-of-the-art separability/complexity measures using synthetic and real datasets. Having demonstrated the DSI's ability to compare distributions of samples, we also discuss some of its other promising applications, such as measuring the performance of generative adversarial networks (GANs) and evaluating the results of clustering methods.

* 16 pages, 12 figures. arXiv admin note: substantial text overlap with arXiv:2005.13120

Via

Access Paper or Ask Questions

A Distance-based Separability Measure for Internal Cluster Validation

Jun 17, 2021

Shuyue Guan, Murray Loew

Figure 1 for A Distance-based Separability Measure for Internal Cluster Validation

Figure 2 for A Distance-based Separability Measure for Internal Cluster Validation

Figure 3 for A Distance-based Separability Measure for Internal Cluster Validation

Figure 4 for A Distance-based Separability Measure for Internal Cluster Validation

Abstract:To evaluate clustering results is a significant part of cluster analysis. Since there are no true class labels for clustering in typical unsupervised learning, many internal cluster validity indices (CVIs), which use predicted labels and data, have been created. Without true labels, to design an effective CVI is as difficult as to create a clustering method. And it is crucial to have more CVIs because there are no universal CVIs that can be used to measure all datasets and no specific methods of selecting a proper CVI for clusters without true labels. Therefore, to apply a variety of CVIs to evaluate clustering results is necessary. In this paper, we propose a novel internal CVI -- the Distance-based Separability Index (DSI), based on a data separability measure. We compared the DSI with eight internal CVIs including studies from early Dunn (1974) to most recent CVDD (2019) and an external CVI as ground truth, by using clustering results of five clustering algorithms on 12 real and 97 synthetic datasets. Results show DSI is an effective, unique, and competitive CVI to other compared CVIs. We also summarized the general process to evaluate CVIs and created the rank-difference metric for comparison of CVIs' results.

* It is an extended version of the paper: arXiv:2009.01328

Via

Access Paper or Ask Questions

CFPNet-M: A Light-Weight Encoder-Decoder Based Network for Multimodal Biomedical Image Real-Time Segmentation

May 30, 2021

Ange Lou, Shuyue Guan, Murray Loew

Figure 1 for CFPNet-M: A Light-Weight Encoder-Decoder Based Network for Multimodal Biomedical Image Real-Time Segmentation

Figure 2 for CFPNet-M: A Light-Weight Encoder-Decoder Based Network for Multimodal Biomedical Image Real-Time Segmentation

Figure 3 for CFPNet-M: A Light-Weight Encoder-Decoder Based Network for Multimodal Biomedical Image Real-Time Segmentation

Figure 4 for CFPNet-M: A Light-Weight Encoder-Decoder Based Network for Multimodal Biomedical Image Real-Time Segmentation

Abstract:Currently, developments of deep learning techniques are providing instrumental to identify, classify, and quantify patterns in medical images. Segmentation is one of the important applications in medical image analysis. In this regard, U-Net is the predominant approach to medical image segmentation tasks. However, we found that those U-Net based models have limitations in several aspects, for example, millions of parameters in the U-Net consuming considerable computation resource and memory, lack of global information, and missing some tough objects. Therefore, we applied two modifications to improve the U-Net model: 1) designed and added the dilated channel-wise CNN module, 2) simplified the U shape network. Based on these two modifications, we proposed a novel light-weight architecture -- Channel-wise Feature Pyramid Network for Medicine (CFPNet-M). To evaluate our method, we selected five datasets with different modalities: thermography, electron microscopy, endoscopy, dermoscopy, and digital retinal images. And we compared its performance with several models having different parameter scales. This paper also involves our previous studies of DC-UNet and some commonly used light-weight neural networks. We applied the Tanimoto similarity instead of the Jaccard index for gray-level image measurements. By comparison, CFPNet-M achieves comparable segmentation results on all five medical datasets with only 0.65 million parameters, which is about 2% of U-Net, and 8.8 MB memory. Meanwhile, the inference speed can reach 80 FPS on a single RTX 2070Ti GPU with the 256 by 192 pixels input size.

Via

Access Paper or Ask Questions

CFPNet: Channel-wise Feature Pyramid for Real-Time Semantic Segmentation

Mar 22, 2021

Ange Lou, Murray Loew

Figure 1 for CFPNet: Channel-wise Feature Pyramid for Real-Time Semantic Segmentation

Figure 2 for CFPNet: Channel-wise Feature Pyramid for Real-Time Semantic Segmentation

Figure 3 for CFPNet: Channel-wise Feature Pyramid for Real-Time Semantic Segmentation

Figure 4 for CFPNet: Channel-wise Feature Pyramid for Real-Time Semantic Segmentation

Abstract:Real-time semantic segmentation is playing a more important role in computer vision, due to the growing demand for mobile devices and autonomous driving. Therefore, it is very important to achieve a good trade-off among performance, model size and inference speed. In this paper, we propose a Channel-wise Feature Pyramid (CFP) module to balance those factors. Based on the CFP module, we built CFPNet for real-time semantic segmentation which applied a series of dilated convolution channels to extract effective features. Experiments on Cityscapes and CamVid datasets show that the proposed CFPNet achieves an effective combination of those factors. For the Cityscapes test dataset, CFPNet achieves 70.1% class-wise mIoU with only 0.55 million parameters and 2.5 MB memory. The inference speed can reach 30 FPS on a single RTX 2080Ti GPU with a 1024x2048-pixel image.

Via

Access Paper or Ask Questions

Understanding the Ability of Deep Neural Networks to Count Connected Components in Images

Jan 05, 2021

Shuyue Guan, Murray Loew

Figure 1 for Understanding the Ability of Deep Neural Networks to Count Connected Components in Images

Figure 2 for Understanding the Ability of Deep Neural Networks to Count Connected Components in Images

Figure 3 for Understanding the Ability of Deep Neural Networks to Count Connected Components in Images

Figure 4 for Understanding the Ability of Deep Neural Networks to Count Connected Components in Images

Abstract:Humans can count very fast by subitizing, but slow substantially as the number of objects increases. Previous studies have shown a trained deep neural network (DNN) detector can count the number of objects in an amount of time that increases slowly with the number of objects. Such a phenomenon suggests the subitizing ability of DNNs, and unlike humans, it works equally well for large numbers. Many existing studies have successfully applied DNNs to object counting, but few studies have studied the subitizing ability of DNNs and its interpretation. In this paper, we found DNNs do not have the ability to generally count connected components. We provided experiments to support our conclusions and explanations to understand the results and phenomena of these experiments. We proposed three ML-learnable characteristics to verify learnable problems for ML models, such as DNNs, and explain why DNNs work for specific counting problems but cannot generally count connected components.

* 7 pages, 12 figures. Accepted by IEEE AIPR 2020 (Oral)

Via

Access Paper or Ask Questions

Segmentation of Infrared Breast Images Using MultiResUnet Neural Network

Oct 31, 2020

Ange Lou, Shuyue Guan, Nada Kamona, Murray Loew

Figure 1 for Segmentation of Infrared Breast Images Using MultiResUnet Neural Network

Figure 2 for Segmentation of Infrared Breast Images Using MultiResUnet Neural Network

Figure 3 for Segmentation of Infrared Breast Images Using MultiResUnet Neural Network

Figure 4 for Segmentation of Infrared Breast Images Using MultiResUnet Neural Network

Abstract:Breast cancer is the second leading cause of death for women in the U.S. Early detection of breast cancer is key to higher survival rates of breast cancer patients. We are investigating infrared (IR) thermography as a noninvasive adjunct to mammography for breast cancer screening. IR imaging is radiation-free, pain-free, and non-contact. Automatic segmentation of the breast area from the acquired full-size breast IR images will help limit the area for tumor search, as well as reduce the time and effort costs of manual segmentation. Autoencoder-like convolutional and deconvolutional neural networks (C-DCNN) had been applied to automatically segment the breast area in IR images in previous studies. In this study, we applied a state-of-the-art deep-learning segmentation model, MultiResUnet, which consists of an encoder part to capture features and a decoder part for precise localization. It was used to segment the breast area by using a set of breast IR images, collected in our pilot study by imaging breast cancer patients and normal volunteers with a thermal infrared camera (N2 Imager). The database we used has 450 images, acquired from 14 patients and 16 volunteers. We used a thresholding method to remove interference in the raw images and remapped them from the original 16-bit to 8-bit, and then cropped and segmented the 8-bit images manually. Experiments using leave-one-out cross-validation (LOOCV) and comparison with the ground-truth images by using Tanimoto similarity show that the average accuracy of MultiResUnet is 91.47%, which is about 2% higher than that of the autoencoder. MultiResUnet offers a better approach to segment breast IR images than our previous model.

* 6 pages. Accepted by IEEE AIPR 2019 (Oral)

Via

Access Paper or Ask Questions

The estimation of training accuracy for two-layer neural networks on random datasets without training

Oct 26, 2020

Shuyue Guan, Murray Loew

Figure 1 for The estimation of training accuracy for two-layer neural networks on random datasets without training

Figure 2 for The estimation of training accuracy for two-layer neural networks on random datasets without training

Figure 3 for The estimation of training accuracy for two-layer neural networks on random datasets without training

Figure 4 for The estimation of training accuracy for two-layer neural networks on random datasets without training

Abstract:Although the neural network (NN) technique plays an important role in machine learning, understanding the mechanism of NN models and the transparency of deep learning still require more basic research. In this study we propose a novel theory based on space partitioning to estimate the approximate training accuracy for two-layer neural networks on random datasets without training. There appear to be no other studies that have proposed a method to estimate training accuracy without using input data or trained models. Our method estimates the training accuracy for two-layer fully-connected neural networks on two-class random datasets using only three arguments: the dimensionality of inputs (d), the number of inputs (N), and the number of neurons in the hidden layer (L). We have verified our method using real training accuracies in our experiments. The results indicate that the method will work for any dimension, and the proposed theory could extend also to estimate deeper NN models. This study may provide a starting point for a new way for researchers to make progress on the difficult problem of understanding deep learning.

* 17 pages, 5 figures

Via

Access Paper or Ask Questions