Yufan He

Deformable Cross-Attention Transformer for Medical Image Registration

Mar 10, 2023

Junyu Chen, Yihao Liu, Yufan He, Yong Du

Abstract: Transformers have recently shown promise for medical image applications, leading to increasing interest in developing such models for medical image registration. Recent advancements in designing registration Transformers have focused on using cross-attention (CA) to enable a more precise understanding of spatial correspondences between moving and fixed images. Here, we propose a novel CA mechanism that computes windowed attention using deformable windows. In contrast to existing CA mechanisms, which incur high computational complexity by computing CA either globally or locally with a fixed and expanded search window, the proposed deformable CA can selectively sample a diverse set of features over a large search window while maintaining low computational complexity. The proposed model was extensively evaluated on multi-modal, mono-modal, and atlas-to-patient registration tasks, demonstrating promising performance against state-of-the-art methods and indicating its effectiveness for medical image registration. The source code for this work will be available after publication.
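As a rough illustration of the cross-attention idea in this abstract, the sketch below computes plain scaled dot-product cross-attention in pure Python, with queries taken from one image's features and keys/values taken from a search window in the other image. This is not the paper's deformable mechanism: the function name `cross_attention`, the list-of-lists feature layout, and the choice of which image supplies the queries are all assumptions made for illustration, and the learned sampling of window positions is omitted.

```python
import math

def cross_attention(queries, keys, values):
    """Scaled dot-product cross-attention (illustrative sketch only).

    queries: feature vectors from one image (assumed: the fixed image).
    keys, values: feature vectors from a search window in the other image.
    Returns one attended vector per query.
    """
    d = len(queries[0])
    out = []
    for q in queries:
        # Similarity of this query to every key in the window.
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys]
        # Numerically stable softmax over the window.
        m = max(scores)
        weights = [math.exp(s - m) for s in scores]
        z = sum(weights)
        weights = [w / z for w in weights]
        # Weighted sum of the value vectors.
        out.append([sum(w * v[c] for w, v in zip(weights, values))
                    for c in range(len(values[0]))])
    return out

# Two identical keys receive equal weight, so the output is the mean of the values.
attended = cross_attention([[1.0, 0.0]], [[0.0, 0.0], [0.0, 0.0]], [[1.0], [3.0]])
```

The cost of this computation grows with the number of keys per query, which is why the size of the search window matters for the complexity trade-off the abstract describes.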

MONAI: An open-source framework for deep learning in healthcare

Nov 04, 2022

M. Jorge Cardoso, Wenqi Li, Richard Brown, Nic Ma, Eric Kerfoot, Yiheng Wang, Benjamin Murrey, Andriy Myronenko, Can Zhao, Dong Yang(+46 more)

Abstract: Artificial Intelligence (AI) is having a tremendous impact across most areas of science. Applications of AI in healthcare have the potential to improve our ability to detect, diagnose, prognose, and intervene on human disease. For AI models to be used clinically, they need to be made safe, reproducible and robust, and the underlying software framework must be aware of the particularities (e.g. geometry, physiology, physics) of the medical data being processed. This work introduces MONAI, a freely available, community-supported, and consortium-led PyTorch-based framework for deep learning in healthcare. MONAI extends PyTorch to support medical data, with a particular focus on imaging, and provides purpose-specific AI model architectures, transformations and utilities that streamline the development and deployment of medical AI models. MONAI follows best practices for software development, providing an easy-to-use, robust, well-documented, and well-tested software framework. MONAI preserves the simple, additive, and compositional approach of its underlying PyTorch libraries. MONAI is being used by and receiving contributions from research, clinical and industrial teams from around the world, who are pursuing applications spanning nearly every aspect of healthcare.

* www.monai.io

Automated head and neck tumor segmentation from 3D PET/CT

Sep 22, 2022

Andriy Myronenko, Md Mahfuzur Rahman Siddiquee, Dong Yang, Yufan He, Daguang Xu

Abstract: The Head and Neck Tumor Segmentation challenge (HECKTOR) 2022 offers a platform for researchers to compare their solutions for segmenting tumors and lymph nodes from 3D CT and PET images. In this work, we describe our solution to the HECKTOR 2022 segmentation task. We re-sample all images to a common resolution, crop around the head and neck region, and train the SegResNet semantic segmentation network from MONAI. We use 5-fold cross validation to select the best model checkpoints. The final submission is an ensemble of 15 models from 3 runs. Our solution (team name NVAUTO) achieves 1st place on the HECKTOR22 challenge leaderboard with an aggregated Dice score of 0.78802.
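To make the reported metric concrete, the following sketch computes a per-case Dice coefficient and an aggregated variant in pure Python. It assumes binary masks given as flat 0/1 sequences, and it assumes that "aggregated" means pooling intersections and mask sizes over all cases before dividing; the abstract does not define the metric, so both function names and the empty-mask convention (returning 1.0) are illustrative choices.

```python
def dice(pred, truth):
    """Dice coefficient between two binary masks given as flat 0/1 sequences."""
    inter = sum(p and t for p, t in zip(pred, truth))
    total = sum(pred) + sum(truth)
    # Assumed convention: two empty masks count as a perfect match.
    return 1.0 if total == 0 else 2.0 * inter / total

def aggregated_dice(cases):
    """Pool intersections and mask sizes over all (pred, truth) pairs, then divide."""
    inter = sum(sum(p and t for p, t in zip(pred, truth)) for pred, truth in cases)
    total = sum(sum(pred) + sum(truth) for pred, truth in cases)
    return 1.0 if total == 0 else 2.0 * inter / total

cases = [
    ([1, 1, 0, 0], [1, 0, 0, 0]),  # one overlapping voxel, three foreground voxels in total
    ([0, 0, 0, 0], [0, 0, 0, 0]),  # a case with no foreground at all
]
per_case_mean = sum(dice(p, t) for p, t in cases) / len(cases)
pooled = aggregated_dice(cases)
```

Under these assumptions the pooled score is unaffected by cases with no foreground, whereas the per-case mean depends on the convention chosen for empty masks.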

* HECKTOR22 segmentation challenge. MICCAI 2022. arXiv admin note: text overlap with arXiv:2209.09546

Automated segmentation of intracranial hemorrhages from 3D CT

Sep 21, 2022

Md Mahfuzur Rahman Siddiquee, Dong Yang, Yufan He, Daguang Xu, Andriy Myronenko

Abstract: The Intracranial Hemorrhage Segmentation challenge (INSTANCE 2022) offers a platform for researchers to compare their solutions for segmenting hemorrhagic stroke regions from 3D CTs. In this work, we describe our solution to INSTANCE 2022. We use a 2D segmentation network, SegResNet from MONAI, operating slice-wise without resampling. The final submission is an ensemble of 18 models. Our solution (team name NVAUTO) achieves the top place in terms of the Dice metric (0.721) and ranks 2nd overall. It is implemented with Auto3DSeg.

* INSTANCE22 challenge report, MICCAI2022. arXiv admin note: substantial text overlap with arXiv:2209.09546

Automated ischemic stroke lesion segmentation from 3D MRI

Sep 21, 2022

Md Mahfuzur Rahman Siddiquee, Dong Yang, Yufan He, Daguang Xu, Andriy Myronenko

Abstract: The Ischemic Stroke Lesion Segmentation challenge (ISLES 2022) offers a platform for researchers to compare their solutions for 3D segmentation of ischemic stroke regions from 3D MRIs. In this work, we describe our solution to the ISLES 2022 segmentation task. We re-sample all images to a common resolution, use two input MRI modalities (DWI and ADC), and train the SegResNet semantic segmentation network from MONAI. The final submission is an ensemble of 15 models (from 3 runs of 5-fold cross validation). Our solution (team name NVAUTO) achieves the top place in terms of the Dice metric (0.824) and ranks 2nd overall (based on the combined metric ranking).
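The ensemble arithmetic in this abstract (3 runs of 5-fold cross validation giving 15 models) can be sketched as follows. The abstract does not say how the 15 predictions are combined; averaging per-voxel foreground probabilities and thresholding at 0.5 is an assumption made here for illustration, and `ensemble_mask` is a hypothetical helper, not part of MONAI.

```python
def ensemble_mask(prob_maps, threshold=0.5):
    """Average per-voxel foreground probabilities across models, then threshold.

    prob_maps: one flat list of probabilities per model, all of equal length.
    Combining by the mean is an assumed strategy, not taken from the paper.
    """
    n = len(prob_maps)
    mean = [sum(vals) / n for vals in zip(*prob_maps)]
    return [1 if m >= threshold else 0 for m in mean]

runs, folds = 3, 5
n_models = runs * folds  # 15 models, one per (run, fold) pair

# Toy example with three models and three voxels.
mask = ensemble_mask([
    [0.9, 0.2, 0.6],
    [0.7, 0.4, 0.3],
    [0.8, 0.1, 0.5],
])
```

Averaging probabilities rather than voting on hard masks keeps each model's confidence in play, which is one common reason to prefer it, though other combination rules are equally plausible here.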

* ISLES22 challenge report, MICCAI2022

HyperSegNAS: Bridging One-Shot Neural Architecture Search with 3D Medical Image Segmentation using HyperNet

Dec 20, 2021

Cheng Peng, Andriy Myronenko, Ali Hatamizadeh, Vish Nath, Md Mahfuzur Rahman Siddiquee, Yufan He, Daguang Xu, Rama Chellappa, Dong Yang

Abstract: Semantic segmentation of 3D medical images is a challenging task due to the high variability of the shape and pattern of objects (such as organs or tumors). Given the recent success of deep learning in medical image segmentation, Neural Architecture Search (NAS) has been introduced to find high-performance 3D segmentation network architectures. However, because of the massive computational requirements of 3D data and the discrete optimization nature of architecture search, previous NAS methods require either a long search time or continuous relaxation, and commonly lead to sub-optimal network architectures. While one-shot NAS can potentially address these disadvantages, its application in the segmentation domain has not been well studied in the expansive multi-scale multi-path search space. To enable one-shot NAS for medical image segmentation, our method, named HyperSegNAS, introduces a HyperNet to assist super-net training by incorporating architecture topology information. Such a HyperNet can be removed once the super-net is trained and introduces no overhead during architecture search. We show that HyperSegNAS yields better performing and more intuitive architectures compared to the previous state-of-the-art (SOTA) segmentation networks; furthermore, it can quickly and accurately find good architecture candidates under different computing constraints. Our method is evaluated on public datasets from the Medical Segmentation Decathlon (MSD) challenge, and achieves SOTA performance.

TransMorph: Transformer for unsupervised medical image registration

Nov 23, 2021

Junyu Chen, Yong Du, Yufan He, William P. Segars, Ye Li, Eric C. Frey

Abstract: In the last decade, convolutional neural networks (ConvNets) have dominated the field of medical image analysis. However, the performance of ConvNets may still be limited by their inability to model long-range spatial relations between voxels in an image. Numerous vision Transformers have been proposed recently to address the shortcomings of ConvNets, demonstrating state-of-the-art performance in many medical imaging applications. Transformers may be a strong candidate for image registration because their self-attention mechanism enables a more precise comprehension of the spatial correspondence between moving and fixed images. In this paper, we present TransMorph, a hybrid Transformer-ConvNet model for volumetric medical image registration. We also introduce three variants of TransMorph, with two diffeomorphic variants ensuring topology-preserving deformations and a Bayesian variant producing a well-calibrated registration uncertainty estimate. The proposed models are extensively validated against a variety of existing registration methods and Transformer architectures using volumetric medical images from two applications: inter-patient brain MRI registration and phantom-to-CT registration. Qualitative and quantitative results demonstrate that TransMorph and its variants lead to a substantial performance improvement over the baseline methods, demonstrating the effectiveness of Transformers for medical image registration.

* 32 pages, 28 figures

ViT-V-Net: Vision Transformer for Unsupervised Volumetric Medical Image Registration

Apr 13, 2021

Junyu Chen, Yufan He, Eric C. Frey, Ye Li, Yong Du

Abstract: In the last decade, convolutional neural networks (ConvNets) have dominated and achieved state-of-the-art performance in a variety of medical imaging applications. However, the performance of ConvNets is still limited by a lack of understanding of long-range spatial relations in an image. The recently proposed Vision Transformer (ViT) for image classification uses a purely self-attention-based model that learns long-range spatial relations to focus on the relevant parts of an image. Nevertheless, ViT emphasizes low-resolution features because of its consecutive downsamplings, resulting in a lack of detailed localization information and making it unsuitable for image registration. Recently, several ViT-based image segmentation methods have been combined with ConvNets to improve the recovery of detailed localization information. Inspired by them, we present ViT-V-Net, which bridges ViT and ConvNet to provide volumetric medical image registration. The experimental results presented here demonstrate that the proposed architecture achieves superior performance to several top-performing registration methods.

DiNTS: Differentiable Neural Network Topology Search for 3D Medical Image Segmentation

Mar 29, 2021

Yufan He, Dong Yang, Holger Roth, Can Zhao, Daguang Xu

Abstract: Recently, neural architecture search (NAS) has been applied to automatically search for high-performance networks for medical image segmentation. The NAS search space usually contains a network topology level (controlling connections among cells with different spatial scales) and a cell level (operations within each cell). Existing methods either require a long search time for large-scale 3D image datasets, or are limited to pre-defined topologies (such as U-shaped or single-path). In this work, we focus on three important aspects of NAS in 3D medical image segmentation: flexible multi-path network topology, high search efficiency, and budgeted GPU memory usage. A novel differentiable search framework is proposed to support fast gradient-based search within a highly flexible network topology search space. Discretizing the searched optimal continuous model in a differentiable scheme may produce a sub-optimal final discrete model (the discretization gap). Therefore, we propose a topology loss to alleviate this problem. In addition, the GPU memory usage of the searched 3D model is limited with budget constraints during search. Our Differentiable Network Topology Search scheme (DiNTS) is evaluated on the Medical Segmentation Decathlon (MSD) challenge, which contains ten challenging segmentation tasks. Our method achieves state-of-the-art performance and the top ranking on the MSD challenge leaderboard.

* CVPR2021 oral

Information-based Disentangled Representation Learning for Unsupervised MR Harmonization

Mar 24, 2021

Lianrui Zuo, Blake E. Dewey, Aaron Carass, Yihao Liu, Yufan He, Peter A. Calabresi, Jerry L. Prince

Abstract: Accuracy and consistency are two key factors in computer-assisted magnetic resonance (MR) image analysis. However, contrast variation from site to site caused by lack of standardization in MR acquisition impedes consistent measurements. In recent years, image harmonization approaches have been proposed to compensate for contrast variation in MR images. Current harmonization approaches either require cross-site traveling subjects for supervised training or heavily rely on site-specific harmonization models to encourage harmonization accuracy. These requirements potentially limit the application of current harmonization methods in large-scale multi-site studies. In this work, we propose an unsupervised MR harmonization framework, CALAMITI (Contrast Anatomy Learning and Analysis for MR Intensity Translation and Integration), based on information bottleneck theory. CALAMITI learns a disentangled latent space using a unified structure for multi-site harmonization without the need for traveling subjects. Our model is also able to adapt itself to harmonize MR images from a new site with fine tuning solely on images from the new site. Both qualitative and quantitative results show that the proposed method achieves superior performance compared with other unsupervised harmonization approaches.

* Accepted in the 27th International Conference on Information Processing in Medical Imaging (IPMI 2021)
