Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:One Hyper-Initializer for All Network Architectures in Medical Image Analysis

Jun 08, 2022

Fangxin Shang, Yehui Yang, Dalu Yang, Junde Wu, Xiaorong Wang, Yanwu Xu

Figure 1 for One Hyper-Initializer for All Network Architectures in Medical Image Analysis

Figure 2 for One Hyper-Initializer for All Network Architectures in Medical Image Analysis

Figure 3 for One Hyper-Initializer for All Network Architectures in Medical Image Analysis

Figure 4 for One Hyper-Initializer for All Network Architectures in Medical Image Analysis

Share this with someone who'll enjoy it:

Abstract:Pre-training is essential to deep learning model performance, especially in medical image analysis tasks where limited training data are available. However, existing pre-training methods are inflexible as the pre-trained weights of one model cannot be reused by other network architectures. In this paper, we propose an architecture-irrelevant hyper-initializer, which can initialize any given network architecture well after being pre-trained for only once. The proposed initializer is a hypernetwork which takes a downstream architecture as input graphs and outputs the initialization parameters of the respective architecture. We show the effectiveness and efficiency of the hyper-initializer through extensive experimental results on multiple medical imaging modalities, especially in data-limited fields. Moreover, we prove that the proposed algorithm can be reused as a favorable plug-and-play initializer for any downstream architecture and task (both classification and segmentation) of the same modality.

View paper on

Share this with someone who'll enjoy it:

Title:One Hyper-Initializer for All Network Architectures in Medical Image Analysis

Paper and Code