Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Augmenting Sub-model to Improve Main Model

Jun 20, 2023

Byeongho Heo, Taekyung Kim, Sangdoo Yun, Dongyoon Han

Figure 1 for Augmenting Sub-model to Improve Main Model

Figure 2 for Augmenting Sub-model to Improve Main Model

Figure 3 for Augmenting Sub-model to Improve Main Model

Figure 4 for Augmenting Sub-model to Improve Main Model

Share this with someone who'll enjoy it:

Abstract:Image classification has improved with the development of training techniques. However, these techniques often require careful parameter tuning to balance the strength of regularization, limiting their potential benefits. In this paper, we propose a novel way to use regularization called Augmenting Sub-model (AugSub). AugSub consists of two models: the main model and the sub-model. While the main model employs conventional training recipes, the sub-model leverages the benefit of additional regularization. AugSub achieves this by mitigating adverse effects through a relaxed loss function similar to self-distillation loss. We demonstrate the effectiveness of AugSub with three drop techniques: dropout, drop-path, and random masking. Our analysis shows that all AugSub improves performance, with the training loss converging even faster than regular training. Among the three, AugMask is identified as the most practical method due to its performance and cost efficiency. We further validate AugMask across diverse training recipes, including DeiT-III, ResNet, MAE fine-tuning, and Swin Transformer. The results show that AugMask consistently provides significant performance gain. AugSub provides a practical and effective solution for introducing additional regularization under various training recipes. Code is available at \url{https://github.com/naver-ai/augsub}.

* 15 pages, 3 figures

View paper on

Share this with someone who'll enjoy it:

Title:Augmenting Sub-model to Improve Main Model

Paper and Code