Title: Gradient-Coherent Strong Regularization for Deep Neural Networks

Nov 20, 2018

Dae Hoon Park, Chiu Man Ho, Yi Chang, Huaqing Zhang

Abstract: Deep neural networks, with their numerous parameters, are prone to over-fitting, so regularization plays an important role in generalization. L1 and L2 regularizers are common regularization tools in machine learning because they are simple and effective. However, we observe that imposing strong L1 or L2 regularization on deep neural networks trained with stochastic gradient descent easily fails, which limits the generalization ability of the underlying networks. To understand this phenomenon, we first investigate how and why learning fails when strong regularization is imposed on deep neural networks. We then propose a novel method, gradient-coherent strong regularization, which imposes regularization only when the gradients remain coherent in the presence of strong regularization. Experiments are performed with multiple deep architectures on three benchmark data sets for image recognition. The results show that the proposed approach tolerates strong regularization and significantly improves both accuracy and compression, which could not be achieved otherwise.
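
The idea of imposing regularization only while the gradients stay coherent can be illustrated with a minimal sketch. The abstract does not state the coherence criterion, so this example assumes one: the L2 penalty is applied only when the cosine similarity between the loss gradient and the regularized gradient is at least a threshold. The function name `coherent_l2_step`, the parameter `min_cos`, and the cosine test itself are hypothetical illustrations and should not be read as the authors' algorithm.

```python
import numpy as np

def coherent_l2_step(w, loss_grad, lam, lr, min_cos=0.0):
    """One SGD step that applies an L2 penalty only if the gradients stay coherent.

    Assumption (not from the paper): "coherent" means the cosine similarity
    between the plain loss gradient and the regularized gradient is >= min_cos.
    """
    reg_grad = lam * w                      # gradient of (lam / 2) * ||w||^2
    full_grad = loss_grad + reg_grad        # gradient of loss + penalty
    denom = np.linalg.norm(loss_grad) * np.linalg.norm(full_grad)
    # If either gradient is zero the cosine is undefined; treat it as coherent.
    cos = float(loss_grad @ full_grad / denom) if denom > 0 else 1.0
    use_reg = cos >= min_cos
    step = full_grad if use_reg else loss_grad  # skip the penalty when incoherent
    return w - lr * step, use_reg

# Hypothetical usage: a mild penalty is applied, a very strong one is skipped
# because it would reverse the direction of the loss gradient.
w = np.array([1.0, 0.0])
g = np.array([-1.0, 0.0])
_, used_mild = coherent_l2_step(w, g, lam=0.5, lr=0.1)
_, used_strong = coherent_l2_step(w, g, lam=10.0, lr=0.1)
print(used_mild, used_strong)
```

Under this assumed criterion, a penalty strong enough to flip the update direction is dropped for that step, so the loss gradient still drives learning.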
