Sajid Anwar

Compact Deep Convolutional Neural Networks With Coarse Pruning

Oct 30, 2016

Sajid Anwar, Wonyong Sung

Abstract: The learning capability of a neural network improves with increasing depth, at a higher computational cost. Wider layers with dense kernel connectivity patterns further increase this cost and may hinder real-time inference. We propose feature map and kernel level pruning for reducing the computational complexity of a deep convolutional neural network. Pruning feature maps reduces the width of a layer and hence does not need any sparse representation. Further, kernel pruning converts the dense connectivity pattern into a sparse one. Due to their coarse nature, these pruning granularities can be exploited by GPU and VLSI based implementations. We propose a simple and generic strategy to choose the least adversarial pruning masks for both granularities. The pruned networks are retrained, which compensates for the loss in accuracy. We obtain the best pruning ratios when we prune a network with both granularities. Experiments with the CIFAR-10 dataset show that more than 85% sparsity can be induced in the convolution layers with less than a 1% increase in the misclassification rate of the baseline network.
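The two pruning granularities and the mask selection described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes that "least adversarial" means the candidate mask with the lowest validation error among several random candidates, and every name here (`random_kernel_mask`, `prune_feature_maps`, `select_mask`, `toy_error`) is hypothetical. A layer is modeled only by its kernel connectivity matrix, where entry `[r][c]` is the kernel linking input map `c` to output map `r`.

```python
import random

def random_kernel_mask(n_out, n_in, prune_ratio, rng):
    """Return an n_out x n_in 0/1 mask with a fixed fraction of kernels pruned (0)."""
    total = n_out * n_in
    n_pruned = round(prune_ratio * total)
    flat = [0] * n_pruned + [1] * (total - n_pruned)
    rng.shuffle(flat)
    return [flat[r * n_in:(r + 1) * n_in] for r in range(n_out)]

def prune_feature_maps(mask, pruned_maps):
    """Feature map pruning: drop every kernel that produces a pruned output map."""
    return [[0] * len(row) if r in pruned_maps else list(row)
            for r, row in enumerate(mask)]

def sparsity(mask):
    """Fraction of kernels removed from the layer."""
    total = sum(len(row) for row in mask)
    return 1 - sum(map(sum, mask)) / total

def select_mask(candidates, error_fn):
    """Keep the candidate mask that hurts the network least (lowest error)."""
    return min(candidates, key=error_fn)

# Toy stand-in for a validation pass (assumption): each kernel has a random
# importance, and the "error" of a mask is the importance it throws away.
rng = random.Random(0)
importance = [[rng.random() for _ in range(3)] for _ in range(4)]

def toy_error(mask):
    return sum(importance[r][c]
               for r in range(4) for c in range(3) if mask[r][c] == 0)

candidates = [random_kernel_mask(4, 3, 0.5, rng) for _ in range(10)]
best = select_mask(candidates, toy_error)       # kernel level pruning
best = prune_feature_maps(best, {1})            # then remove output map 1
print(sparsity(best))
```

In a real network, `toy_error` would be replaced by the misclassification rate measured on held-out data with the mask applied, and the selected mask would be followed by retraining, as the abstract notes.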

Structured Pruning of Deep Convolutional Neural Networks

Dec 29, 2015

Sajid Anwar, Kyuyeon Hwang, Wonyong Sung

Abstract: Real-time application of deep learning algorithms is often hindered by high computational complexity and frequent memory accesses. Network pruning is a promising technique to solve this problem. However, pruning usually results in irregular network connections that not only demand extra representation effort but also do not fit well on parallel computation. We introduce structured sparsity at various scales for convolutional neural networks: channel-wise, kernel-wise, and intra-kernel strided sparsity. This structured sparsity is very advantageous for direct computational resource savings on embedded computers, parallel computing environments, and hardware-based systems. To decide the importance of network connections and paths, the proposed method uses a particle filtering approach. The importance weight of each particle is assigned by computing the misclassification rate with the corresponding connectivity pattern. The pruned network is retrained to compensate for the losses due to pruning. While implementing convolutions as matrix products, we particularly show that intra-kernel strided sparsity with a simple constraint can significantly reduce the size of the kernel and feature map matrices. The pruned network is finally fixed-point optimized with reduced word-length precision. This results in a significant reduction in the total storage size, providing advantages for on-chip memory based implementations of deep neural networks.

* 11 pages, 8 figures, 1 table
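The claim about matrix-product convolutions can be made concrete with a little arithmetic. The sketch below is an illustration only, under one stated assumption: the "simple constraint" is taken to mean that every kernel in the layer shares the same stride and offset, so the pruned weight positions line up and whole rows and columns of the two matrices can be dropped. The helper names `strided_mask` and `matrix_shapes` are hypothetical.

```python
def strided_mask(k, stride, offset=0):
    """k x k 0/1 mask keeping every `stride`-th weight in row-major order."""
    return [[1 if (r * k + c) % stride == offset else 0 for c in range(k)]
            for r in range(k)]

def matrix_shapes(n_in, n_out, k, out_h, out_w, mask=None):
    """Shapes of the kernel matrix and the unrolled feature map (patch) matrix.

    With a mask shared by all kernels (assumption), only the kept weight
    positions contribute to the inner dimension of the matrix product.
    """
    kept = k * k if mask is None else sum(map(sum, mask))
    kernel_matrix = (n_out, n_in * kept)
    patch_matrix = (n_in * kept, out_h * out_w)
    return kernel_matrix, patch_matrix

mask = strided_mask(5, stride=2)
dense = matrix_shapes(n_in=3, n_out=8, k=5, out_h=28, out_w=28)
sparse = matrix_shapes(n_in=3, n_out=8, k=5, out_h=28, out_w=28, mask=mask)
print(dense)   # ((8, 75), (75, 784))
print(sparse)  # ((8, 39), (39, 784))
```

With a 5x5 kernel and stride 2, 13 of the 25 weights survive, so the shared inner dimension shrinks from 75 to 39 for three input maps. If each kernel instead used its own offset, the union of kept positions could cover the whole kernel and no rows could be removed, which is why a shared pattern matters in this reading.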
