
Title: Tucker Tensor Layer in Fully Connected Neural Networks

Mar 14, 2019

Giuseppe G. Calvi, Ahmad Moniri, Mahmoud Mahfouz, Zeyang Yu, Qibin Zhao, Danilo P. Mandic



Abstract: We introduce the Tucker Tensor Layer (TTL), an alternative to the dense weight matrices of the fully connected layers of feed-forward neural networks (NNs), to answer the long-standing quest to compress NNs and improve their interpretability. This is achieved by treating these weight matrices as the unfolding of a higher-order weight tensor. This enables us to introduce a framework for exploiting the multi-way nature of the weight tensor in order to efficiently reduce the number of parameters, by virtue of the compression properties of tensor decompositions. The Tucker Decomposition (TKD) is employed to decompose the weight tensor into a core tensor and factor matrices. We re-derive back-propagation within this framework by extending the notion of matrix derivatives to tensors. In this way, the physical interpretability of the TKD is exploited to gain insights into training, through the process of computing gradients with respect to each factor matrix. The proposed framework is validated on synthetic data and on the Fashion-MNIST dataset, emphasizing the relative importance of various data features in training, hence mitigating the "black-box" issue inherent to NNs. Experiments on both MNIST and Fashion-MNIST illustrate the compression properties of the TTL, achieving a 66.63-fold compression whilst maintaining comparable performance to the uncompressed NN.
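To make the idea in the abstract concrete, here is a minimal NumPy sketch of a weight matrix stored as the unfolding of a third-order tensor in Tucker form. It is an illustration under stated assumptions, not the authors' implementation: the function names (`tucker_to_tensor`, `ttl_forward`, `param_counts`), the third-order shape, the ranks, and the row-major unfolding convention are all hypothetical choices made for this example.

```python
import numpy as np

def tucker_to_tensor(core, factors):
    """Rebuild a 3rd-order tensor from a Tucker core and three factor matrices.

    core:    shape (R1, R2, R3)
    factors: [U1, U2, U3] with U_n of shape (I_n, R_n)
    Returns a tensor of shape (I1, I2, I3).
    """
    U1, U2, U3 = factors
    # Mode-n products of the core with each factor matrix, written as one einsum.
    return np.einsum("abc,ia,jb,kc->ijk", core, U1, U2, U3)

def ttl_forward(x, core, factors):
    """Apply the layer to one input vector x of length I2 * I3.

    Assumption: the dense weight matrix is the row-major unfolding of the
    weight tensor along its first mode, giving shape (I1, I2 * I3).
    """
    W = tucker_to_tensor(core, factors)
    W_mat = W.reshape(W.shape[0], -1)
    return W_mat @ x

def param_counts(dims, ranks):
    """Return (dense, tucker) parameter counts for the given shapes."""
    dense = int(np.prod(dims))
    tucker = int(np.prod(ranks)) + sum(i * r for i, r in zip(dims, ranks))
    return dense, tucker

# Illustrative sizes only: 10 outputs, 784 inputs viewed as 28 x 28.
dims, ranks = (10, 28, 28), (5, 4, 4)
rng = np.random.default_rng(0)
core = rng.standard_normal(ranks)
factors = [rng.standard_normal((i, r)) for i, r in zip(dims, ranks)]
y = ttl_forward(rng.standard_normal(28 * 28), core, factors)
dense, tucker = param_counts(dims, ranks)
print(y.shape, dense, tucker, dense / tucker)
```

The sizes above are picked for readability and are not the configuration behind the 66.63-fold figure reported in the abstract. The sketch rebuilds the full weight matrix for clarity; the same output can be obtained by contracting the input with the factor matrices and the core directly, which avoids materialising the dense matrix.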
