Abstract: As state-of-the-art neural networks (NNs) continue to grow in size, their resource-efficient implementation becomes ever more important. In this paper, we introduce a compression scheme that reduces the number of computations required for NN inference on reconfigurable hardware such as FPGAs. This is achieved by combining pruning via regularized training, weight sharing, and linear computation coding (LCC). In contrast to common NN compression techniques, whose objective is to reduce the memory required to store the weights of the NNs, our approach is optimized to reduce the number of additions required for inference in a hardware-friendly manner. The proposed scheme achieves competitive performance for simple multilayer perceptrons as well as for large-scale deep NNs such as ResNet-34.
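The following is a minimal, illustrative sketch of the two software-side ingredients named in the abstract, pruning via regularized training and weight sharing. It assumes an L1 penalty and a simple 1-D k-means clustering of the surviving weights; the paper's actual regularizer, thresholds, cluster counts, and the LCC back end are not specified here, so all parameter names and values below are hypothetical.

```python
import torch
import torch.nn as nn

def l1_regularized_loss(model, criterion, outputs, targets, lam=1e-4):
    """Task loss plus an L1 penalty that drives many weights toward zero,
    so they can later be pruned. (Illustrative choice; the paper's exact
    regularizer may differ.)"""
    reg = sum(p.abs().sum() for p in model.parameters())
    return criterion(outputs, targets) + lam * reg

def prune_and_share(weight, threshold=1e-2, num_clusters=16):
    """Zero out small weights, then map the survivors onto a small set of
    shared values (crude 1-D k-means), so each layer needs only a few
    distinct multiplicands before a coding scheme such as LCC is applied."""
    w = weight.detach().clone()
    mask = w.abs() >= threshold                     # pruning mask
    survivors = w[mask]
    # initialize cluster centers at quantiles of the surviving weights
    centers = torch.quantile(survivors, torch.linspace(0, 1, num_clusters))
    for _ in range(20):                             # Lloyd iterations
        idx = (survivors[:, None] - centers[None, :]).abs().argmin(dim=1)
        for k in range(num_clusters):
            sel = survivors[idx == k]
            if sel.numel() > 0:
                centers[k] = sel.mean()
    idx = (survivors[:, None] - centers[None, :]).abs().argmin(dim=1)
    w[mask] = centers[idx]                          # weight sharing
    w[~mask] = 0.0                                  # pruning
    return w
```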
Abstract: Linear computation coding is concerned with the compression of multidimensional linear functions, i.e., with reducing the computational effort of multiplying an arbitrary vector by an arbitrary, but known, constant matrix. This paper advances over the state of the art, which is based on a discrete matching pursuit (DMP) algorithm, by means of a step-wise optimal search. While this search offers significant performance gains over DMP, it is computationally infeasible for large matrices and high accuracies. Therefore, a reduced-state algorithm is introduced that still outperforms DMP while remaining computationally feasible even for large matrices. Depending on the matrix size, the performance gain over DMP is at least on the order of 10%.
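To make the underlying idea concrete, here is a minimal greedy sketch of the matching-pursuit principle behind linear computation coding: a known constant vector (e.g., a column of the matrix) is approximated by a few signed power-of-two multiples of codebook vectors, so that applying it costs only shifts and additions. This is an assumption-laden toy, not the paper's DMP, its step-wise optimal search, or the reduced-state algorithm; the function name, exponent range, and term budget are chosen purely for illustration.

```python
import numpy as np

def greedy_spt_approx(target, codebook, num_terms=4, exp_range=range(-8, 4)):
    """Greedily approximate `target` as a sum of `num_terms` signed
    power-of-two multiples of codebook columns (matching-pursuit style).
    Each selected term costs one bit-shift and one addition in hardware."""
    residual = target.astype(float)
    terms = []
    for _ in range(num_terms):
        best = None
        for j in range(codebook.shape[1]):
            col = codebook[:, j]
            for e in exp_range:
                for s in (+1, -1):
                    r = residual - s * (2.0 ** e) * col
                    err = float(np.dot(r, r))
                    if best is None or err < best[0]:
                        best = (err, j, s, e, r)
        _, j, s, e, residual = best
        terms.append((j, s, e))                 # (codebook index, sign, exponent)
    return terms, residual

# toy example: approximate one target column using the standard basis as
# codebook, i.e., using signed, shifted copies of the input entries
rng = np.random.default_rng(0)
t = rng.standard_normal(4)
terms, res = greedy_spt_approx(t, np.eye(4), num_terms=6)
print(terms, np.linalg.norm(res) / np.linalg.norm(t))
```

In this toy setting, the relative residual norm printed at the end shrinks as the term budget grows, mirroring the trade-off between the number of additions and the approximation accuracy that the abstract refers to.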