Alex M. Bronstein

HCM: Hardware-Aware Complexity Metric for Neural Network Architectures

Apr 26, 2020

Alex Karbachevsky, Chaim Baskin, Evgenii Zheltonozhskii, Yevgeny Yermolin, Freddy Gabbay, Alex M. Bronstein, Avi Mendelson

Abstract: Convolutional Neural Networks (CNNs) have become common in many fields including computer vision, speech recognition, and natural language processing. Although CNN hardware accelerators are already included as part of many SoC architectures, the task of achieving high accuracy on resource-restricted devices is still considered challenging, mainly due to the vast number of design parameters that need to be balanced to achieve an efficient solution. Quantization techniques, when applied to the network parameters, lead to a reduction of power and area and may also change the ratio between communication and computation. As a result, some algorithmic solutions may suffer from lack of memory bandwidth or computational resources and fail to achieve the expected performance due to hardware constraints. Thus, the system designer and the micro-architect need to understand at early development stages the impact of their high-level decisions (e.g., the architecture of the CNN and the number of bits used to represent its parameters) on the final product (e.g., the expected power saving, area, and accuracy). Unfortunately, existing tools fall short of supporting such decisions. This paper introduces a hardware-aware complexity metric that aims to assist the system designer of neural network architectures throughout the entire project lifetime (especially at its early stages) by predicting the impact of architectural and micro-architectural decisions on the final product. We demonstrate how the proposed metric can help evaluate different design alternatives of neural network models on resource-restricted devices such as real-time embedded systems, and avoid design mistakes at early stages.

Colored Noise Injection for Training Adversarially Robust Neural Networks

Mar 20, 2020

Evgenii Zheltonozhskii, Chaim Baskin, Yaniv Nemcovsky, Brian Chmiel, Avi Mendelson, Alex M. Bronstein

Abstract: Even though deep learning has shown unmatched performance on various tasks, neural networks have been shown to be vulnerable to small adversarial perturbations of the input that lead to significant performance degradation. In this work, we extend the idea of adding white Gaussian noise to the network weights and activations during adversarial training (PNI) to the injection of colored noise for defense against common white-box and black-box attacks. We show that our approach outperforms PNI and various previous approaches in terms of adversarial accuracy on CIFAR-10 and CIFAR-100 datasets. In addition, we provide an extensive ablation study of the proposed method justifying the chosen configurations.

Smoothed Inference for Adversarially-Trained Models

Nov 17, 2019

Yaniv Nemcovsky, Evgenii Zheltonozhskii, Chaim Baskin, Brian Chmiel, Alex M. Bronstein, Avi Mendelson

Abstract: Deep neural networks are known to be vulnerable to inputs with maliciously constructed adversarial perturbations aimed at forcing misclassification. We study randomized smoothing as a way to improve both performance on unperturbed data and robustness to adversarial attacks. Moreover, we extend the method proposed by arXiv:1811.09310 by adding low-rank multivariate noise, which we then use as a base model for smoothing. The proposed method achieves 58.5% top-1 accuracy on CIFAR-10 under PGD attack and outperforms previous works by 4%. In addition, we consider a family of attacks, which were previously used for training purposes in the certified robustness scheme. We demonstrate that the proposed attacks are more effective than PGD against both smoothed and non-smoothed models. Since our method is based on sampling, it lends itself well to trading off model inference complexity against performance. A reference implementation of the proposed techniques is provided at https://github.com/yanemcovsky/SIAM.
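
The sampling-based inference mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation (see their repository for that): it assumes isotropic Gaussian noise on the input rather than the low-rank multivariate noise of the paper, and `smoothed_predict`, `toy_classifier`, `sigma`, and `n_samples` are hypothetical names chosen for illustration.

```python
import random


def smoothed_predict(classifier, x, sigma=0.25, n_samples=32, seed=0):
    """Average class scores over noisy copies of x and return (label, mean scores).

    Assumption: isotropic Gaussian input noise, used here only as a stand-in
    for the noise model described in the paper.
    """
    rng = random.Random(seed)
    totals = None
    for _ in range(n_samples):
        # Perturb every input coordinate independently.
        noisy = [xi + rng.gauss(0.0, sigma) for xi in x]
        scores = classifier(noisy)
        if totals is None:
            totals = [0.0] * len(scores)
        totals = [t + s for t, s in zip(totals, scores)]
    mean_scores = [t / n_samples for t in totals]
    label = max(range(len(mean_scores)), key=mean_scores.__getitem__)
    return label, mean_scores


def toy_classifier(x):
    """Hypothetical two-class model: class 1 when the coordinates sum to >= 0."""
    return [0.0, 1.0] if sum(x) >= 0 else [1.0, 0.0]


label, scores = smoothed_predict(toy_classifier, [1.0, 1.0], sigma=0.1, n_samples=16)
```

In this sketch `n_samples` is the knob behind the trade-off: each extra sample costs one more forward pass and reduces the variance of the averaged scores.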

Loss Aware Post-training Quantization

Nov 17, 2019

Yury Nahshan, Brian Chmiel, Chaim Baskin, Evgenii Zheltonozhskii, Ron Banner, Alex M. Bronstein, Avi Mendelson

Abstract: Neural network quantization enables the deployment of large models on resource-constrained devices. Current post-training quantization methods fall short in terms of accuracy for INT4 (or lower) but provide reasonable accuracy for INT8 (or above). In this work, we study the effect of quantization on the structure of the loss landscape. We show that the structure is flat and separable for mild quantization, enabling straightforward post-training quantization methods to achieve good results. On the other hand, we show that with more aggressive quantization, the loss landscape becomes highly non-separable with sharp minima, making the selection of quantization parameters more challenging. Armed with this understanding, we design a method that quantizes the layer parameters jointly, enabling significant accuracy improvement over current post-training quantization methods. A reference implementation accompanies the paper at https://github.com/ynahshan/nn-quantization-pytorch/tree/master/lapq
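
To make the INT8 versus INT4 gap concrete, here is a small sketch of symmetric uniform quantization on toy values. It only illustrates how the rounding error grows as the bit width shrinks; it does not reproduce the joint, loss-aware parameter selection proposed in the paper. The function names and the `clip` parameter are assumptions introduced for illustration.

```python
def quantize_uniform(values, num_bits, clip):
    """Symmetric uniform quantization of values to num_bits within [-clip, clip]."""
    levels = 2 ** (num_bits - 1) - 1  # 127 for 8 bits, 7 for 4 bits
    step = clip / levels
    quantized = []
    for v in values:
        clipped = max(-clip, min(clip, v))
        quantized.append(round(clipped / step) * step)
    return quantized


def mse(a, b):
    """Mean squared error between two equally long sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


# Deterministic toy "weights": 101 evenly spaced values in [-1, 1], shuffled.
weights = [((i * 37) % 101) / 50.0 - 1.0 for i in range(101)]

err8 = mse(weights, quantize_uniform(weights, 8, clip=1.0))
err4 = mse(weights, quantize_uniform(weights, 4, clip=1.0))
```

On these toy values `err4` is much larger than `err8`, and at low bit widths the result becomes sensitive to the choice of `clip`, which is the kind of quantization parameter the abstract refers to.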

CAT: Compression-Aware Training for bandwidth reduction

Sep 25, 2019

Chaim Baskin, Brian Chmiel, Evgenii Zheltonozhskii, Ron Banner, Alex M. Bronstein, Avi Mendelson

Abstract: Convolutional neural networks (CNNs) have become the dominant neural network architecture for solving visual processing tasks. One of the major obstacles hindering the ubiquitous use of CNNs for inference is their relatively high memory bandwidth requirements, which can be a main energy consumer and throughput bottleneck in hardware accelerators. Accordingly, an efficient feature map compression method can result in substantial performance gains. Inspired by quantization-aware training approaches, we propose a compression-aware training (CAT) method that involves training the model in a way that allows better compression of feature maps during inference. Our method trains the model to achieve low-entropy feature maps, which enables efficient compression at inference time using classical transform coding methods. CAT significantly improves the state-of-the-art results reported for quantization. For example, on ResNet-34 we achieve 73.1% accuracy (0.2% degradation from the baseline) with an average representation of only 1.79 bits per value. A reference implementation accompanies the paper at https://github.com/CAT-teams/CAT
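
The "bits per value" figure can be related to the empirical entropy of the quantized feature map, which lower-bounds the average code length of an ideal entropy coder. The sketch below computes that entropy for two toy maps; it is an illustration of the quantity only, not the training objective or the coder used in the paper, and all names in it are hypothetical.

```python
import math
from collections import Counter


def entropy_bits_per_value(symbols):
    """Empirical Shannon entropy of a sequence of discrete symbols, in bits."""
    counts = Counter(symbols)
    n = len(symbols)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


# Toy quantized feature maps (flattened): one mostly zero, one uniformly spread.
sparse_map = [0] * 12 + [1] * 2 + [2, 3]
dense_map = list(range(16))

sparse_bits = entropy_bits_per_value(sparse_map)
dense_bits = entropy_bits_per_value(dense_map)
```

The mostly-zero map needs far fewer bits per value than the uniformly spread one, which is why training toward low-entropy activations makes them cheaper to move to and from memory.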

Baby steps towards few-shot learning with multiple semantics

Jun 05, 2019

Eli Schwartz, Leonid Karlinsky, Rogerio Feris, Raja Giryes, Alex M. Bronstein

Abstract: Learning from one or few visual examples is one of the key capabilities of humans since early infancy, but is still a significant challenge for modern AI systems. While considerable progress has been achieved in few-shot learning from a few image examples, much less attention has been given to the verbal descriptions that are usually provided to infants when they are presented with a new object. In this paper, we focus on the role of additional semantics that can significantly facilitate few-shot visual learning. Building upon recent advances in few-shot learning with additional semantic information, we demonstrate that further improvements are possible using richer semantics and multiple semantic sources. Using these ideas, we offer the community a new result on the one-shot test of the popular miniImageNet benchmark, comparing favorably to the previous state-of-the-art results for both visual-only and visual plus semantics-based approaches. We also performed an ablation study investigating the components and design choices of our approach.

Feature Map Transform Coding for Energy-Efficient CNN Inference

May 26, 2019

Brian Chmiel, Chaim Baskin, Ron Banner, Evgenii Zheltonozhskii, Yevgeny Yermolin, Alex Karbachevsky, Alex M. Bronstein, Avi Mendelson

Abstract: Convolutional neural networks (CNNs) achieve state-of-the-art accuracy in a variety of tasks in computer vision and beyond. One of the major obstacles hindering the ubiquitous use of CNNs for inference on low-power edge devices is their relatively high computational complexity and memory bandwidth requirements. The latter often dominates the energy footprint on modern hardware. In this paper, we introduce a lossy transform coding approach, inspired by image and video compression, designed to reduce the memory bandwidth due to the storage of intermediate activation calculation results. Our method exploits the high correlations between feature maps and adjacent pixels and allows halving the data transfer volumes to the main memory without re-training. We analyze the performance of our approach on a variety of CNN architectures and demonstrate that an FPGA implementation of ResNet18 with our approach reduces the memory energy footprint by around 40% compared to a quantized network, with negligible impact on accuracy. A reference implementation is available at https://github.com/CompressTeam/TransformCodingInference

Towards Learning of Filter-Level Heterogeneous Compression of Convolutional Neural Networks

Apr 29, 2019

Yochai Zur, Chaim Baskin, Evgenii Zheltonozhskii, Brian Chmiel, Itay Evron, Alex M. Bronstein, Avi Mendelson

Abstract: Recently, deep learning has become a de facto standard in machine learning with convolutional neural networks (CNNs) demonstrating spectacular success on a wide variety of tasks. However, CNNs are typically very demanding computationally at inference time. One of the ways to alleviate this burden on certain hardware platforms is quantization, which relies on a low-precision arithmetic representation for the weights and the activations. Another popular method is the pruning of the number of filters in each layer. While mainstream deep learning methods train the neural network weights while keeping the network architecture fixed, the emerging neural architecture search (NAS) techniques make the latter also amenable to training. In this paper, we formulate optimal arithmetic bit length allocation and neural network pruning as a NAS problem, searching for the configurations satisfying a computational complexity budget while maximizing the accuracy. We use a differentiable search method based on the continuous relaxation of the search space proposed by Liu et al. (arXiv:1806.09055). We show, by grid search, that heterogeneous quantized networks suffer from a high variance which renders the benefit of the search questionable. For pruning, improvement over homogeneous cases is possible, but it is still challenging to find those configurations with the proposed method. The code is publicly available at https://github.com/yochaiz/Slimmable and https://github.com/yochaiz/darts-UNIQ

LaSO: Label-Set Operations networks for multi-label few-shot learning

Feb 26, 2019

Amit Alfassy, Leonid Karlinsky, Amit Aides, Joseph Shtok, Sivan Harary, Rogerio Feris, Raja Giryes, Alex M. Bronstein

Abstract: Example synthesis is one of the leading methods to tackle the problem of few-shot learning, where only a small number of samples per class are available. However, current synthesis approaches only address the scenario of a single category label per image. In this work, we propose a novel technique for synthesizing samples with multiple labels for the (yet unhandled) multi-label few-shot classification scenario. We propose to combine pairs of given examples in feature space, so that the resulting synthesized feature vectors will correspond to examples whose label sets are obtained through certain set operations on the label sets of the corresponding input pairs. Thus, our method is capable of producing a sample containing the intersection, union or set-difference of labels present in two input samples. As we show, these set operations generalize to labels unseen during training. This enables performing augmentation on examples of novel categories, thus facilitating multi-label few-shot classifier learning. We conduct numerous experiments showing promising results for the label-set manipulation capabilities of the proposed approach, both directly (using the classification and retrieval metrics), and in the context of performing data augmentation for multi-label few-shot learning. We propose a benchmark for this new and challenging task and show that our method compares favorably to all the common baselines.
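
The label sets that the synthesized feature vectors are meant to correspond to can be written down directly with ordinary set operations. The sketch below covers only that label-side bookkeeping; the feature-space networks themselves are not shown, and `label_set_targets` is a hypothetical helper name.

```python
def label_set_targets(labels_a, labels_b):
    """Return the target label sets for a pair of multi-label examples."""
    a, b = set(labels_a), set(labels_b)
    return {
        "intersection": a & b,  # labels present in both images
        "union": a | b,         # labels present in either image
        "difference": a - b,    # labels in the first image but not the second
    }


targets = label_set_targets(["dog", "person"], ["person", "bike"])
```

For the pair above, the intersection is `{"person"}`, the union is `{"dog", "person", "bike"}`, and the difference is `{"dog"}`.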

Beholder-GAN: Generation and Beautification of Facial Images with Conditioning on Their Beauty Level

Feb 25, 2019

Nir Diamant, Dean Zadok, Chaim Baskin, Eli Schwartz, Alex M. Bronstein

Abstract: Beauty is in the eye of the beholder. This maxim, emphasizing the subjectivity of the perception of beauty, has enjoyed a wide consensus since ancient times. In the digital era, data-driven methods have been shown to be able to predict human-assigned beauty scores for facial images. In this work, we augment this ability and train a generative model that generates faces conditioned on a requested beauty score. In addition, we show how this trained generator can be used to beautify an input face image. By doing so, we achieve an unsupervised beautification model, in the sense that it relies on no ground truth target images.
