Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Alex M. Bronstein

Efficient non-uniform quantizer for quantized neural network targeting reconfigurable hardware

Nov 27, 2018

Natan Liss, Chaim Baskin, Avi Mendelson, Alex M. Bronstein, Raja Giryes

Figure 1 for Efficient non-uniform quantizer for quantized neural network targeting reconfigurable hardware

Figure 2 for Efficient non-uniform quantizer for quantized neural network targeting reconfigurable hardware

Figure 3 for Efficient non-uniform quantizer for quantized neural network targeting reconfigurable hardware

Figure 4 for Efficient non-uniform quantizer for quantized neural network targeting reconfigurable hardware

Abstract:Convolutional Neural Networks (CNN) has become more popular choice for various tasks such as computer vision, speech recognition and natural language processing. Thanks to their large computational capability and throughput, GPUs ,which are not power efficient and therefore does not suit low power systems such as mobile devices, are the most common platform for both training and inferencing tasks. Recent studies has shown that FPGAs can provide a good alternative to GPUs as a CNN accelerator, due to their re-configurable nature, low power and small latency. In order for FPGA-based accelerators outperform GPUs in inference task, both the parameters of the network and the activations must be quantized. While most works use uniform quantizers for both parameters and activations, it is not always the optimal one, and a non-uniform quantizer need to be considered. In this work we introduce a custom hardware-friendly approach to implement non-uniform quantizers. In addition, we use a single scale integer representation of both parameters and activations, for both training and inference. The combined method yields a hardware efficient non-uniform quantizer, fit for real-time applications. We have tested our method on CIFAR-10 and CIFAR-100 image classification datasets with ResNet-18 and VGG-like architectures, and saw little degradation in accuracy.

* In submission

Via

Access Paper or Ask Questions

UNIQ: Uniform Noise Injection for Non-Uniform Quantization of Neural Networks

Oct 02, 2018

Chaim Baskin, Eli Schwartz, Evgenii Zheltonozhskii, Natan Liss, Raja Giryes, Alex M. Bronstein, Avi Mendelson

Figure 1 for UNIQ: Uniform Noise Injection for Non-Uniform Quantization of Neural Networks

Figure 2 for UNIQ: Uniform Noise Injection for Non-Uniform Quantization of Neural Networks

Figure 3 for UNIQ: Uniform Noise Injection for Non-Uniform Quantization of Neural Networks

Figure 4 for UNIQ: Uniform Noise Injection for Non-Uniform Quantization of Neural Networks

Abstract:We present a novel method for neural network quantization that emulates a non-uniform $k$-quantile quantizer, which adapts to the distribution of the quantized parameters. Our approach provides a novel alternative to the existing uniform quantization techniques for neural networks. We suggest to compare the results as a function of the bit-operations (BOPS) performed, assuming a look-up table availability for the non-uniform case. In this setup, we show the advantages of our strategy in the low computational budget regime. While the proposed solution is harder to implement in hardware, we believe it sets a basis for new alternatives to neural networks quantization.

Via

Access Paper or Ask Questions

High frame-rate cardiac ultrasound imaging with deep learning

Aug 23, 2018

Ortal Senouf, Sanketh Vedula, Grigoriy Zurakhov, Alex M. Bronstein, Michael Zibulevsky, Oleg Michailovich, Dan Adam, David Blondheim

Figure 1 for High frame-rate cardiac ultrasound imaging with deep learning

Figure 2 for High frame-rate cardiac ultrasound imaging with deep learning

Figure 3 for High frame-rate cardiac ultrasound imaging with deep learning

Figure 4 for High frame-rate cardiac ultrasound imaging with deep learning

Abstract:Cardiac ultrasound imaging requires a high frame rate in order to capture rapid motion. This can be achieved by multi-line acquisition (MLA), where several narrow-focused received lines are obtained from each wide-focused transmitted line. This shortens the acquisition time at the expense of introducing block artifacts. In this paper, we propose a data-driven learning-based approach to improve the MLA image quality. We train an end-to-end convolutional neural network on pairs of real ultrasound cardiac data, acquired through MLA and the corresponding single-line acquisition (SLA). The network achieves a significant improvement in image quality for both $5-$ and $7-$line MLA resulting in a decorrelation measure similar to that of SLA while having the frame rate of MLA.

* To appear in the Proceedings of MICCAI, 2018

Via

Access Paper or Ask Questions

High quality ultrasonic multi-line transmission through deep learning

Aug 23, 2018

Sanketh Vedula, Ortal Senouf, Grigoriy Zurakhov, Alex M. Bronstein, Michael Zibulevsky, Oleg Michailovich, Dan Adam, Diana Gaitini

Figure 1 for High quality ultrasonic multi-line transmission through deep learning

Figure 2 for High quality ultrasonic multi-line transmission through deep learning

Figure 3 for High quality ultrasonic multi-line transmission through deep learning

Abstract:Frame rate is a crucial consideration in cardiac ultrasound imaging and 3D sonography. Several methods have been proposed in the medical ultrasound literature aiming at accelerating the image acquisition. In this paper, we consider one such method called \textit{multi-line transmission} (MLT), in which several evenly separated focused beams are transmitted simultaneously. While MLT reduces the acquisition time, it comes at the expense of a heavy loss of contrast due to the interactions between the beams (cross-talk artifact). In this paper, we introduce a data-driven method to reduce the artifacts arising in MLT. To this end, we propose to train an end-to-end convolutional neural network consisting of correction layers followed by a constant apodization layer. The network is trained on pairs of raw data obtained through MLT and the corresponding \textit{single-line transmission} (SLT) data. Experimental evaluation demonstrates significant improvement both in the visual image quality and in objective measures such as contrast ratio and contrast-to-noise ratio, while preserving resolution unlike traditional apodization-based methods. We show that the proposed method is able to generalize well across different patients and anatomies on real and phantom data.

* To appear in Proceedings of MLMIR workshop, MICCAI 2018

Via

Access Paper or Ask Questions

Class-Aware Fully-Convolutional Gaussian and Poisson Denoising

Aug 20, 2018

Tal Remez, Or Litany, Raja Giryes, Alex M. Bronstein

Figure 1 for Class-Aware Fully-Convolutional Gaussian and Poisson Denoising

Figure 2 for Class-Aware Fully-Convolutional Gaussian and Poisson Denoising

Figure 3 for Class-Aware Fully-Convolutional Gaussian and Poisson Denoising

Figure 4 for Class-Aware Fully-Convolutional Gaussian and Poisson Denoising

Abstract:We propose a fully-convolutional neural-network architecture for image denoising which is simple yet powerful. Its structure allows to exploit the gradual nature of the denoising process, in which shallow layers handle local noise statistics, while deeper layers recover edges and enhance textures. Our method advances the state-of-the-art when trained for different noise levels and distributions (both Gaussian and Poisson). In addition, we show that making the denoiser class-aware by exploiting semantic class information boosts performance, enhances textures and reduces artifacts.

Via

Access Paper or Ask Questions

RepMet: Representative-based metric learning for classification and one-shot object detection

Jun 15, 2018

Eli Schwartz, Leonid Karlinsky, Joseph Shtok, Sivan Harary, Mattias Marder, Sharathchandra Pankanti, Rogerio Feris, Abhishek Kumar, Raja Giryes, Alex M. Bronstein

Figure 1 for RepMet: Representative-based metric learning for classification and one-shot object detection

Figure 2 for RepMet: Representative-based metric learning for classification and one-shot object detection

Figure 3 for RepMet: Representative-based metric learning for classification and one-shot object detection

Figure 4 for RepMet: Representative-based metric learning for classification and one-shot object detection

Abstract:Distance metric learning (DML) has been successfully applied to object classification, both in the standard regime of rich training data and in the few-shot scenario, where each category is represented by only few examples. In this work, we propose a new method for DML, featuring a joint learning of the embedding space and of the data distribution of the training categories, in a single training process. Our method improves upon leading algorithms for DML-based object classification. Furthermore, it opens the door for a new task in Computer Vision - a few-shot object detection, since the proposed DML architecture can be naturally embedded as the classification head of any standard object detector. In numerous experiments, we achieve state-of-the-art classification results on a variety of fine-grained datasets, and offer the community a benchmark on the few-shot detection task, performed on the Imagenet-LOC dataset. The code will be made available upon acceptance.

Via

Access Paper or Ask Questions

Delta-encoder: an effective sample synthesis method for few-shot object recognition

Jun 12, 2018

Eli Schwartz, Leonid Karlinsky, Joseph Shtok, Sivan Harary, Mattias Marder, Rogerio Feris, Abhishek Kumar, Raja Giryes, Alex M. Bronstein

Figure 1 for Delta-encoder: an effective sample synthesis method for few-shot object recognition

Figure 2 for Delta-encoder: an effective sample synthesis method for few-shot object recognition

Figure 3 for Delta-encoder: an effective sample synthesis method for few-shot object recognition

Figure 4 for Delta-encoder: an effective sample synthesis method for few-shot object recognition

Abstract:Learning to classify new categories based on just one or a few examples is a long-standing challenge in modern computer vision. In this work, we proposes a simple yet effective method for few-shot (and one-shot) object recognition. Our approach is based on a modified auto-encoder, denoted Delta-encoder, that learns to synthesize new samples for an unseen category just by seeing few examples from it. The synthesized samples are then used to train a classifier. The proposed approach learns to both extract transferable intra-class deformations, or "deltas", between same-class pairs of training examples, and to apply those deltas to the few provided examples of a novel class (unseen during training) in order to efficiently synthesize samples from that new class. The proposed method improves over the state-of-the-art in one-shot object-recognition and compares favorably in the few-shot case. Upon acceptance code will be made available.

Via

Access Paper or Ask Questions

Tradeoffs between Convergence Speed and Reconstruction Accuracy in Inverse Problems

Feb 15, 2018

Raja Giryes, Yonina C. Eldar, Alex M. Bronstein, Guillermo Sapiro

Figure 1 for Tradeoffs between Convergence Speed and Reconstruction Accuracy in Inverse Problems

Figure 2 for Tradeoffs between Convergence Speed and Reconstruction Accuracy in Inverse Problems

Figure 3 for Tradeoffs between Convergence Speed and Reconstruction Accuracy in Inverse Problems

Abstract:Solving inverse problems with iterative algorithms is popular, especially for large data. Due to time constraints, the number of possible iterations is usually limited, potentially affecting the achievable accuracy. Given an error one is willing to tolerate, an important question is whether it is possible to modify the original iterations to obtain faster convergence to a minimizer achieving the allowed error without increasing the computational cost of each iteration considerably. Relying on recent recovery techniques developed for settings in which the desired signal belongs to some low-dimensional set, we show that using a coarse estimate of this set may lead to faster convergence at the cost of an additional reconstruction error related to the accuracy of the set approximation. Our theory ties to recent advances in sparse recovery, compressed sensing, and deep learning. Particularly, it may provide a possible explanation to the successful approximation of the l1-minimization solution by neural networks with layers representing iterations, as practiced in the learned iterative shrinkage-thresholding algorithm (LISTA).

* To appear in IEEE Transactions on Signal Processing

Via

Access Paper or Ask Questions

DeepISP: Learning End-to-End Image Processing Pipeline

Jan 20, 2018

Eli Schwartz, Raja Giryes, Alex M. Bronstein

Figure 1 for DeepISP: Learning End-to-End Image Processing Pipeline

Figure 2 for DeepISP: Learning End-to-End Image Processing Pipeline

Figure 3 for DeepISP: Learning End-to-End Image Processing Pipeline

Figure 4 for DeepISP: Learning End-to-End Image Processing Pipeline

Abstract:We present DeepISP, a full end-to-end deep neural model of the camera image signal processing (ISP) pipeline. Our model learns a mapping from the raw low-light mosaiced image to the final visually compelling image and encompasses low-level tasks such as demosaicing and denoising as well as higher-level tasks such as color correction and image adjustment. The training and evaluation of the pipeline was performed on a dedicated dataset containing pairs of low-light and well-lit images captured by a Samsung S7 smartphone camera in both raw and processed JPEG formats. The proposed solution achieves state-of-the-art performance in objective evaluation of PSNR on the subtask of joint denoising and demosaicing. For the full end-to-end pipeline, it achieves better visual quality compared to the manufacturer ISP, in both a subjective human assessment and when rated by a deep model trained for assessing image quality.

Via

Access Paper or Ask Questions

Towards CT-quality Ultrasound Imaging using Deep Learning

Oct 17, 2017

Sanketh Vedula, Ortal Senouf, Alex M. Bronstein, Oleg V. Michailovich, Michael Zibulevsky

Figure 1 for Towards CT-quality Ultrasound Imaging using Deep Learning

Figure 2 for Towards CT-quality Ultrasound Imaging using Deep Learning

Figure 3 for Towards CT-quality Ultrasound Imaging using Deep Learning

Figure 4 for Towards CT-quality Ultrasound Imaging using Deep Learning

Abstract:The cost-effectiveness and practical harmlessness of ultrasound imaging have made it one of the most widespread tools for medical diagnosis. Unfortunately, the beam-forming based image formation produces granular speckle noise, blurring, shading and other artifacts. To overcome these effects, the ultimate goal would be to reconstruct the tissue acoustic properties by solving a full wave propagation inverse problem. In this work, we make a step towards this goal, using Multi-Resolution Convolutional Neural Networks (CNN). As a result, we are able to reconstruct CT-quality images from the reflected ultrasound radio-frequency(RF) data obtained by simulation from real CT scans of a human body. We also show that CNN is able to imitate existing computationally heavy despeckling methods, thereby saving orders of magnitude in computations and making them amenable to real-time applications.

Via

Access Paper or Ask Questions