Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Thomas Wiegand

DeepCABAC: Context-adaptive binary arithmetic coding for deep neural network compression

May 15, 2019
Simon Wiedemann, Heiner Kirchhoffer, Stefan Matlage, Paul Haase, Arturo Marban, Talmaj Marinc, David Neumann, Ahmed Osman, Detlev Marpe, Heiko Schwarz, Thomas Wiegand, Wojciech Samek

Figure 1 for DeepCABAC: Context-adaptive binary arithmetic coding for deep neural network compression

Figure 2 for DeepCABAC: Context-adaptive binary arithmetic coding for deep neural network compression

We present DeepCABAC, a novel context-adaptive binary arithmetic coder for compressing deep neural networks. It quantizes each weight parameter by minimizing a weighted rate-distortion function, which implicitly takes the impact of quantization on to the accuracy of the network into account. Subsequently, it compresses the quantized values into a bitstream representation with minimal redundancies. We show that DeepCABAC is able to reach very high compression ratios across a wide set of different network architectures and datasets. For instance, we are able to compress by x63.6 the VGG16 ImageNet model with no loss of accuracy, thus being able to represent the entire network with merely 8.7MB.

* ICML 2019, Joint Workshop on On-Device Machine Learning and Compact Deep Neural Network Representations (ODML-CDNNR)

Via

Access Paper or Ask Questions

Focus Group on Artificial Intelligence for Health

Sep 13, 2018
Marcel Salathé, Thomas Wiegand, Markus Wenzel

Figure 1 for Focus Group on Artificial Intelligence for Health

Figure 2 for Focus Group on Artificial Intelligence for Health

Artificial Intelligence (AI) - the phenomenon of machines being able to solve problems that require human intelligence - has in the past decade seen an enormous rise of interest due to significant advances in effectiveness and use. The health sector, one of the most important sectors for societies and economies worldwide, is particularly interesting for AI applications, given the ongoing digitalisation of all types of health information. The potential for AI assistance in the health domain is immense, because AI can support medical decision making at reduced costs, everywhere. However, due to the complexity of AI algorithms, it is difficult to distinguish good from bad AI-based solutions and to understand their strengths and weaknesses, which is crucial for clarifying responsibilities and for building trust. For this reason, the International Telecommunication Union (ITU) has established a new Focus Group on "Artificial Intelligence for Health" (FG-AI4H) in partnership with the World Health Organization (WHO). Health and care services are usually the responsibility of a government - even when provided through private insurance systems - and thus under the responsibility of WHO/ITU member states. FG-AI4H will identify opportunities for international standardization, which will foster the application of AI to health issues on a global scale. In particular, it will establish a standardized assessment framework with open benchmarks for the evaluation of AI-based methods for health, such as AI-based diagnosis, triage or treatment decisions.

* Whitepaper on ITU Focus Group AI4H for 1st workshop at WHO

Via

Access Paper or Ask Questions

Deep Neural Networks for No-Reference and Full-Reference Image Quality Assessment

Dec 07, 2017
Sebastian Bosse, Dominique Maniry, Klaus-Robert Müller, Thomas Wiegand, Wojciech Samek

Figure 1 for Deep Neural Networks for No-Reference and Full-Reference Image Quality Assessment

Figure 2 for Deep Neural Networks for No-Reference and Full-Reference Image Quality Assessment

Figure 3 for Deep Neural Networks for No-Reference and Full-Reference Image Quality Assessment

Figure 4 for Deep Neural Networks for No-Reference and Full-Reference Image Quality Assessment

We present a deep neural network-based approach to image quality assessment (IQA). The network is trained end-to-end and comprises ten convolutional layers and five pooling layers for feature extraction, and two fully connected layers for regression, which makes it significantly deeper than related IQA models. Unique features of the proposed architecture are that: 1) with slight adaptations it can be used in a no-reference (NR) as well as in a full-reference (FR) IQA setting and 2) it allows for joint learning of local quality and local weights, i.e., relative importance of local quality to the global quality estimate, in an unified framework. Our approach is purely data-driven and does not rely on hand-crafted features or other types of prior domain knowledge about the human visual system or image statistics. We evaluate the proposed approach on the LIVE, CISQ, and TID2013 databases as well as the LIVE In the wild image quality challenge database and show superior performance to state-of-the-art NR and FR IQA methods. Finally, cross-database evaluation shows a high ability to generalize between different databases, indicating a high robustness of the learned features.

* IEEE Transactions on Image Processing, 27(1):206-219, 2018

Via

Access Paper or Ask Questions

A Haar Wavelet-Based Perceptual Similarity Index for Image Quality Assessment

Nov 06, 2017
Rafael Reisenhofer, Sebastian Bosse, Gitta Kutyniok, Thomas Wiegand

Figure 1 for A Haar Wavelet-Based Perceptual Similarity Index for Image Quality Assessment

Figure 2 for A Haar Wavelet-Based Perceptual Similarity Index for Image Quality Assessment

Figure 3 for A Haar Wavelet-Based Perceptual Similarity Index for Image Quality Assessment

Figure 4 for A Haar Wavelet-Based Perceptual Similarity Index for Image Quality Assessment

In most practical situations, the compression or transmission of images and videos creates distortions that will eventually be perceived by a human observer. Vice versa, image and video restoration techniques, such as inpainting or denoising, aim to enhance the quality of experience of human viewers. Correctly assessing the similarity between an image and an undistorted reference image as subjectively experienced by a human viewer can thus lead to significant improvements in any transmission, compression, or restoration system. This paper introduces the Haar wavelet-based perceptual similarity index (HaarPSI), a novel and computationally inexpensive similarity measure for full reference image quality assessment. The HaarPSI utilizes the coefficients obtained from a Haar wavelet decomposition to assess local similarities between two images, as well as the relative importance of image areas. The consistency of the HaarPSI with the human quality of experience was validated on four large benchmark databases containing thousands of differently distorted images. On these databases, the HaarPSI achieves higher correlations with human opinion scores than state-of-the-art full reference similarity measures like the structural similarity index (SSIM), the feature similarity index (FSIM), and the visual saliency-based index (VSI). Along with the simple computational structure and the short execution time, these experimental results suggest a high applicability of the HaarPSI in real world tasks.

* Signal Processing: Image Communication 61 (2018) 33-43

Via

Access Paper or Ask Questions

The Convergence of Machine Learning and Communications

Aug 28, 2017
Wojciech Samek, Slawomir Stanczak, Thomas Wiegand

Figure 1 for The Convergence of Machine Learning and Communications

Figure 2 for The Convergence of Machine Learning and Communications

Figure 3 for The Convergence of Machine Learning and Communications

Figure 4 for The Convergence of Machine Learning and Communications

The areas of machine learning and communication technology are converging. Today's communications systems generate a huge amount of traffic data, which can help to significantly enhance the design and management of networks and communication components when combined with advanced machine learning methods. Furthermore, recently developed end-to-end training procedures offer new ways to jointly optimize the components of a communication system. Also in many emerging application fields of communication technology, e.g., smart cities or internet of things, machine learning methods are of central importance. This paper gives an overview over the use of machine learning in different areas of communications and discusses two exemplar applications in wireless networking. Furthermore, it identifies promising future research topics and discusses their potential impact.

* 8 pages, 4 figures

Via

Access Paper or Ask Questions

Explainable Artificial Intelligence: Understanding, Visualizing and Interpreting Deep Learning Models

Aug 28, 2017
Wojciech Samek, Thomas Wiegand, Klaus-Robert Müller

Figure 1 for Explainable Artificial Intelligence: Understanding, Visualizing and Interpreting Deep Learning Models

Figure 2 for Explainable Artificial Intelligence: Understanding, Visualizing and Interpreting Deep Learning Models

With the availability of large databases and recent improvements in deep learning methodology, the performance of AI systems is reaching or even exceeding the human level on an increasing number of complex tasks. Impressive examples of this development can be found in domains such as image classification, sentiment analysis, speech understanding or strategic game playing. However, because of their nested non-linear structure, these highly successful machine learning and artificial intelligence models are usually applied in a black box manner, i.e., no information is provided about what exactly makes them arrive at their predictions. Since this lack of transparency can be a major drawback, e.g., in medical applications, the development of methods for visualizing, explaining and interpreting deep learning models has recently attracted increasing attention. This paper summarizes recent developments in this field and makes a plea for more interpretability in artificial intelligence. Furthermore, it presents two approaches to explaining predictions of deep learning models, one method which computes the sensitivity of the prediction with respect to changes in the input and one approach which meaningfully decomposes the decision in terms of the input variables. These methods are evaluated on three classification tasks.

* 8 pages, 2 figures

Via

Access Paper or Ask Questions