Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Antonio Criminisi

Unsupervised domain adaptation in brain lesion segmentation with adversarial networks

Dec 28, 2016

Konstantinos Kamnitsas, Christian Baumgartner, Christian Ledig, Virginia F. J. Newcombe, Joanna P. Simpson, Andrew D. Kane, David K. Menon, Aditya Nori, Antonio Criminisi, Daniel Rueckert(+1 more)

Figure 1 for Unsupervised domain adaptation in brain lesion segmentation with adversarial networks

Figure 2 for Unsupervised domain adaptation in brain lesion segmentation with adversarial networks

Figure 3 for Unsupervised domain adaptation in brain lesion segmentation with adversarial networks

Figure 4 for Unsupervised domain adaptation in brain lesion segmentation with adversarial networks

Abstract:Significant advances have been made towards building accurate automatic segmentation systems for a variety of biomedical applications using machine learning. However, the performance of these systems often degrades when they are applied on new data that differ from the training data, for example, due to variations in imaging protocols. Manually annotating new data for each test domain is not a feasible solution. In this work we investigate unsupervised domain adaptation using adversarial neural networks to train a segmentation method which is more invariant to differences in the input data, and which does not require any annotations on the test domain. Specifically, we learn domain-invariant features by learning to counter an adversarial network, which attempts to classify the domain of the input data by observing the activations of the segmentation network. Furthermore, we propose a multi-connected domain discriminator for improved adversarial training. Our system is evaluated using two MR databases of subjects with traumatic brain injuries, acquired using different scanners and imaging protocols. Using our unsupervised approach, we obtain segmentation accuracies which are close to the upper bound of supervised domain adaptation.

Via

Access Paper or Ask Questions

Deep Roots: Improving CNN Efficiency with Hierarchical Filter Groups

Nov 30, 2016

Yani Ioannou, Duncan Robertson, Roberto Cipolla, Antonio Criminisi

Figure 1 for Deep Roots: Improving CNN Efficiency with Hierarchical Filter Groups

Figure 2 for Deep Roots: Improving CNN Efficiency with Hierarchical Filter Groups

Figure 3 for Deep Roots: Improving CNN Efficiency with Hierarchical Filter Groups

Figure 4 for Deep Roots: Improving CNN Efficiency with Hierarchical Filter Groups

Abstract:We propose a new method for creating computationally efficient and compact convolutional neural networks (CNNs) using a novel sparse connection structure that resembles a tree root. This allows a significant reduction in computational cost and number of parameters compared to state-of-the-art deep CNNs, without compromising accuracy, by exploiting the sparsity of inter-layer filter dependencies. We validate our approach by using it to train more efficient variants of state-of-the-art CNN architectures, evaluated on the CIFAR10 and ILSVRC datasets. Our results show similar or higher accuracy than the baseline architectures with much less computation, as measured by CPU and GPU timings. For example, for ResNet 50, our model has 40% fewer parameters, 45% fewer floating point operations, and is 31% (12%) faster on a CPU (GPU). For the deeper ResNet 200 our model has 25% fewer floating point operations and 44% fewer parameters, while maintaining state-of-the-art accuracy. For GoogLeNet, our model has 7% fewer parameters and is 21% (16%) faster on a CPU (GPU).

* Updated full version of paper, in full letter paper two-column paper. Includes many textual changes, updated CIFAR10 results, and new analysis of inter/intra-layer correlation

Via

Access Paper or Ask Questions

Predicting Personal Traits from Facial Images using Convolutional Neural Networks Augmented with Facial Landmark Information

May 29, 2016

Yoad Lewenberg, Yoram Bachrach, Sukrit Shankar, Antonio Criminisi

Figure 1 for Predicting Personal Traits from Facial Images using Convolutional Neural Networks Augmented with Facial Landmark Information

Figure 2 for Predicting Personal Traits from Facial Images using Convolutional Neural Networks Augmented with Facial Landmark Information

Figure 3 for Predicting Personal Traits from Facial Images using Convolutional Neural Networks Augmented with Facial Landmark Information

Figure 4 for Predicting Personal Traits from Facial Images using Convolutional Neural Networks Augmented with Facial Landmark Information

Abstract:We consider the task of predicting various traits of a person given an image of their face. We estimate both objective traits, such as gender, ethnicity and hair-color; as well as subjective traits, such as the emotion a person expresses or whether he is humorous or attractive. For sizeable experimentation, we contribute a new Face Attributes Dataset (FAD), having roughly 200,000 attribute labels for the above traits, for over 10,000 facial images. Due to the recent surge of research on Deep Convolutional Neural Networks (CNNs), we begin by using a CNN architecture for estimating facial attributes and show that they indeed provide an impressive baseline performance. To further improve performance, we propose a novel approach that incorporates facial landmark information for input images as an additional channel, helping the CNN learn better attribute-specific features so that the landmarks across various training images hold correspondence. We empirically analyse the performance of our method, showing consistent improvement over the baseline across traits.

* 7 pages, 5 figures, IJCAI 2016

Via

Access Paper or Ask Questions

Refining Architectures of Deep Convolutional Neural Networks

Apr 22, 2016

Sukrit Shankar, Duncan Robertson, Yani Ioannou, Antonio Criminisi, Roberto Cipolla

Figure 1 for Refining Architectures of Deep Convolutional Neural Networks

Figure 2 for Refining Architectures of Deep Convolutional Neural Networks

Figure 3 for Refining Architectures of Deep Convolutional Neural Networks

Figure 4 for Refining Architectures of Deep Convolutional Neural Networks

Abstract:Deep Convolutional Neural Networks (CNNs) have recently evinced immense success for various image recognition tasks. However, a question of paramount importance is somewhat unanswered in deep learning research - is the selected CNN optimal for the dataset in terms of accuracy and model size? In this paper, we intend to answer this question and introduce a novel strategy that alters the architecture of a given CNN for a specified dataset, to potentially enhance the original accuracy while possibly reducing the model size. We use two operations for architecture refinement, viz. stretching and symmetrical splitting. Our procedure starts with a pre-trained CNN for a given dataset, and optimally decides the stretch and split factors across the network to refine the architecture. We empirically demonstrate the necessity of the two operations. We evaluate our approach on two natural scenes attributes datasets, SUN Attributes and CAMIT-NSAD, with architectures of GoogleNet and VGG-11, that are quite contrasting in their construction. We justify our choice of datasets, and show that they are interestingly distinct from each other, and together pose a challenge to our architectural refinement algorithm. Our results substantiate the usefulness of the proposed method.

* 9 pages, 6 figures, CVPR 2016

Via

Access Paper or Ask Questions

Decision Forests, Convolutional Networks and the Models in-Between

Mar 03, 2016

Yani Ioannou, Duncan Robertson, Darko Zikic, Peter Kontschieder, Jamie Shotton, Matthew Brown, Antonio Criminisi

Figure 1 for Decision Forests, Convolutional Networks and the Models in-Between

Figure 2 for Decision Forests, Convolutional Networks and the Models in-Between

Figure 3 for Decision Forests, Convolutional Networks and the Models in-Between

Figure 4 for Decision Forests, Convolutional Networks and the Models in-Between

Abstract:This paper investigates the connections between two state of the art classifiers: decision forests (DFs, including decision jungles) and convolutional neural networks (CNNs). Decision forests are computationally efficient thanks to their conditional computation property (computation is confined to only a small region of the tree, the nodes along a single branch). CNNs achieve state of the art accuracy, thanks to their representation learning capabilities. We present a systematic analysis of how to fuse conditional computation with representation learning and achieve a continuum of hybrid models with different ratios of accuracy vs. efficiency. We call this new family of hybrid models conditional networks. Conditional networks can be thought of as: i) decision trees augmented with data transformation operators, or ii) CNNs, with block-diagonal sparse weight matrices, and explicit data routing functions. Experimental validation is performed on the common task of image classification on both the CIFAR and Imagenet datasets. Compared to state of the art CNNs, our hybrid models yield the same accuracy with a fraction of the compute cost and much smaller number of parameters.

* Microsoft Research Technical Report

Via

Access Paper or Ask Questions

Training CNNs with Low-Rank Filters for Efficient Image Classification

Feb 07, 2016

Yani Ioannou, Duncan Robertson, Jamie Shotton, Roberto Cipolla, Antonio Criminisi

Figure 1 for Training CNNs with Low-Rank Filters for Efficient Image Classification

Figure 2 for Training CNNs with Low-Rank Filters for Efficient Image Classification

Figure 3 for Training CNNs with Low-Rank Filters for Efficient Image Classification

Figure 4 for Training CNNs with Low-Rank Filters for Efficient Image Classification

Abstract:We propose a new method for creating computationally efficient convolutional neural networks (CNNs) by using low-rank representations of convolutional filters. Rather than approximating filters in previously-trained networks with more efficient versions, we learn a set of small basis filters from scratch; during training, the network learns to combine these basis filters into more complex filters that are discriminative for image classification. To train such networks, a novel weight initialization scheme is used. This allows effective initialization of connection weights in convolutional layers composed of groups of differently-shaped filters. We validate our approach by applying it to several existing CNN architectures and training these networks from scratch using the CIFAR, ILSVRC and MIT Places datasets. Our results show similar or higher accuracy than conventional CNNs with much less compute. Applying our method to an improved version of VGG-11 network using global max-pooling, we achieve comparable validation accuracy using 41% less compute and only 24% of the original VGG-11 model parameters; another variant of our method gives a 1 percentage point increase in accuracy over our improved VGG-11 model, giving a top-5 center-crop validation accuracy of 89.7% while reducing computation by 16% relative to the original VGG-11 model. Applying our method to the GoogLeNet architecture for ILSVRC, we achieved comparable accuracy with 26% less compute and 41% fewer model parameters. Applying our method to a near state-of-the-art network for CIFAR, we achieved comparable accuracy with 46% less compute and 55% fewer parameters.

* International Conference on Learning Representations (ICLR), San Juan, Puerto Rico, 2-4 May 2016
* Published as a conference paper at ICLR 2016. v3: updated ICLR status. v2: Incorporated reviewer's feedback including: Amend Fig. 2 and 5 descriptions to explain that there are no ReLUs within the figures. Fix headings of Table 5 - Fix typo in the sentence at bottom of page 6. Add ref. to Predicting Parameters in Deep Learning. Fix Table 6, GMP-LR and GMP-LR-2x had incorrect numbers of filters

Via

Access Paper or Ask Questions

WESD - Weighted Spectral Distance for Measuring Shape Dissimilarity

Aug 24, 2012

Ender Konukoglu, Ben Glocker, Antonio Criminisi, Kilian M. Pohl

Figure 1 for WESD - Weighted Spectral Distance for Measuring Shape Dissimilarity

Figure 2 for WESD - Weighted Spectral Distance for Measuring Shape Dissimilarity

Figure 3 for WESD - Weighted Spectral Distance for Measuring Shape Dissimilarity

Figure 4 for WESD - Weighted Spectral Distance for Measuring Shape Dissimilarity

Abstract:This article presents a new distance for measuring shape dissimilarity between objects. Recent publications introduced the use of eigenvalues of the Laplace operator as compact shape descriptors. Here, we revisit the eigenvalues to define a proper distance, called Weighted Spectral Distance (WESD), for quantifying shape dissimilarity. The definition of WESD is derived through analysing the heat-trace. This analysis provides the proposed distance an intuitive meaning and mathematically links it to the intrinsic geometry of objects. We analyse the resulting distance definition, present and prove its important theoretical properties. Some of these properties include: i) WESD is defined over the entire sequence of eigenvalues yet it is guaranteed to converge, ii) it is a pseudometric, iii) it is accurately approximated with a finite number of eigenvalues, and iv) it can be mapped to the [0,1) interval. Lastly, experiments conducted on synthetic and real objects are presented. These experiments highlight the practical benefits of WESD for applications in vision and medical image analysis.

Via

Access Paper or Ask Questions