Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Guillermo Sapiro

University of Minnesota

Learning efficient sparse and low rank models

Dec 14, 2012

Pablo Sprechmann, Alex M. Bronstein, Guillermo Sapiro

Figure 1 for Learning efficient sparse and low rank models

Figure 2 for Learning efficient sparse and low rank models

Figure 3 for Learning efficient sparse and low rank models

Figure 4 for Learning efficient sparse and low rank models

Abstract:Parsimony, including sparsity and low rank, has been shown to successfully model data in numerous machine learning and signal processing tasks. Traditionally, such modeling approaches rely on an iterative algorithm that minimizes an objective function with parsimony-promoting terms. The inherently sequential structure and data-dependent complexity and latency of iterative optimization constitute a major limitation in many applications requiring real-time performance or involving large-scale data. Another limitation encountered by these modeling techniques is the difficulty of their inclusion in discriminative learning scenarios. In this work, we propose to move the emphasis from the model to the pursuit algorithm, and develop a process-centric view of parsimonious modeling, in which a learned deterministic fixed-complexity pursuit process is used in lieu of iterative optimization. We show a principled way to construct learnable pursuit process architectures for structured sparse and robust low rank models, derived from the iteration of proximal descent algorithms. These architectures learn to approximate the exact parsimonious representation at a fraction of the complexity of the standard optimization methods. We also show that appropriate training regimes allow to naturally extend parsimonious models to discriminative settings. State-of-the-art results are demonstrated on several challenging problems in image and audio processing with several orders of magnitude speedup compared to the exact optimization algorithms.

Via

Access Paper or Ask Questions

Computer vision tools for the non-invasive assessment of autism-related behavioral markers

Nov 08, 2012

Jordan Hashemi, Thiago Vallin Spina, Mariano Tepper, Amy Esler, Vassilios Morellas, Nikolaos Papanikolopoulos, Guillermo Sapiro

Figure 1 for Computer vision tools for the non-invasive assessment of autism-related behavioral markers

Figure 2 for Computer vision tools for the non-invasive assessment of autism-related behavioral markers

Figure 3 for Computer vision tools for the non-invasive assessment of autism-related behavioral markers

Figure 4 for Computer vision tools for the non-invasive assessment of autism-related behavioral markers

Abstract:The early detection of developmental disorders is key to child outcome, allowing interventions to be initiated that promote development and improve prognosis. Research on autism spectrum disorder (ASD) suggests behavioral markers can be observed late in the first year of life. Many of these studies involved extensive frame-by-frame video observation and analysis of a child's natural behavior. Although non-intrusive, these methods are extremely time-intensive and require a high level of observer training; thus, they are impractical for clinical and large population research purposes. Diagnostic measures for ASD are available for infants but are only accurate when used by specialists experienced in early diagnosis. This work is a first milestone in a long-term multidisciplinary project that aims at helping clinicians and general practitioners accomplish this early detection/measurement task automatically. We focus on providing computer vision tools to measure and identify ASD behavioral markers based on components of the Autism Observation Scale for Infants (AOSI). In particular, we develop algorithms to measure three critical AOSI activities that assess visual attention. We augment these AOSI activities with an additional test that analyzes asymmetrical patterns in unsupported gait. The first set of algorithms involves assessing head motion by tracking facial features, while the gait analysis relies on joint foreground segmentation and 2D body pose estimation in video. We show results that provide insightful knowledge to augment the clinician's behavioral observations obtained from real in-clinic assessments.

Via

Access Paper or Ask Questions

A Complete System for Candidate Polyps Detection in Virtual Colonoscopy

Sep 28, 2012

Marcelo Fiori, Pablo Musé, Guillermo Sapiro

Figure 1 for A Complete System for Candidate Polyps Detection in Virtual Colonoscopy

Figure 2 for A Complete System for Candidate Polyps Detection in Virtual Colonoscopy

Figure 3 for A Complete System for Candidate Polyps Detection in Virtual Colonoscopy

Figure 4 for A Complete System for Candidate Polyps Detection in Virtual Colonoscopy

Abstract:Computer tomographic colonography, combined with computer-aided detection, is a promising emerging technique for colonic polyp analysis. We present a complete pipeline for polyp detection, starting with a simple colon segmentation technique that enhances polyps, followed by an adaptive-scale candidate polyp delineation and classification based on new texture and geometric features that consider both the information in the candidate polyp location and its immediate surrounding area. The proposed system is tested with ground truth data, including flat and small polyps which are hard to detect even with optical colonoscopy. For polyps larger than 6mm in size we achieve 100% sensitivity with just 0.9 false positives per case, and for polyps larger than 3mm in size we achieve 93% sensitivity with 2.8 false positives per case.

Via

Access Paper or Ask Questions

Learning Robust Low-Rank Representations

Sep 27, 2012

Pablo Sprechmann, Alex M. Bronstein, Guillermo Sapiro

Figure 1 for Learning Robust Low-Rank Representations

Figure 2 for Learning Robust Low-Rank Representations

Figure 3 for Learning Robust Low-Rank Representations

Figure 4 for Learning Robust Low-Rank Representations

Abstract:In this paper we present a comprehensive framework for learning robust low-rank representations by combining and extending recent ideas for learning fast sparse coding regressors with structured non-convex optimization techniques. This approach connects robust principal component analysis (RPCA) with dictionary learning techniques and allows its approximation via trainable encoders. We propose an efficient feed-forward architecture derived from an optimization algorithm designed to exactly solve robust low dimensional projections. This architecture, in combination with different training objective functions, allows the regressors to be used as online approximants of the exact offline RPCA problem or as RPCA-based neural networks. Simple modifications of these encoders can handle challenging extensions, such as the inclusion of geometric data transformations. We present several examples with real data from image, audio, and video processing. When used to approximate RPCA, our basic implementation shows several orders of magnitude speedup compared to the exact solvers with almost no performance degradation. We show the strength of the inclusion of learning to the RPCA approach on a music source separation application, where the encoders outperform the exact RPCA algorithms, which are already reported to produce state-of-the-art results on a benchmark database. Our preliminary implementation on an iPad shows faster-than-real-time performance with minimal latency.

Via

Access Paper or Ask Questions

Are You Imitating Me? Unsupervised Sparse Modeling for Group Activity Analysis from a Single Video

Aug 27, 2012

Zhongwei Tang, Alexey Castrodad, Mariano Tepper, Guillermo Sapiro

Figure 1 for Are You Imitating Me? Unsupervised Sparse Modeling for Group Activity Analysis from a Single Video

Figure 2 for Are You Imitating Me? Unsupervised Sparse Modeling for Group Activity Analysis from a Single Video

Figure 3 for Are You Imitating Me? Unsupervised Sparse Modeling for Group Activity Analysis from a Single Video

Figure 4 for Are You Imitating Me? Unsupervised Sparse Modeling for Group Activity Analysis from a Single Video

Abstract:A framework for unsupervised group activity analysis from a single video is here presented. Our working hypothesis is that human actions lie on a union of low-dimensional subspaces, and thus can be efficiently modeled as sparse linear combinations of atoms from a learned dictionary representing the action's primitives. Contrary to prior art, and with the primary goal of spatio-temporal action grouping, in this work only one single video segment is available for both unsupervised learning and analysis without any prior training information. After extracting simple features at a single spatio-temporal scale, we learn a dictionary for each individual in the video during each short time lapse. These dictionaries allow us to compare the individuals' actions by producing an affinity matrix which contains sufficient discriminative information about the actions in the scene leading to grouping with simple and efficient tools. With diverse publicly available real videos, we demonstrate the effectiveness of the proposed framework and its robustness to cluttered backgrounds, changes of human appearance, and action variability.

Via

Access Paper or Ask Questions

Learning Efficient Structured Sparse Models

Jun 18, 2012

Alex Bronstein, Pablo Sprechmann, Guillermo Sapiro

Figure 1 for Learning Efficient Structured Sparse Models

Figure 2 for Learning Efficient Structured Sparse Models

Figure 3 for Learning Efficient Structured Sparse Models

Figure 4 for Learning Efficient Structured Sparse Models

Abstract:We present a comprehensive framework for structured sparse coding and modeling extending the recent ideas of using learnable fast regressors to approximate exact sparse codes. For this purpose, we develop a novel block-coordinate proximal splitting method for the iterative solution of hierarchical sparse coding problems, and show an efficient feed forward architecture derived from its iteration. This architecture faithfully approximates the exact structured sparse codes with a fraction of the complexity of the standard optimization methods. We also show that by using different training objective functions, learnable sparse encoders are no longer restricted to be mere approximants of the exact sparse code for a pre-given dictionary, as in earlier formulations, but can be rather used as full-featured sparse encoders or even modelers. A simple implementation shows several orders of magnitude speedup compared to the state-of-the-art at minimal performance degradation, making the proposed framework suitable for real time and large-scale applications.

* ICML2012

Via

Access Paper or Ask Questions

Semi-Supervised Single- and Multi-Domain Regression with Multi-Domain Training

Mar 20, 2012

Tomer Michaeli, Yonina C. Eldar, Guillermo Sapiro

Figure 1 for Semi-Supervised Single- and Multi-Domain Regression with Multi-Domain Training

Figure 2 for Semi-Supervised Single- and Multi-Domain Regression with Multi-Domain Training

Figure 3 for Semi-Supervised Single- and Multi-Domain Regression with Multi-Domain Training

Figure 4 for Semi-Supervised Single- and Multi-Domain Regression with Multi-Domain Training

Abstract:We address the problems of multi-domain and single-domain regression based on distinct and unpaired labeled training sets for each of the domains and a large unlabeled training set from all domains. We formulate these problems as a Bayesian estimation with partial knowledge of statistical relations. We propose a worst-case design strategy and study the resulting estimators. Our analysis explicitly accounts for the cardinality of the labeled sets and includes the special cases in which one of the labeled sets is very large or, in the other extreme, completely missing. We demonstrate our estimators in the context of removing expressions from facial images and in the context of audio-visual word recognition, and provide comparisons to several recently proposed multi-modal learning algorithms.

* 24 pages, 6 figures, 2 tables

Via

Access Paper or Ask Questions

Task-Driven Adaptive Statistical Compressive Sensing of Gaussian Mixture Models

Jan 25, 2012

Julio M. Duarte-Carvajalino, Guoshen Yu, Lawrence Carin, Guillermo Sapiro

Figure 1 for Task-Driven Adaptive Statistical Compressive Sensing of Gaussian Mixture Models

Figure 2 for Task-Driven Adaptive Statistical Compressive Sensing of Gaussian Mixture Models

Figure 3 for Task-Driven Adaptive Statistical Compressive Sensing of Gaussian Mixture Models

Figure 4 for Task-Driven Adaptive Statistical Compressive Sensing of Gaussian Mixture Models

Abstract:A framework for adaptive and non-adaptive statistical compressive sensing is developed, where a statistical model replaces the standard sparsity model of classical compressive sensing. We propose within this framework optimal task-specific sensing protocols specifically and jointly designed for classification and reconstruction. A two-step adaptive sensing paradigm is developed, where online sensing is applied to detect the signal class in the first step, followed by a reconstruction step adapted to the detected class and the observed samples. The approach is based on information theory, here tailored for Gaussian mixture models (GMMs), where an information-theoretic objective relationship between the sensed signals and a representation of the specific task of interest is maximized. Experimental results using synthetic signals, Landsat satellite attributes, and natural images of different sizes and with different noise levels show the improvements achieved using the proposed framework when compared to more standard sensing protocols. The underlying formulation can be applied beyond GMMs, at the price of higher mathematical and computational complexity.

Via

Access Paper or Ask Questions

Online Adaptive Statistical Compressed Sensing of Gaussian Mixture Models

Dec 26, 2011

Julio Duarte-Carvajalino, Guillermo Sapiro, Guoshen Yu, Lawrence Carin

Figure 1 for Online Adaptive Statistical Compressed Sensing of Gaussian Mixture Models

Figure 2 for Online Adaptive Statistical Compressed Sensing of Gaussian Mixture Models

Figure 3 for Online Adaptive Statistical Compressed Sensing of Gaussian Mixture Models

Figure 4 for Online Adaptive Statistical Compressed Sensing of Gaussian Mixture Models

Abstract:A framework of online adaptive statistical compressed sensing is introduced for signals following a mixture model. The scheme first uses non-adaptive measurements, from which an online decoding scheme estimates the model selection. As soon as a candidate model has been selected, an optimal sensing scheme for the selected model continues to apply. The final signal reconstruction is calculated from the ensemble of both the non-adaptive and the adaptive measurements. For signals generated from a Gaussian mixture model, the online adaptive sensing algorithm is given and its performance is analyzed. On both synthetic and real image data, the proposed adaptive scheme considerably reduces the average reconstruction error with respect to standard statistical compressed sensing that uses fully random measurements, at a marginally increased computational complexity.

Via

Access Paper or Ask Questions

An MDL framework for sparse coding and dictionary learning

Oct 11, 2011

Ignacio Ramírez, Guillermo Sapiro

Figure 1 for An MDL framework for sparse coding and dictionary learning

Figure 2 for An MDL framework for sparse coding and dictionary learning

Figure 3 for An MDL framework for sparse coding and dictionary learning

Figure 4 for An MDL framework for sparse coding and dictionary learning

Abstract:The power of sparse signal modeling with learned over-complete dictionaries has been demonstrated in a variety of applications and fields, from signal processing to statistical inference and machine learning. However, the statistical properties of these models, such as under-fitting or over-fitting given sets of data, are still not well characterized in the literature. As a result, the success of sparse modeling depends on hand-tuning critical parameters for each data and application. This work aims at addressing this by providing a practical and objective characterization of sparse models by means of the Minimum Description Length (MDL) principle -- a well established information-theoretic approach to model selection in statistical inference. The resulting framework derives a family of efficient sparse coding and dictionary learning algorithms which, by virtue of the MDL principle, are completely parameter free. Furthermore, such framework allows to incorporate additional prior information to existing models, such as Markovian dependencies, or to define completely new problem formulations, including in the matrix analysis area, in a natural way. These virtues will be demonstrated with parameter-free algorithms for the classic image denoising and classification problems, and for low-rank matrix recovery in video applications.

Via

Access Paper or Ask Questions