Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Erik B. Sudderth

Cascaded Scene Flow Prediction using Semantic Segmentation

Oct 05, 2017

Zhile Ren, Deqing Sun, Jan Kautz, Erik B. Sudderth

Figure 1 for Cascaded Scene Flow Prediction using Semantic Segmentation

Figure 2 for Cascaded Scene Flow Prediction using Semantic Segmentation

Figure 3 for Cascaded Scene Flow Prediction using Semantic Segmentation

Figure 4 for Cascaded Scene Flow Prediction using Semantic Segmentation

Abstract:Given two consecutive frames from a pair of stereo cameras, 3D scene flow methods simultaneously estimate the 3D geometry and motion of the observed scene. Many existing approaches use superpixels for regularization, but may predict inconsistent shapes and motions inside rigidly moving objects. We instead assume that scenes consist of foreground objects rigidly moving in front of a static background, and use semantic cues to produce pixel-accurate scene flow estimates. Our cascaded classification framework accurately models 3D scenes by iteratively refining semantic segmentation masks, stereo correspondences, 3D rigid motion estimates, and optical flow fields. We evaluate our method on the challenging KITTI autonomous driving benchmark, and show that accounting for the motion of segmented vehicles leads to state-of-the-art performance.

* International Conference on 3D Vision (3DV), 2017 (oral presentation)

Via

Access Paper or Ask Questions

Prediction-Constrained Training for Semi-Supervised Mixture and Topic Models

Jul 23, 2017

Michael C. Hughes, Leah Weiner, Gabriel Hope, Thomas H. McCoy Jr., Roy H. Perlis, Erik B. Sudderth, Finale Doshi-Velez

Figure 1 for Prediction-Constrained Training for Semi-Supervised Mixture and Topic Models

Figure 2 for Prediction-Constrained Training for Semi-Supervised Mixture and Topic Models

Figure 3 for Prediction-Constrained Training for Semi-Supervised Mixture and Topic Models

Figure 4 for Prediction-Constrained Training for Semi-Supervised Mixture and Topic Models

Abstract:Supervisory signals have the potential to make low-dimensional data representations, like those learned by mixture and topic models, more interpretable and useful. We propose a framework for training latent variable models that explicitly balances two goals: recovery of faithful generative explanations of high-dimensional data, and accurate prediction of associated semantic labels. Existing approaches fail to achieve these goals due to an incomplete treatment of a fundamental asymmetry: the intended application is always predicting labels from data, not data from labels. Our prediction-constrained objective for training generative models coherently integrates loss-based supervisory signals while enabling effective semi-supervised learning from partially labeled data. We derive learning algorithms for semi-supervised mixture and topic models using stochastic gradient descent with automatic differentiation. We demonstrate improved prediction quality compared to several previous supervised topic models, achieving predictions competitive with high-dimensional logistic regression on text sentiment analysis and electronic health records tasks while simultaneously learning interpretable topics.

Via

Access Paper or Ask Questions

Fast Learning of Clusters and Topics via Sparse Posteriors

Sep 23, 2016

Michael C. Hughes, Erik B. Sudderth

Figure 1 for Fast Learning of Clusters and Topics via Sparse Posteriors

Figure 2 for Fast Learning of Clusters and Topics via Sparse Posteriors

Figure 3 for Fast Learning of Clusters and Topics via Sparse Posteriors

Figure 4 for Fast Learning of Clusters and Topics via Sparse Posteriors

Abstract:Mixture models and topic models generate each observation from a single cluster, but standard variational posteriors for each observation assign positive probability to all possible clusters. This requires dense storage and runtime costs that scale with the total number of clusters, even though typically only a few clusters have significant posterior mass for any data point. We propose a constrained family of sparse variational distributions that allow at most $L$ non-zero entries, where the tunable threshold $L$ trades off speed for accuracy. Previous sparse approximations have used hard assignments ($L=1$), but we find that moderate values of $L>1$ provide superior performance. Our approach easily integrates with stochastic or incremental optimization algorithms to scale to millions of examples. Experiments training mixture models of image patches and topic models for news articles show that our approach produces better-quality models in far less time than baseline methods.

Via

Access Paper or Ask Questions

Joint modeling of multiple time series via the beta process with application to motion capture segmentation

Nov 13, 2014

Emily B. Fox, Michael C. Hughes, Erik B. Sudderth, Michael I. Jordan

Figure 1 for Joint modeling of multiple time series via the beta process with application to motion capture segmentation

Figure 2 for Joint modeling of multiple time series via the beta process with application to motion capture segmentation

Figure 3 for Joint modeling of multiple time series via the beta process with application to motion capture segmentation

Figure 4 for Joint modeling of multiple time series via the beta process with application to motion capture segmentation

Abstract:We propose a Bayesian nonparametric approach to the problem of jointly modeling multiple related time series. Our model discovers a latent set of dynamical behaviors shared among the sequences, and segments each time series into regions defined by a subset of these behaviors. Using a beta process prior, the size of the behavior set and the sharing pattern are both inferred from data. We develop Markov chain Monte Carlo (MCMC) methods based on the Indian buffet process representation of the predictive distribution of the beta process. Our MCMC inference algorithm efficiently adds and removes behaviors via novel split-merge moves as well as data-driven birth and death proposals, avoiding the need to consider a truncated model. We demonstrate promising results on unsupervised segmentation of human motion capture data.

* Annals of Applied Statistics 2014, Vol. 8, No. 3, 1281-1313
* Published in at http://dx.doi.org/10.1214/14-AOAS742 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org). arXiv admin note: text overlap with arXiv:1111.4226

Via

Access Paper or Ask Questions

Gibbs Sampling in Open-Universe Stochastic Languages

Mar 15, 2012

Nimar S. Arora, Rodrigo de Salvo Braz, Erik B. Sudderth, Stuart Russell

Figure 1 for Gibbs Sampling in Open-Universe Stochastic Languages

Figure 2 for Gibbs Sampling in Open-Universe Stochastic Languages

Figure 3 for Gibbs Sampling in Open-Universe Stochastic Languages

Figure 4 for Gibbs Sampling in Open-Universe Stochastic Languages

Abstract:Languages for open-universe probabilistic models (OUPMs) can represent situations with an unknown number of objects and iden- tity uncertainty. While such cases arise in a wide range of important real-world appli- cations, existing general purpose inference methods for OUPMs are far less efficient than those available for more restricted lan- guages and model classes. This paper goes some way to remedying this deficit by in- troducing, and proving correct, a generaliza- tion of Gibbs sampling to partial worlds with possibly varying model structure. Our ap- proach draws on and extends previous generic OUPM inference methods, as well as aux- iliary variable samplers for nonparametric mixture models. It has been implemented for BLOG, a well-known OUPM language. Combined with compile-time optimizations, the resulting algorithm yields very substan- tial speedups over existing methods on sev- eral test cases, and substantially improves the practicality of OUPM languages generally.

* Appears in Proceedings of the Twenty-Sixth Conference on Uncertainty in Artificial Intelligence (UAI2010)

Via

Access Paper or Ask Questions

Joint Modeling of Multiple Related Time Series via the Beta Process

Nov 17, 2011

Emily B. Fox, Erik B. Sudderth, Michael I. Jordan, Alan S. Willsky

Figure 1 for Joint Modeling of Multiple Related Time Series via the Beta Process

Figure 2 for Joint Modeling of Multiple Related Time Series via the Beta Process

Figure 3 for Joint Modeling of Multiple Related Time Series via the Beta Process

Figure 4 for Joint Modeling of Multiple Related Time Series via the Beta Process

Abstract:We propose a Bayesian nonparametric approach to the problem of jointly modeling multiple related time series. Our approach is based on the discovery of a set of latent, shared dynamical behaviors. Using a beta process prior, the size of the set and the sharing pattern are both inferred from data. We develop efficient Markov chain Monte Carlo methods based on the Indian buffet process representation of the predictive distribution of the beta process, without relying on a truncated model. In particular, our approach uses the sum-product algorithm to efficiently compute Metropolis-Hastings acceptance probabilities, and explores new dynamical behaviors via birth and death proposals. We examine the benefits of our proposed feature-based model on several synthetic datasets, and also demonstrate promising results on unsupervised segmentation of visual motion capture data.

* 33 pages, 8 figures

Via

Access Paper or Ask Questions

A sticky HDP-HMM with application to speaker diarization

Aug 16, 2011

Emily B. Fox, Erik B. Sudderth, Michael I. Jordan, Alan S. Willsky

Figure 1 for A sticky HDP-HMM with application to speaker diarization

Figure 2 for A sticky HDP-HMM with application to speaker diarization

Figure 3 for A sticky HDP-HMM with application to speaker diarization

Figure 4 for A sticky HDP-HMM with application to speaker diarization

Abstract:We consider the problem of speaker diarization, the problem of segmenting an audio recording of a meeting into temporal segments corresponding to individual speakers. The problem is rendered particularly difficult by the fact that we are not allowed to assume knowledge of the number of people participating in the meeting. To address this problem, we take a Bayesian nonparametric approach to speaker diarization that builds on the hierarchical Dirichlet process hidden Markov model (HDP-HMM) of Teh et al. [J. Amer. Statist. Assoc. 101 (2006) 1566--1581]. Although the basic HDP-HMM tends to over-segment the audio data---creating redundant states and rapidly switching among them---we describe an augmented HDP-HMM that provides effective control over the switching rate. We also show that this augmentation makes it possible to treat emission distributions nonparametrically. To scale the resulting architecture to realistic diarization problems, we develop a sampling algorithm that employs a truncated approximation of the Dirichlet process to jointly resample the full state sequence, greatly improving mixing rates. Working with a benchmark NIST data set, we show that our Bayesian nonparametric architecture yields state-of-the-art speaker diarization results.

* Annals of Applied Statistics 2011, Vol. 5, No. 2A, 1020-1056
* Published in at http://dx.doi.org/10.1214/10-AOAS395 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Via

Access Paper or Ask Questions

Bayesian Nonparametric Inference of Switching Linear Dynamical Systems

Mar 19, 2010

Emily B. Fox, Erik B. Sudderth, Michael I. Jordan, Alan S. Willsky

Figure 1 for Bayesian Nonparametric Inference of Switching Linear Dynamical Systems

Figure 2 for Bayesian Nonparametric Inference of Switching Linear Dynamical Systems

Figure 3 for Bayesian Nonparametric Inference of Switching Linear Dynamical Systems

Figure 4 for Bayesian Nonparametric Inference of Switching Linear Dynamical Systems

Abstract:Many complex dynamical phenomena can be effectively modeled by a system that switches among a set of conditionally linear dynamical modes. We consider two such models: the switching linear dynamical system (SLDS) and the switching vector autoregressive (VAR) process. Our Bayesian nonparametric approach utilizes a hierarchical Dirichlet process prior to learn an unknown number of persistent, smooth dynamical modes. We additionally employ automatic relevance determination to infer a sparse set of dynamic dependencies allowing us to learn SLDS with varying state dimension or switching VAR processes with varying autoregressive order. We develop a sampling algorithm that combines a truncated approximation to the Dirichlet process with efficient joint sampling of the mode and state sequences. The utility and flexibility of our model are demonstrated on synthetic data, sequences of dancing honey bees, the IBOVESPA stock index, and a maneuvering target tracking application.

* 50 pages, 7 figures

Via

Access Paper or Ask Questions