Richard G. Baraniuk

Out-of-Distribution Detection Using Neural Rendering Generative Models

Jul 10, 2019

Yujia Huang, Sihui Dai, Tan Nguyen, Richard G. Baraniuk, Anima Anandkumar

Abstract:Out-of-distribution (OoD) detection is a natural downstream task for deep generative models, due to their ability to learn the input probability distribution. There are two main classes of approaches for OoD detection using deep generative models: those based on a likelihood measure and those based on the reconstruction loss. However, both approaches are unable to carry out OoD detection effectively, especially when the OoD samples have smaller variance than the training samples. For instance, both flow-based and VAE models assign higher likelihood to images from SVHN when trained on CIFAR-10 images. We use a recently proposed generative model known as the neural rendering model (NRM) and derive metrics for OoD detection. We show that NRM unifies both approaches since it provides a likelihood estimate and also carries out reconstruction in each layer of the neural network. Among various measures, we found the joint likelihood of latent variables to be the most effective one for OoD detection. Our results show that when trained on CIFAR-10, lower likelihood (of latent variables) is assigned to SVHN images. Additionally, we show that this metric is consistent across other OoD datasets. To the best of our knowledge, this is the first work to show consistently lower likelihood for OoD data with smaller variance with deep generative models.

IdeoTrace: A Framework for Ideology Tracing with a Case Study on the 2016 U.S. Presidential Election

May 30, 2019

Indu Manickam, Andrew S. Lan, Gautam Dasarathy, Richard G. Baraniuk

Abstract:The 2016 United States presidential election has been characterized as a period of extreme divisiveness that was exacerbated on social media by the influence of fake news, trolls, and social bots. However, the extent to which the public became more polarized in response to these influences over the course of the election is not well understood. In this paper we propose IdeoTrace, a framework for (i) jointly estimating the ideology of social media users and news websites and (ii) tracing changes in user ideology over time. We apply this framework to the last two months of the election period for a group of 47508 Twitter users and demonstrate that both liberal and conservative users became more polarized over time.

* 9 pages, 4 figures, submitted to ASONAM 2019

Thresholding Graph Bandits with GrAPL

May 22, 2019

Daniel LeJeune, Gautam Dasarathy, Richard G. Baraniuk

Abstract:In this paper, we introduce a new online decision making paradigm that we call Thresholding Graph Bandits. The main goal is to efficiently identify a subset of arms in a multi-armed bandit problem whose means are above a specified threshold. While the arms in such problems are traditionally assumed to be independent, in our paradigm we further suppose that we have access to the similarity between the arms in the form of a graph, allowing us to gain information about the arm means in fewer samples. Such settings play a key role in a wide range of modern decision making problems where rapid decisions need to be made in spite of the large number of options available at each time. We present GrAPL, a novel algorithm for the thresholding graph bandit problem. We demonstrate theoretically that this algorithm is effective in taking advantage of the graph structure when available and the reward function homophily (that strongly connected arms have similar rewards) when favorable. We confirm these theoretical findings via experiments on both synthetic and real data.

* 15 pages, 3 figures

RACE: Sub-Linear Memory Sketches for Approximate Near-Neighbor Search on Streaming Data

Apr 09, 2019

Benjamin Coleman, Anshumali Shrivastava, Richard G. Baraniuk

Abstract:We present the first sublinear memory sketch which can be queried to find the $v$ nearest neighbors in a dataset. Our online sketching algorithm can compress an $N$-element dataset to a sketch of size $O(N^b \log^3{N})$ in $O(N^{b+1} \log^3{N})$ time, where $b < 1$ when the query satisfies a data-dependent near-neighbor stability condition. We achieve data-dependent sublinear space by combining recent advances in locality sensitive hashing (LSH)-based estimators with compressed sensing. Our results shed new light on the memory-accuracy tradeoff for near-neighbor search. The techniques presented reveal a deep connection between the fundamental compressed sensing (or heavy hitters) recovery problem and near-neighbor search, leading to new insight for geometric search problems and implications for sketching algorithms.

Representing Formal Languages: A Comparison Between Finite Automata and Recurrent Neural Networks

Feb 27, 2019

Joshua J. Michalenko, Ameesh Shah, Abhinav Verma, Richard G. Baraniuk, Swarat Chaudhuri, Ankit B. Patel

Abstract:We investigate the internal representations that a recurrent neural network (RNN) uses while learning to recognize a regular formal language. Specifically, we train an RNN on positive and negative examples from a regular language, and ask if there is a simple decoding function that maps states of this RNN to states of the minimal deterministic finite automaton (MDFA) for the language. Our experiments show that such a decoding function indeed exists, and that it maps states of the RNN not to MDFA states, but to states of an abstraction obtained by clustering small sets of MDFA states into "superstates". A qualitative analysis reveals that the abstraction often has a simple interpretation. Overall, the results suggest a strong structural relationship between internal representations used by RNNs and finite automata, and explain the well-known ability of RNNs to recognize formal grammatical structure.

* 15 Pages, 13 Figures, Accepted to ICLR 2019

Adaptive Estimation for Approximate k-Nearest-Neighbor Computations

Feb 25, 2019

Daniel LeJeune, Richard G. Baraniuk, Reinhard Heckel

Abstract:Algorithms often carry out equally many computations for "easy" and "hard" problem instances. In particular, algorithms for finding nearest neighbors typically have the same running time regardless of the particular problem instance. In this paper, we consider the approximate k-nearest-neighbor problem, which is the problem of finding a subset of O(k) points in a given set of points that contains the set of k nearest neighbors of a given query point. We propose an algorithm based on adaptively estimating the distances, and show that it is essentially optimal out of algorithms that are only allowed to adaptively estimate distances. We then demonstrate both theoretically and experimentally that the algorithm can achieve significant speedups relative to the naive method.

* 11 pages, 2 figures. To appear in AISTATS 2019

From Hard to Soft: Understanding Deep Network Nonlinearities via Vector Quantization and Statistical Inference

Oct 22, 2018

Randall Balestriero, Richard G. Baraniuk

Abstract:Nonlinearity is crucial to the performance of a deep (neural) network (DN). To date there has been little progress in understanding the menagerie of available nonlinearities, but recently progress has been made on understanding the role played by piecewise affine and convex nonlinearities like the ReLU and absolute value activation functions and max-pooling. In particular, DN layers constructed from these operations can be interpreted as max-affine spline operators (MASOs) that have an elegant link to vector quantization (VQ) and $K$-means. While this is good theoretical progress, the entire MASO approach is predicated on the requirement that the nonlinearities be piecewise affine and convex, which precludes important activation functions like the sigmoid, hyperbolic tangent, and softmax. This paper extends the MASO framework to these and an infinitely large class of new nonlinearities by linking deterministic MASOs with probabilistic Gaussian Mixture Models (GMMs). We show that, under a GMM, piecewise affine, convex nonlinearities like ReLU, absolute value, and max-pooling can be interpreted as solutions to certain natural "hard" VQ inference problems, while sigmoid, hyperbolic tangent, and softmax can be interpreted as solutions to corresponding "soft" VQ inference problems. We further extend the framework by hybridizing the hard and soft VQ optimizations to create a $\beta$-VQ inference that interpolates between hard, soft, and linear VQ inference. A prime example of a $\beta$-VQ DN nonlinearity is the swish nonlinearity, which offers state-of-the-art performance in a range of computer vision tasks but was developed ad hoc by experimentation. Finally, we validate with experiments an important assertion of our theory, namely that DN performance can be significantly improved by enforcing orthogonality in its linear filters.
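
As a rough illustration of the interpolation idea mentioned in the abstract, the sketch below uses the commonly cited form of swish, x · sigmoid(βx). The scale parameter `beta` here is the usual swish parameter and is an assumption for illustration; the abstract does not state how it maps onto the paper's $\beta$-VQ parameterization. With `beta = 0` the function is linear (x / 2), and for large `beta` it approaches ReLU.

```python
import math


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def relu(x):
    """Piecewise affine, convex nonlinearity (the 'hard' case)."""
    return max(0.0, x)


def swish(x, beta=1.0):
    """Swish in its common form x * sigmoid(beta * x).

    `beta` is the standard swish scale (an assumption here, not
    necessarily the paper's beta): 0 gives the linear map x / 2,
    and large values approach ReLU.
    """
    return x * sigmoid(beta * x)


if __name__ == "__main__":
    for beta in (0.0, 1.0, 50.0):
        values = [round(swish(x, beta), 4) for x in (-2.0, 0.0, 2.0)]
        print(f"beta={beta}: {values}")
```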

prDeep: Robust Phase Retrieval with a Flexible Deep Network

Jun 29, 2018

Christopher A. Metzler, Philip Schniter, Ashok Veeraraghavan, Richard G. Baraniuk

Abstract:Phase retrieval algorithms have become an important component in many modern computational imaging systems. For instance, in the context of ptychography and speckle correlation imaging, they enable imaging past the diffraction limit and through scattering media, respectively. Unfortunately, traditional phase retrieval algorithms struggle in the presence of noise. Progress has been made recently on more robust algorithms using signal priors, but at the expense of limiting the range of supported measurement models (e.g., to Gaussian or coded diffraction patterns). In this work we leverage the regularization-by-denoising framework and a convolutional neural network denoiser to create prDeep, a new phase retrieval algorithm that is both robust and broadly applicable. We test and validate prDeep in simulation to demonstrate that it is robust to noise and can handle a variety of system models. A MatConvNet implementation of prDeep is available at https://github.com/ricedsp/prDeep.

MISSION: Ultra Large-Scale Feature Selection using Count-Sketches

Jun 12, 2018

Amirali Aghazadeh, Ryan Spring, Daniel LeJeune, Gautam Dasarathy, Anshumali Shrivastava, Richard G. Baraniuk

Abstract:Feature selection is an important challenge in machine learning. It plays a crucial role in the explainability of machine-driven decisions that are rapidly permeating modern society. Unfortunately, the explosion in the size and dimensionality of real-world datasets poses a severe challenge to standard feature selection algorithms. Today, it is not uncommon for datasets to have billions of dimensions. At such scale, even storing the feature vector is impossible, causing most existing feature selection methods to fail. Workarounds like feature hashing, a standard approach to large-scale machine learning, help with the computational feasibility, but at the cost of losing the interpretability of features. In this paper, we present MISSION, a novel framework for ultra large-scale feature selection that performs stochastic gradient descent while maintaining an efficient representation of the features in memory using a Count-Sketch data structure. MISSION retains the simplicity of feature hashing without sacrificing the interpretability of the features while using only O(log^2(p)) working memory. We demonstrate that MISSION accurately and efficiently performs feature selection on real-world, large-scale datasets with billions of dimensions.
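
For readers unfamiliar with the Count-Sketch data structure named in the abstract, here is a minimal sketch of the generic technique, not the MISSION implementation. The class name, the `depth` and `width` defaults, and the use of `hashlib.blake2b` for hashing are all illustrative assumptions. Each key is hashed to one bucket per row with a random sign, and a query returns the median of the signed bucket values.

```python
import hashlib
from statistics import median


class CountSketch:
    """Generic Count-Sketch: approximate per-key sums in fixed memory."""

    def __init__(self, depth=5, width=1024):
        self.depth = depth  # number of independent hash rows
        self.width = width  # buckets per row
        self.table = [[0.0] * width for _ in range(depth)]

    def _bucket_and_sign(self, key, row):
        # Derive a bucket index and a +/-1 sign from one hash of (row, key).
        digest = hashlib.blake2b(f"{row}:{key}".encode(), digest_size=8).digest()
        value = int.from_bytes(digest, "big")
        bucket = (value >> 1) % self.width
        sign = 1.0 if value & 1 else -1.0
        return bucket, sign

    def update(self, key, delta):
        """Add `delta` (e.g. a gradient step) to the weight stored for `key`."""
        for row in range(self.depth):
            bucket, sign = self._bucket_and_sign(key, row)
            self.table[row][bucket] += sign * delta

    def query(self, key):
        """Estimate the accumulated value for `key` as a median over rows."""
        estimates = []
        for row in range(self.depth):
            bucket, sign = self._bucket_and_sign(key, row)
            estimates.append(sign * self.table[row][bucket])
        return median(estimates)


if __name__ == "__main__":
    sketch = CountSketch()
    sketch.update("feature_42", 0.5)
    sketch.update("feature_42", 0.25)
    print(sketch.query("feature_42"))
```

Because the keys themselves are hashed rather than stored, memory stays fixed at `depth * width` counters regardless of how many distinct features are seen; the price is that estimates become noisy once many keys collide.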

Unsupervised Learning with Stein's Unbiased Risk Estimator

May 26, 2018

Christopher A. Metzler, Ali Mousavi, Reinhard Heckel, Richard G. Baraniuk

Abstract:Learning from unlabeled and noisy data is one of the grand challenges of machine learning. As such, it has seen a flurry of research with new ideas proposed continuously. In this work, we revisit a classical idea: Stein's Unbiased Risk Estimator (SURE). We show that, in the context of image recovery, SURE and its generalizations can be used to train convolutional neural networks (CNNs) for a range of image denoising and recovery problems without any ground truth data. Specifically, our goal is to reconstruct an image $x$ from a noisy linear transformation (measurement) of the image. We consider two scenarios: one where no additional data is available and one where we have measurements of other images that are drawn from the same noisy distribution as $x$, but have no access to the clean images. Such is the case, for instance, in the context of medical imaging, microscopy, and astronomy, where noise-less ground truth data is rarely available. We show that in this situation, SURE can be used to estimate the mean-squared-error loss associated with an estimate of $x$. Using this estimate of the loss, we train networks to perform denoising and compressed sensing recovery. In addition, we use the SURE framework to partially explain and improve upon an intriguing result presented by Ulyanov et al. in "Deep Image Prior": that a network initialized with random weights and fit to a single noisy image can effectively denoise that image.
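
To make the idea of estimating the mean-squared error without clean data concrete, here is a minimal sketch of the classical SURE formula for the denoising case y = x + n with i.i.d. Gaussian noise of known standard deviation `sigma`. The function name, the plain-list data layout, and the Monte Carlo finite-difference approximation of the divergence term are assumptions for illustration; the abstract does not specify how the divergence is computed.

```python
import random


def sure_mse_estimate(denoise, y, sigma, eps=1e-3, seed=0):
    """Estimate the per-sample MSE of denoise(y) against the unseen clean signal.

    Uses SURE for y = x + n, n ~ N(0, sigma^2 I):
        ||f(y) - y||^2 / N - sigma^2 + (2 sigma^2 / N) * div f(y)
    The divergence is approximated with a single random probe
    (an illustrative choice, not taken from the abstract).
    """
    rng = random.Random(seed)
    n = len(y)
    fy = denoise(y)

    # Random probe direction and perturbed input for the divergence estimate.
    b = [rng.gauss(0.0, 1.0) for _ in range(n)]
    y_perturbed = [yi + eps * bi for yi, bi in zip(y, b)]
    f_perturbed = denoise(y_perturbed)
    divergence = sum(
        bi * (fp - f0) / eps for bi, fp, f0 in zip(b, f_perturbed, fy)
    )

    # Data-fidelity term measured against the noisy observation only.
    residual = sum((f0 - yi) ** 2 for f0, yi in zip(fy, y)) / n
    return residual - sigma ** 2 + 2.0 * sigma ** 2 * divergence / n


if __name__ == "__main__":
    rng = random.Random(1)
    sigma = 0.5
    clean = [1.0] * 10000
    noisy = [c + rng.gauss(0.0, sigma) for c in clean]

    def shrink(v):
        # Toy denoiser: pull every sample halfway toward the signal mean.
        mean = sum(v) / len(v)
        return [0.5 * vi + 0.5 * mean for vi in v]

    estimate = sure_mse_estimate(shrink, noisy, sigma)
    true_mse = sum((s - c) ** 2 for s, c in zip(shrink(noisy), clean)) / len(clean)
    print(f"SURE estimate: {estimate:.4f}, true MSE: {true_mse:.4f}")
```

Since the estimate depends only on the noisy input, the denoiser output, and `sigma`, it can stand in for the usual supervised loss when no clean targets exist, which is the training setting the abstract describes.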
