Junier Oliva

Department of Computer Science, University of North Carolina at Chapel Hill

Acquisition Conditioned Oracle for Nongreedy Active Feature Acquisition

Feb 27, 2023

Michael Valancius, Max Lennon, Junier Oliva

Abstract: We develop novel methodology for active feature acquisition (AFA), the study of how to sequentially acquire a dynamic (on a per instance basis) subset of features that minimizes acquisition costs whilst still yielding accurate predictions. The AFA framework can be useful in a myriad of domains, including health care applications where the cost of acquiring additional features for a patient (in terms of time, money, risk, etc.) can be weighed against the expected improvement to diagnostic performance. Previous approaches for AFA have employed either: deep learning RL techniques, which have difficulty training policies in the AFA MDP due to sparse rewards and a complicated action space; deep learning surrogate generative models, which require modeling complicated multidimensional conditional distributions; or greedy policies, which fail to account for how joint feature acquisitions can be informative together for better predictions. In this work we show that we can bypass many of these challenges with a novel, nonparametric, oracle-based approach, which we coin the acquisition conditioned oracle (ACO). Extensive experiments show the superiority of the ACO to state-of-the-art AFA methods when acquiring features for both predictions and general decision-making.

Learning to Retrieve Videos by Asking Questions

May 13, 2022

Avinash Madasu, Junier Oliva, Gedas Bertasius

Abstract: The majority of traditional text-to-video retrieval systems operate in static environments, i.e., there is no interaction between the user and the agent beyond the initial textual query provided by the user. This can be suboptimal if the initial query has ambiguities, which would lead to many falsely retrieved videos. To overcome this limitation, we propose a novel framework for Video Retrieval using Dialog (ViReD), which enables the user to interact with an AI agent via multiple rounds of dialog. The key contribution of our framework is a novel multimodal question generator that learns to ask questions that maximize the subsequent video retrieval performance. Our multimodal question generator uses (i) the video candidates retrieved during the last round of interaction with the user and (ii) the text-based dialog history documenting all previous interactions, to generate questions that incorporate both visual and linguistic cues relevant to video retrieval. Furthermore, to generate maximally informative questions, we propose an Information-Guided Supervision (IGS), which guides the question generator to ask questions that would boost subsequent video retrieval accuracy. We validate the effectiveness of our interactive ViReD framework on the AVSD dataset, showing that our interactive method performs significantly better than traditional non-interactive video retrieval systems. Furthermore, we demonstrate that our proposed approach also generalizes to real-world settings that involve interactions with real humans, thus demonstrating the robustness and generality of our framework.

Interpretable Single-Cell Set Classification with Kernel Mean Embeddings

Feb 10, 2022

Siyuan Shan, Vishal Baskaran, Haidong Yi, Jolene Ranek, Natalie Stanley, Junier Oliva

Abstract: Modern single-cell flow and mass cytometry technologies measure the expression of several proteins of the individual cells within a blood or tissue sample. Each profiled biological sample is thus represented by a set of hundreds of thousands of multidimensional cell feature vectors, which incurs a high computational cost to predict each biological sample's associated phenotype with machine learning models. Such a large set cardinality also limits the interpretability of machine learning models due to the difficulty in tracking how each individual cell influences the ultimate prediction. Using Kernel Mean Embedding to encode the cellular landscape of each profiled biological sample, we can train a simple linear classifier and achieve state-of-the-art classification accuracy on 3 flow and mass cytometry datasets. Our model contains few parameters but still performs similarly to deep learning models with millions of parameters. In contrast with deep learning approaches, the linearity and sub-selection step of our model make it easy to interpret classification results. Clustering analysis further shows that our method admits rich biological interpretability for linking cellular heterogeneity to clinical phenotype.

* Code is available at https://github.com/shansiliu95/CKME
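
The idea of encoding each sample as a kernel mean embedding and then fitting a linear classifier can be illustrated with a minimal sketch. This is not the authors' implementation (see the repository above for that). It assumes random Fourier features as an approximation to a Gaussian kernel, synthetic Gaussian "cells" in place of cytometry data, and ridge-regularised least squares as the linear classifier; all names and sizes are hypothetical.

```python
import numpy as np

def rff_features(cells, freqs, phases):
    """Random Fourier features approximating a Gaussian kernel (assumption)."""
    n_feat = freqs.shape[1]
    return np.sqrt(2.0 / n_feat) * np.cos(cells @ freqs + phases)

def mean_embedding(cells, freqs, phases):
    """Embed a whole sample (a set of cells) as its average feature vector."""
    return rff_features(cells, freqs, phases).mean(axis=0)

rng = np.random.default_rng(0)
n_markers, n_feat, bandwidth = 5, 64, 2.0  # hypothetical sizes
freqs = rng.normal(scale=1.0 / bandwidth, size=(n_markers, n_feat))
phases = rng.uniform(0.0, 2.0 * np.pi, size=n_feat)

# Synthetic stand-in for cytometry data: 20 samples per phenotype, 200 cells each.
samples, labels = [], []
for label, shift in ((-1.0, 0.0), (1.0, 1.0)):
    for _ in range(20):
        samples.append(rng.normal(loc=shift, size=(200, n_markers)))
        labels.append(label)
labels = np.array(labels)

# One fixed-length vector per sample, regardless of how many cells it has.
embeddings = np.stack([mean_embedding(s, freqs, phases) for s in samples])
design = np.hstack([embeddings, np.ones((len(samples), 1))])  # bias column

# Ridge-regularised least squares used as a simple linear classifier.
ridge = 1e-6
weights = np.linalg.solve(
    design.T @ design + ridge * np.eye(design.shape[1]),
    design.T @ labels,
)
train_accuracy = float(np.mean(np.sign(design @ weights) == labels))
```

Because the embedding is an average over cells, the classifier's input size is independent of the set cardinality, which is the property the abstract relies on for both cost and interpretability.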

Multiscale Score Matching for Out-of-Distribution Detection

Oct 27, 2020

Ahsan Mahmood, Junier Oliva, Martin Styner

Abstract: We present a new methodology for detecting out-of-distribution (OOD) images by utilizing norms of the score estimates at multiple noise scales. A score is defined to be the gradient of the log density with respect to the input data. Our methodology is completely unsupervised and follows a straightforward training scheme. First, we train a deep network to estimate scores for L levels of noise. Once trained, we calculate the noisy score estimates for N in-distribution samples and take the L2-norms across the input dimensions (resulting in an NxL matrix). Then we train an auxiliary model (such as a Gaussian Mixture Model) to learn the in-distribution spatial regions in this L-dimensional space. This auxiliary model can now be used to identify points that reside outside the learned space. Despite its simplicity, our experiments show that this methodology significantly outperforms the state-of-the-art in detecting out-of-distribution images. For example, our method can effectively separate CIFAR-10 (inlier) and SVHN (OOD) images, a setting which has been previously shown to be difficult for deep likelihood models.
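
The pipeline described in this abstract (score norms at several noise scales, then an auxiliary density model on the resulting matrix) can be sketched on toy data. The sketch below makes two loud substitutions: instead of a trained score network it uses the closed-form score of standard Gaussian inliers, and instead of a Gaussian Mixture Model it fits an independent Gaussian per noise scale. The noise levels, dimensions, and threshold are hypothetical.

```python
import numpy as np

def score_norms(x, sigmas):
    """L2 norm of the noisy score at each noise scale, as an (N, L) matrix.

    Assumption: inliers are N(0, I), so after adding N(0, sigma^2 I) noise the
    density is N(0, (1 + sigma^2) I) and its score is -x / (1 + sigma^2).
    A learned score network would replace this closed form.
    """
    scores = -x[:, None, :] / (1.0 + sigmas[None, :, None] ** 2)  # (N, L, d)
    return np.linalg.norm(scores, axis=2)

rng = np.random.default_rng(0)
dim = 16
sigmas = np.array([0.1, 0.5, 1.0, 2.0])  # L = 4 hypothetical noise levels

inliers = rng.normal(size=(500, dim))
outliers = rng.normal(loc=3.0, size=(100, dim))  # shifted, hence OOD

# Fit the stand-in auxiliary model on the (N, L) matrix of inlier score norms.
train = score_norms(inliers[:400], sigmas)
mu, std = train.mean(axis=0), train.std(axis=0)

def anomaly_score(x):
    """Sum of squared z-scores across noise scales (diagonal Gaussian model)."""
    z = (score_norms(x, sigmas) - mu) / std
    return (z ** 2).sum(axis=1)

# Flag anything beyond the 99th percentile of training-inlier scores.
threshold = np.quantile(anomaly_score(inliers[:400]), 0.99)
held_out_flagged = float(np.mean(anomaly_score(inliers[400:]) > threshold))
outliers_flagged = float(np.mean(anomaly_score(outliers) > threshold))
```

In this toy setting the columns of the matrix are proportional to one another because the closed-form score only rescales with the noise level; with a learned network on images, each scale would be expected to contribute different information.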

Deep Message Passing on Sets

Sep 21, 2019

Yifeng Shi, Junier Oliva, Marc Niethammer

Abstract: Modern methods for learning over graph input data have shown the fruitfulness of accounting for relationships among elements in a collection. However, most methods that learn over set input data use only rudimentary approaches to exploit intra-collection relationships. In this work we introduce Deep Message Passing on Sets (DMPS), a novel method that incorporates relational learning for sets. DMPS not only connects learning on graphs with learning on sets via deep kernel learning, but it also bridges message passing on sets and traditional diffusion dynamics commonly used in denoising models. Based on these connections, we develop two new blocks for relational learning on sets: the set-denoising block and the set-residual block. The former is motivated by the connection between message passing on general graphs and diffusion-based denoising models, whereas the latter is inspired by the well-known residual network. In addition to demonstrating the interpretability of our model by learning the true underlying relational structure experimentally, we also show the effectiveness of our approach on both synthetic and real-world datasets by achieving results that are competitive with or outperform the state-of-the-art.

* 11 pages, 8 figures

Meta-Neighborhoods

Sep 18, 2019

Siyuan Shan, Junier Oliva

Abstract: Traditional methods for training neural networks use training data just once, as it is discarded after training. Instead, in this work we also leverage the training data during testing to adjust the network and gain more expressivity. Our approach, named Meta-Neighborhoods, is developed under a multi-task learning framework and is a generalization of k-nearest neighbors methods. It can flexibly adapt network parameters w.r.t. different query data using their respective local neighborhood information. Local information is learned and stored in a dictionary of learnable neighbors rather than directly retrieved from the training set for greater flexibility and performance. The network parameters and the dictionary are optimized end-to-end via meta-learning. Extensive experiments demonstrate that Meta-Neighborhoods consistently improved classification and regression performance across various network architectures and datasets. We also observed larger improvements than other state-of-the-art meta-learning methods designed to improve supervised learning.

* 8 pages

MolecularRNN: Generating realistic molecular graphs with optimized properties

May 31, 2019

Mariya Popova, Mykhailo Shvets, Junier Oliva, Olexandr Isayev

Abstract: Designing new molecules with a set of predefined properties is a core problem in modern drug discovery and development. There is a growing need for de-novo design methods that would address this problem. We present MolecularRNN, a graph recurrent generative model for molecular structures. Our model generates diverse realistic molecular graphs after likelihood pretraining on a big database of molecules. We perform an analysis of our pretrained models on large-scale generated datasets of 1 million samples. Further, the model is tuned with a policy gradient algorithm, provided a critic that estimates the reward for the property of interest. We show a significant distribution shift to the desired range for lipophilicity, drug-likeness, and melting point, outperforming state-of-the-art works. With the use of rejection sampling based on valency constraints, our model yields 100% validity. Moreover, we show that invalid molecules provide a rich signal to the model through the use of structure penalty in our reinforcement learning pipeline.

Permutation Invariant Likelihoods and Equivariant Transformations

Feb 05, 2019

Chris Bender, Juan Jose Garcia, Kevin O'Connor, Junier Oliva

Abstract: In this work, we fill a substantial void in machine learning and statistical methodology by developing extensive generative density estimation techniques for exchangeable non-iid data. We do so through the use of permutation invariant likelihoods and permutation equivariant transformations of variables. These methods exploit the intradependencies within sets in ways that are independent of ordering (for likelihoods) or order preserving (for transformations). The proposed techniques are able to directly model exchangeable data (such as sets) without the need to account for permutations or assume independence of elements. We consider applications to point clouds and provide several interesting experiments on both synthetic and real-world datasets.

A Forest from the Trees: Generation through Neighborhoods

Feb 04, 2019

Yang Li, Tianxiang Gao, Junier Oliva

Abstract: In this work, we propose to learn a generative model using both learned features (through a latent space) and memories (through neighbors). Although human learning makes seamless use of both learned perceptual features and instance recall, current generative learning paradigms only make use of one of these two components. Take, for instance, flow models, which learn a latent space of invertible features that follow a simple distribution. Conversely, kernel density techniques use instances to shift a simple distribution into an aggregate mixture model. Here we propose multiple methods to enhance the latent space of a flow model with neighborhood information. Not only does our proposed framework represent a more human-like approach by leveraging both learned features and memories, but it may also be viewed as a step forward in non-parametric methods. The efficacy of our model is shown empirically with standard image datasets. We observe compelling results and a significant improvement over baselines.

Bayesian Nonparametric Kernel-Learning

Jan 30, 2018

Junier Oliva, Avinava Dubey, Andrew G. Wilson, Barnabas Poczos, Jeff Schneider, Eric P. Xing

Abstract: Kernel methods are ubiquitous tools in machine learning. However, there is often little reason for the common practice of selecting a kernel a priori. Even if a universal approximating kernel is selected, the quality of the finite sample estimator may be greatly affected by the choice of kernel. Furthermore, when directly applying kernel methods, one typically needs to compute an $N \times N$ Gram matrix of pairwise kernel evaluations to work with a dataset of $N$ instances. The computation of this Gram matrix precludes the direct application of kernel methods on large datasets, and makes kernel learning especially difficult. In this paper we introduce Bayesian nonparametric kernel-learning (BaNK), a generic, data-driven framework for scalable learning of kernels. BaNK places a nonparametric prior on the spectral distribution of random frequencies, allowing it to both learn kernels and scale to large datasets. We show that this framework can be used for large scale regression and classification tasks. Furthermore, we show that BaNK outperforms several other scalable approaches for kernel learning on a variety of real world datasets.
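
For context on the phrase "spectral distribution of random frequencies": the standard random-features identity, which this abstract appears to build on (an assumption here, since the abstract does not spell it out), writes a real, shift-invariant kernel with $k(0) = 1$ as an expectation over frequencies $\omega$ drawn from its spectral density $p(\omega)$, and approximates it with $M$ samples (the symbols $p$, $\omega_j$, and $M$ are notation introduced for this note):

$$
k(x - y) = \int p(\omega)\, e^{i \omega^\top (x - y)}\, d\omega
\approx \frac{1}{M} \sum_{j=1}^{M} \cos\!\big(\omega_j^\top (x - y)\big),
\qquad \omega_j \sim p(\omega).
$$

Under this view, choosing the kernel amounts to choosing $p(\omega)$, and working with $M$ random features per instance costs on the order of $N M$ rather than the $N^2$ of the full Gram matrix.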
