Samuel Kaski

Learning Image Relations with Contrast Association Networks

May 16, 2017
Yao Lu, Zhirong Yang, Juho Kannala, Samuel Kaski

Inferring the relations between two images is an important class of tasks in computer vision. Examples of such tasks include computing optical flow and stereo disparity. We treat the relation inference tasks as a machine learning problem and tackle it with neural networks. A key to the problem is learning a representation of relations. We propose a new neural network module, contrast association unit (CAU), which explicitly models the relations between two sets of input variables. Due to the non-negativity of the weights in CAU, we adopt a multiplicative update algorithm for learning these weights. Experiments show that neural networks with CAUs are more effective in learning five fundamental image transformations than conventional neural networks.

Improving drug sensitivity predictions in precision medicine through active expert knowledge elicitation

May 09, 2017
Iiris Sundin, Tomi Peltola, Muntasir Mamun Majumder, Pedram Daee, Marta Soare, Homayun Afrabandpey, Caroline Heckman, Samuel Kaski, Pekka Marttinen

Predicting the efficacy of a drug for a given individual, using high-dimensional genomic measurements, is at the core of precision medicine. However, identifying features on which to base the predictions remains a challenge, especially when the sample size is small. Incorporating expert knowledge offers a promising alternative to improve a prediction model, but collecting such knowledge is laborious for the expert if the number of candidate features is very large. We introduce a probabilistic model that can incorporate expert feedback about the impact of genomic measurements on the sensitivity of a cancer cell for a given drug. We also present two methods to intelligently collect this feedback from the expert, using experimental design and multi-armed bandit models. In a multiple myeloma blood cancer data set (n=51), expert knowledge decreased the prediction error by 8%. Furthermore, the intelligent approaches can be used to reduce the workload of feedback collection to less than 30% on average compared to a naive approach.

* 24 pages, 3 figures

Likelihood-free inference via classification

Mar 03, 2017
Michael U. Gutmann, Ritabrata Dutta, Samuel Kaski, Jukka Corander

Increasingly complex generative models are being used across disciplines as they allow for realistic characterization of data, but a common difficulty with them is the prohibitively large computational cost to evaluate the likelihood function and thus to perform likelihood-based statistical inference. A likelihood-free inference framework has emerged where the parameters are identified by finding values that yield simulated data resembling the observed data. While widely applicable, a major difficulty in this framework is how to measure the discrepancy between the simulated and observed data. Transforming the original problem into a problem of classifying the data into simulated versus observed, we find that classification accuracy can be used to assess the discrepancy. The complete arsenal of classification methods becomes thereby available for inference of intractable generative models. We validate our approach using theory and simulations for both point estimation and Bayesian inference, and demonstrate its use on real data by inferring an individual-based epidemiological model for bacterial infections in child care centers.

* Accepted for publication in Statistics and Computing (Feb 13, 2017)
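
The idea in the abstract above, using classification accuracy as the discrepancy between simulated and observed data, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration rather than the authors' setup: the simulator is a toy unit-variance Gaussian, the classifier is a simple midpoint threshold between class means, and the names `simulate`, `classification_discrepancy`, and `estimate_theta` are hypothetical.

```python
import random
import statistics


def simulate(theta, n, rng):
    """Toy simulator (assumption): n draws from a unit-variance Gaussian with mean theta."""
    return [rng.gauss(theta, 1.0) for _ in range(n)]


def classification_discrepancy(observed, simulated):
    """Held-out accuracy of a midpoint-threshold classifier, observed vs. simulated.

    Accuracy near 0.5 means the two samples are hard to tell apart.
    """
    half_o, half_s = len(observed) // 2, len(simulated) // 2
    train_o, test_o = observed[:half_o], observed[half_o:]
    train_s, test_s = simulated[:half_s], simulated[half_s:]

    # "Train": place the decision threshold midway between the class means.
    mean_o, mean_s = statistics.fmean(train_o), statistics.fmean(train_s)
    threshold = (mean_o + mean_s) / 2.0
    observed_is_upper = mean_o >= mean_s

    # "Test": count held-out points that land on their own class's side.
    correct = sum((x >= threshold) == observed_is_upper for x in test_o)
    correct += sum((x >= threshold) != observed_is_upper for x in test_s)
    return correct / (len(test_o) + len(test_s))


def estimate_theta(observed, candidates, rng):
    """Point estimate: the candidate whose simulations are hardest to classify."""
    scores = {
        theta: classification_discrepancy(observed, simulate(theta, len(observed), rng))
        for theta in candidates
    }
    return min(scores, key=scores.get), scores


rng = random.Random(0)
observed = simulate(2.0, 400, rng)  # pretend the true mean is unknown
candidates = [-2.0, -1.0, 0.0, 1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
best_theta, scores = estimate_theta(observed, candidates, rng)
print(best_theta, scores[best_theta])
```

Any classifier could replace the threshold rule; the sketch only relies on the fact that held-out accuracy approaches chance level when the simulated and observed samples come from similar distributions.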

Interactive Prior Elicitation of Feature Similarities for Small Sample Size Prediction

Feb 28, 2017
Homayun Afrabandpey, Tomi Peltola, Samuel Kaski

Regression under the "small $n$, large $p$" conditions, of small sample size $n$ and large number of features $p$ in the learning data set, is a recurring setting in which learning from data is difficult. With prior knowledge about relationships of the features, $p$ can effectively be reduced, but explicating such prior knowledge is difficult for experts. In this paper we introduce a new method for eliciting expert prior knowledge about the similarity of the roles of features in the prediction task. The key idea is to use an interactive multidimensional-scaling (MDS) type scatterplot display of the features to elicit the similarity relationships, and then use the elicited relationships in the prior distribution of prediction parameters. Specifically, for learning to predict a target variable with Bayesian linear regression, the feature relationships are used to construct a Gaussian prior with a full covariance matrix for the regression coefficients. Evaluation of our method in experiments with simulated and real users on text data confirms that prior elicitation of feature similarities improves prediction accuracy. Furthermore, elicitation with an interactive scatterplot display outperforms straightforward elicitation where the users choose feature pairs from a feature list.
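
As a worked illustration of how a full-covariance Gaussian prior enters Bayesian linear regression, the standard conjugate result is sketched below. The symbols are assumptions not taken from the abstract: $\boldsymbol{\Sigma}_0$ denotes the prior covariance built from the elicited similarities (how it is built is specific to the paper and not reproduced here), $\sigma^2$ is a known noise variance, and $\mathbf{X}$, $\mathbf{y}$ are the design matrix and targets.

```latex
\begin{align}
  \mathbf{w} &\sim \mathcal{N}(\mathbf{0}, \boldsymbol{\Sigma}_0), &
  \mathbf{y} \mid \mathbf{X}, \mathbf{w} &\sim \mathcal{N}(\mathbf{X}\mathbf{w}, \sigma^2 \mathbf{I}), \\
  \mathbf{S} &= \left(\boldsymbol{\Sigma}_0^{-1} + \sigma^{-2}\mathbf{X}^\top\mathbf{X}\right)^{-1}, &
  \boldsymbol{\mu} &= \sigma^{-2}\,\mathbf{S}\,\mathbf{X}^\top\mathbf{y}.
\end{align}
```

Here the posterior over the coefficients is $\mathcal{N}(\boldsymbol{\mu}, \mathbf{S})$. Positive off-diagonal entries in $\boldsymbol{\Sigma}_0$ for features judged similar pull their coefficients toward each other, and as long as $\boldsymbol{\Sigma}_0$ is positive definite the posterior stays well defined even when $n < p$.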

Interpreting Outliers: Localized Logistic Regression for Density Ratio Estimation

Feb 21, 2017
Makoto Yamada, Song Liu, Samuel Kaski

We propose an inlier-based outlier detection method capable of both identifying the outliers and explaining why they are outliers, by identifying the outlier-specific features. Specifically, we employ an inlier-based outlier detection criterion, which uses the ratio of inlier and test probability densities as a measure of plausibility of being an outlier. For estimating the density ratio function, we propose a localized logistic regression algorithm. Thanks to the locality of the model, variable selection can be outlier-specific, and will help interpret why points are outliers in a high-dimensional space. Through synthetic experiments, we show that the proposed algorithm can successfully detect the important features for outliers. Moreover, we show that the proposed algorithm tends to outperform existing algorithms in benchmark datasets.
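
The general link between logistic regression and density ratio estimation, which the abstract above builds on, can be sketched as follows. This is the plain, non-localized version with one global linear model in one dimension, not the localized algorithm the paper proposes; the synthetic data and the helper names `fit_logistic` and `density_ratio` are assumptions for illustration.

```python
import math
import random


def sigmoid(z):
    z = max(-30.0, min(30.0, z))  # clamp to avoid overflow in exp
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(xs, ys, lr=0.1, steps=1000):
    """Plain batch gradient descent for 1-D logistic regression with intercept."""
    w, b, n = 0.0, 0.0, len(xs)
    for _ in range(steps):
        grad_w = grad_b = 0.0
        for x, y in zip(xs, ys):
            err = sigmoid(w * x + b) - y
            grad_w += err * x
            grad_b += err
        w -= lr * grad_w / n
        b -= lr * grad_b / n
    return w, b


def density_ratio(x, w, b, n_inlier, n_test):
    """Estimate p_inlier(x) / p_test(x) from the classifier's odds (Bayes' rule)."""
    p = sigmoid(w * x + b)  # P(label = inlier | x)
    return (n_test / n_inlier) * p / (1.0 - p)


rng = random.Random(0)
inliers = [rng.gauss(0.0, 1.0) for _ in range(300)]
# Test set: mostly inlier-like points plus a cluster of synthetic outliers.
test_points = [rng.gauss(0.0, 1.0) for _ in range(260)]
test_points += [rng.gauss(5.0, 0.5) for _ in range(40)]

xs = inliers + test_points
ys = [1.0] * len(inliers) + [0.0] * len(test_points)  # 1 = inlier, 0 = test
w, b = fit_logistic(xs, ys)

ratio_typical = density_ratio(0.0, w, b, len(inliers), len(test_points))
ratio_outlier = density_ratio(5.0, w, b, len(inliers), len(test_points))
print(ratio_typical, ratio_outlier)  # a lower ratio suggests an outlier
```

A single global model like this yields one weight vector for all points; the locality described in the abstract is what would allow the selected features to differ from one outlier to another.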

Regression with $n\to 1$ by Expert Knowledge Elicitation

Feb 07, 2017
Marta Soare, Muhammad Ammad-ud-din, Samuel Kaski

We consider regression under the "extremely small $n$ large $p$" condition, where the number of samples $n$ is so small compared to the dimensionality $p$ that predictors cannot be estimated without prior knowledge. This setup occurs in personalized medicine, for instance, when predicting treatment outcomes for an individual patient based on noisy high-dimensional genomics data. A remaining source of information is expert knowledge, which has received relatively little attention in recent years. We formulate the inference problem of asking expert feedback on features on a budget, propose an elicitation strategy for a simple "small $n$" setting, and derive conditions under which the elicitation strategy is optimal. Experiments on simulated experts, both on synthetic and genomics data, demonstrate that the proposed strategy can drastically improve prediction accuracy.

* In Proceedings of the 15th IEEE International Conference on Machine Learning and Applications (IEEE ICMLA'16)

Interactive Elicitation of Knowledge on Feature Relevance Improves Predictions in Small Data Sets

Jan 16, 2017
Luana Micallef, Iiris Sundin, Pekka Marttinen, Muhammad Ammad-ud-din, Tomi Peltola, Marta Soare, Giulio Jacucci, Samuel Kaski

Providing accurate predictions is challenging for machine learning algorithms when the number of features is larger than the number of samples in the data. Prior knowledge can improve machine learning models by indicating relevant variables and parameter values. Yet, this prior knowledge is often tacit and only available from domain experts. We present a novel approach that uses interactive visualization to elicit the tacit prior knowledge and uses it to improve the accuracy of prediction models. The main component of our approach is a user model that models the domain expert's knowledge of the relevance of different features for a prediction task. In particular, based on the expert's earlier input, the user model guides the selection of the features on which to elicit the user's knowledge next. The results of a controlled user study show that the user model significantly improves prior knowledge elicitation and prediction accuracy when predicting the relative citation counts of scientific documents in a specific domain.

* in Proceedings of the 22nd International Conference on Intelligent User Interfaces (IUI 2017)

Inferring Cognitive Models from Data using Approximate Bayesian Computation

Jan 13, 2017
Antti Kangasrääsiö, Kumaripaba Athukorala, Andrew Howes, Jukka Corander, Samuel Kaski, Antti Oulasvirta

An important problem for HCI researchers is to estimate the parameter values of a cognitive model from behavioral data. This is a difficult problem, because of the substantial complexity and variety in human behavioral strategies. We report an investigation into a new approach using approximate Bayesian computation (ABC) to condition model parameters to data and prior knowledge. As a case study we examine menu interaction, where only click-time data are available for inferring a cognitive model that implements a search behaviour with parameters such as fixation duration and recall probability. Our results demonstrate that ABC (i) improves estimates of model parameter values, (ii) enables meaningful comparisons between model variants, and (iii) supports fitting models to individual users. ABC provides ample opportunities for theoretical HCI research by allowing principled inference of model parameter values and their uncertainty.

* To appear in CHI'2017
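
To make the ABC idea in the abstract above concrete, here is a minimal rejection-sampling sketch. It is the simplest ABC variant and is not claimed to be the one used in the paper. The simulator is a toy stand-in for a cognitive model (task time as a sum of noisy fixations), and the prior range, the tolerance `epsilon`, and the function names are all assumptions for illustration.

```python
import random
import statistics


def simulate_task_times(fixation_duration, n_trials, rng, n_fixations=5):
    """Toy stand-in for a cognitive model: task time = sum of noisy fixations."""
    return [
        sum(rng.gauss(fixation_duration, 0.05) for _ in range(n_fixations))
        for _ in range(n_trials)
    ]


def abc_rejection(observed, n_draws, epsilon, rng):
    """Keep prior draws whose simulated mean task time is within epsilon of the observed one."""
    target = statistics.fmean(observed)
    accepted = []
    for _ in range(n_draws):
        theta = rng.uniform(0.1, 0.6)  # prior over fixation duration (seconds)
        simulated = simulate_task_times(theta, len(observed), rng)
        if abs(statistics.fmean(simulated) - target) < epsilon:
            accepted.append(theta)  # approximate posterior sample
    return accepted


rng = random.Random(1)
observed = simulate_task_times(0.3, 50, rng)  # synthetic "click time" data
posterior = abc_rejection(observed, n_draws=2000, epsilon=0.05, rng=rng)
print(len(posterior), statistics.fmean(posterior), statistics.stdev(posterior))
```

The accepted draws approximate the posterior over the parameter, so their spread gives a rough uncertainty estimate alongside the point estimate. A smaller `epsilon` tightens the approximation at the cost of fewer accepted samples.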

Localized Lasso for High-Dimensional Regression

Oct 14, 2016
Makoto Yamada, Koh Takeuchi, Tomoharu Iwata, John Shawe-Taylor, Samuel Kaski

We introduce the localized Lasso, which is suited for learning models that are both interpretable and have a high predictive power in problems with high dimensionality $d$ and small sample size $n$. More specifically, we consider a function defined by local sparse models, one at each data point. We introduce sample-wise network regularization to borrow strength across the models, and sample-wise exclusive group sparsity (a.k.a., $\ell_{1,2}$ norm) to introduce diversity into the choice of feature sets in the local models. The local models are interpretable in terms of similarity of their sparsity patterns. The cost function is convex, and thus has a globally optimal solution. Moreover, we propose a simple yet efficient iterative least-squares based optimization procedure for the localized Lasso, which does not need a tuning parameter, and is guaranteed to converge to a globally optimal solution. The solution is empirically shown to outperform alternatives for both simulated and genomic personalized medicine data.

Bayesian multi-tensor factorization

Oct 12, 2016
Suleiman A. Khan, Eemeli Leppäaho, Samuel Kaski

We introduce Bayesian multi-tensor factorization, a model that is the first Bayesian formulation for joint factorization of multiple matrices and tensors. The research problem generalizes the joint matrix-tensor factorization problem to arbitrary sets of tensors of any depth, including matrices, can be interpreted as unsupervised multi-view learning from multiple data tensors, and can be generalized to relax the usual trilinear tensor factorization assumptions. The result is a factorization of the set of tensors into factors shared by any subsets of the tensors, and factors private to individual tensors. We demonstrate the performance against existing baselines in multiple tensor factorization tasks in structural toxicogenomics and functional neuroimaging.

* Machine Learning, 105(2), 233-253, 2016
* R Implementation / source code: http://research.cs.aalto.fi/pml/software/mtf/