Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Vasant Honavar

Top-N-Rank: A Scalable List-wise Ranking Method for Recommender Systems

Dec 19, 2018
Junjie Liang, Jinlong Hu, Shoubin Dong, Vasant Honavar

Figure 1 for Top-N-Rank: A Scalable List-wise Ranking Method for Recommender Systems

Figure 2 for Top-N-Rank: A Scalable List-wise Ranking Method for Recommender Systems

We propose Top-N-Rank, a novel family of list-wise Learning-to-Rank models for reliably recommending the N top-ranked items. The proposed models optimize a variant of the widely used discounted cumulative gain (DCG) objective function which differs from DCG in two important aspects: (i) It limits the evaluation of DCG only on the top N items in the ranked lists, thereby eliminating the impact of low-ranked items on the learned ranking function; and (ii) it incorporates weights that allow the model to leverage multiple types of implicit feedback with differing levels of reliability or trustworthiness. Because the resulting objective function is non-smooth and hence challenging to optimize, we consider two smooth approximations of the objective function, using the traditional sigmoid function and the rectified linear unit (ReLU). We propose a family of learning-to-rank algorithms (Top-N-Rank) that work with any smooth objective function. Then, a more efficient variant, Top-N-Rank.ReLU, is introduced, which effectively exploits the properties of ReLU function to reduce the computational complexity of Top-N-Rank from quadratic to linear in the average number of items rated by users. The results of our experiments using two widely used benchmarks, namely, the MovieLens data set and the Amazon Video Games data set demonstrate that: (i) The `top-N truncation' of the objective function substantially improves the ranking quality of the top N recommendations; (ii) using the ReLU for smoothing the objective function yields significant improvement in both ranking quality as well as runtime as compared to using the sigmoid; and (iii) Top-N-Rank.ReLU substantially outperforms the well-performing list-wise ranking methods in terms of ranking quality.

* paper accepted by the 2018 IEEE International Conference on Big Data

Via

Access Paper or Ask Questions

Multi-View Network Embedding Via Graph Factorization Clustering and Co-Regularized Multi-View Agreement

Nov 08, 2018
Yiwei Sun, Ngot Bui, Tsung-Yu Hsieh, Vasant Honavar

Figure 1 for Multi-View Network Embedding Via Graph Factorization Clustering and Co-Regularized Multi-View Agreement

Figure 2 for Multi-View Network Embedding Via Graph Factorization Clustering and Co-Regularized Multi-View Agreement

Figure 3 for Multi-View Network Embedding Via Graph Factorization Clustering and Co-Regularized Multi-View Agreement

Figure 4 for Multi-View Network Embedding Via Graph Factorization Clustering and Co-Regularized Multi-View Agreement

Real-world social networks and digital platforms are comprised of individuals (nodes) that are linked to other individuals or entities through multiple types of relationships (links). Sub-networks of such a network based on each type of link correspond to distinct views of the underlying network. In real-world applications, each node is typically linked to only a small subset of other nodes. Hence, practical approaches to problems such as node labeling have to cope with the resulting sparse networks. While low-dimensional network embeddings offer a promising approach to this problem, most of the current network embedding methods focus primarily on single view networks. We introduce a novel multi-view network embedding (MVNE) algorithm for constructing low-dimensional node embeddings from multi-view networks. MVNE adapts and extends an approach to single view network embedding (SVNE) using graph factorization clustering (GFC) to the multi-view setting using an objective function that maximizes the agreement between views based on both the local and global structure of the underlying multi-view graph. Our experiments with several benchmark real-world single view networks show that GFC-based SVNE yields network embeddings that are competitive with or superior to those produced by the state-of-the-art single view network embedding methods when the embeddings are used for labeling unlabeled nodes in the networks. Our experiments with several multi-view networks show that MVNE substantially outperforms the single view methods on integrated view and the state-of-the-art multi-view methods. We further show that even when the goal is to predict labels of nodes within a single target view, MVNE outperforms its single-view counterpart suggesting that the MVNE is able to extract the information that is useful for labeling nodes in the target view from the all of the views.

* ICDMW2018 -- IEEE International Conference on Data Mining workshop on Graph Analytics

Via

Access Paper or Ask Questions

Compositional Stochastic Average Gradient for Machine Learning and Related Applications

Sep 07, 2018
Tsung-Yu Hsieh, Yasser EL-Manzalawy, Yiwei Sun, Vasant Honavar

Figure 1 for Compositional Stochastic Average Gradient for Machine Learning and Related Applications

Figure 2 for Compositional Stochastic Average Gradient for Machine Learning and Related Applications

Figure 3 for Compositional Stochastic Average Gradient for Machine Learning and Related Applications

Many machine learning, statistical inference, and portfolio optimization problems require minimization of a composition of expected value functions (CEVF). Of particular interest is the finite-sum versions of such compositional optimization problems (FS-CEVF). Compositional stochastic variance reduced gradient (C-SVRG) methods that combine stochastic compositional gradient descent (SCGD) and stochastic variance reduced gradient descent (SVRG) methods are the state-of-the-art methods for FS-CEVF problems. We introduce compositional stochastic average gradient descent (C-SAG) a novel extension of the stochastic average gradient method (SAG) to minimize composition of finite-sum functions. C-SAG, like SAG, estimates gradient by incorporating memory of previous gradient information. We present theoretical analyses of C-SAG which show that C-SAG, like SAG, and C-SVRG, achieves a linear convergence rate when the objective function is strongly convex; However, C-CAG achieves lower oracle query complexity per iteration than C-SVRG. Finally, we present results of experiments showing that C-SAG converges substantially faster than full gradient (FG), as well as C-SVRG.

Via

Access Paper or Ask Questions

Advances in Artificial Intelligence Require Progress Across all of Computer Science

Jul 13, 2017
Gregory D. Hager, Randal Bryant, Eric Horvitz, Maja Mataric, Vasant Honavar

Advances in Artificial Intelligence require progress across all of computer science.

* 7 pages, Computing Community Consortium White Paper

Via

Access Paper or Ask Questions

Lifted Representation of Relational Causal Models Revisited: Implications for Reasoning and Structure Learning

Aug 17, 2015
Sanghack Lee, Vasant Honavar

Figure 1 for Lifted Representation of Relational Causal Models Revisited: Implications for Reasoning and Structure Learning

Figure 2 for Lifted Representation of Relational Causal Models Revisited: Implications for Reasoning and Structure Learning

Figure 3 for Lifted Representation of Relational Causal Models Revisited: Implications for Reasoning and Structure Learning

Figure 4 for Lifted Representation of Relational Causal Models Revisited: Implications for Reasoning and Structure Learning

Maier et al. (2010) introduced the relational causal model (RCM) for representing and inferring causal relationships in relational data. A lifted representation, called abstract ground graph (AGG), plays a central role in reasoning with and learning of RCM. The correctness of the algorithm proposed by Maier et al. (2013a) for learning RCM from data relies on the soundness and completeness of AGG for relational d-separation to reduce the learning of an RCM to learning of an AGG. We revisit the definition of AGG and show that AGG, as defined in Maier et al. (2013b), does not correctly abstract all ground graphs. We revise the definition of AGG to ensure that it correctly abstracts all ground graphs. We further show that AGG representation is not complete for relational d-separation, that is, there can exist conditional independence relations in an RCM that are not entailed by AGG. A careful examination of the relationship between the lack of completeness of AGG for relational d-separation and faithfulness conditions suggests that weaker notions of completeness, namely adjacency faithfulness and orientation faithfulness between an RCM and its AGG, can be used to learn an RCM from data.

* Workshop on Advances in Causal Inference, Conference on Uncertainty in Artificial Intelligence, 2015

Via

Access Paper or Ask Questions

CRISNER: A Practically Efficient Reasoner for Qualitative Preferences

Jul 30, 2015
Ganesh Ram Santhanam, Samik Basu, Vasant Honavar

Figure 1 for CRISNER: A Practically Efficient Reasoner for Qualitative Preferences

Figure 2 for CRISNER: A Practically Efficient Reasoner for Qualitative Preferences

Figure 3 for CRISNER: A Practically Efficient Reasoner for Qualitative Preferences

Figure 4 for CRISNER: A Practically Efficient Reasoner for Qualitative Preferences

We present CRISNER (Conditional & Relative Importance Statement Network PrEference Reasoner), a tool that provides practically efficient as well as exact reasoning about qualitative preferences in popular ceteris paribus preference languages such as CP-nets, TCP-nets, CP-theories, etc. The tool uses a model checking engine to translate preference specifications and queries into appropriate Kripke models and verifiable properties over them respectively. The distinguishing features of the tool are: (1) exact and provably correct query answering for testing dominance, consistency with respect to a preference specification, and testing equivalence and subsumption of two sets of preferences; (2) automatic generation of proofs evidencing the correctness of answer produced by CRISNER to any of the above queries; (3) XML inputs and outputs that make it portable and pluggable into other applications. We also describe the extensible architecture of CRISNER, which can be extended to new reference formalisms based on ceteris paribus semantics that may be developed in the future.

* 15 pages

Via

Access Paper or Ask Questions

Representing and Reasoning with Qualitative Preferences for Compositional Systems

Jan 16, 2014
Ganesh Ram Santhanam, Samik Basu, Vasant Honavar

Figure 1 for Representing and Reasoning with Qualitative Preferences for Compositional Systems

Figure 2 for Representing and Reasoning with Qualitative Preferences for Compositional Systems

Figure 3 for Representing and Reasoning with Qualitative Preferences for Compositional Systems

Figure 4 for Representing and Reasoning with Qualitative Preferences for Compositional Systems

Many applications, e.g., Web service composition, complex system design, team formation, etc., rely on methods for identifying collections of objects or entities satisfying some functional requirement. Among the collections that satisfy the functional requirement, it is often necessary to identify one or more collections that are optimal with respect to user preferences over a set of attributes that describe the non-functional properties of the collection. We develop a formalism that lets users express the relative importance among attributes and qualitative preferences over the valuations of each attribute. We define a dominance relation that allows us to compare collections of objects in terms of preferences over attributes of the objects that make up the collection. We establish some key properties of the dominance relation. In particular, we show that the dominance relation is a strict partial order when the intra-attribute preference relations are strict partial orders and the relative importance preference relation is an interval order. We provide algorithms that use this dominance relation to identify the set of most preferred collections. We show that under certain conditions, the algorithms are guaranteed to return only (sound), all (complete), or at least one (weakly complete) of the most preferred collections. We present results of simulation experiments comparing the proposed algorithms with respect to (a) the quality of solutions (number of most preferred solutions) produced by the algorithms, and (b) their performance and efficiency. We also explore some interesting conjectures suggested by the results of our experiments that relate the properties of the user preferences, the dominance relation, and the algorithms.

* Journal Of Artificial Intelligence Research, Volume 42, pages 211-274, 2011

Via

Access Paper or Ask Questions

Efficient Markov Network Structure Discovery Using Independence Tests

Jan 15, 2014
Facundo Bromberg, Dimitris Margaritis, Vasant Honavar

Figure 1 for Efficient Markov Network Structure Discovery Using Independence Tests

Figure 2 for Efficient Markov Network Structure Discovery Using Independence Tests

Figure 3 for Efficient Markov Network Structure Discovery Using Independence Tests

Figure 4 for Efficient Markov Network Structure Discovery Using Independence Tests

We present two algorithms for learning the structure of a Markov network from data: GSMN* and GSIMN. Both algorithms use statistical independence tests to infer the structure by successively constraining the set of structures consistent with the results of these tests. Until very recently, algorithms for structure learning were based on maximum likelihood estimation, which has been proved to be NP-hard for Markov networks due to the difficulty of estimating the parameters of the network, needed for the computation of the data likelihood. The independence-based approach does not require the computation of the likelihood, and thus both GSMN* and GSIMN can compute the structure efficiently (as shown in our experiments). GSMN* is an adaptation of the Grow-Shrink algorithm of Margaritis and Thrun for learning the structure of Bayesian networks. GSIMN extends GSMN* by additionally exploiting Pearls well-known properties of the conditional independence relation to infer novel independences from known ones, thus avoiding the performance of statistical tests to estimate them. To accomplish this efficiently GSIMN uses the Triangle theorem, also introduced in this work, which is a simplified version of the set of Markov axioms. Experimental comparisons on artificial and real-world data sets show GSIMN can yield significant savings with respect to GSMN*, while generating a Markov network with comparable or in some cases improved quality. We also compare GSIMN to a forward-chaining implementation, called GSIMN-FCH, that produces all possible conditional independences resulting from repeatedly applying Pearls theorems on the known conditional independence tests. The results of this comparison show that GSIMN, by the sole use of the Triangle theorem, is nearly optimal in terms of the set of independences tests that it infers.

* Journal Of Artificial Intelligence Research, Volume 35, pages 449-484, 2009

Via

Access Paper or Ask Questions

Causal Transportability of Experiments on Controllable Subsets of Variables: z-Transportability

Sep 26, 2013
Sanghack Lee, Vasant Honavar

Figure 1 for Causal Transportability of Experiments on Controllable Subsets of Variables: z-Transportability

Figure 2 for Causal Transportability of Experiments on Controllable Subsets of Variables: z-Transportability

Figure 3 for Causal Transportability of Experiments on Controllable Subsets of Variables: z-Transportability

We introduce z-transportability, the problem of estimating the causal effect of a set of variables X on another set of variables Y in a target domain from experiments on any subset of controllable variables Z where Z is an arbitrary subset of observable variables V in a source domain. z-Transportability generalizes z-identifiability, the problem of estimating in a given domain the causal effect of X on Y from surrogate experiments on a set of variables Z such that Z is disjoint from X;. z-Transportability also generalizes transportability which requires that the causal effect of X on Y in the target domain be estimable from experiments on any subset of all observable variables in the source domain. We first generalize z-identifiability to allow cases where Z is not necessarily disjoint from X. Then, we establish a necessary and sufficient condition for z-transportability in terms of generalized z-identifiability and transportability. We provide a correct and complete algorithm that determines whether a causal effect is z-transportable; and if it is, produces a transport formula, that is, a recipe for estimating the causal effect of X on Y in the target domain using information elicited from the results of experimental manipulations of Z in the source domain and observational data from the target domain. Our results also show that do-calculus is complete for z-transportability.

* Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013)

Via

Access Paper or Ask Questions