Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yi Mao

Domain Knowledge Uncertainty and Probabilistic Parameter Constraints

May 09, 2012

Yi Mao, Guy Lebanon

Figure 1 for Domain Knowledge Uncertainty and Probabilistic Parameter Constraints

Figure 2 for Domain Knowledge Uncertainty and Probabilistic Parameter Constraints

Figure 3 for Domain Knowledge Uncertainty and Probabilistic Parameter Constraints

Figure 4 for Domain Knowledge Uncertainty and Probabilistic Parameter Constraints

Abstract:Incorporating domain knowledge into the modeling process is an effective way to improve learning accuracy. However, as it is provided by humans, domain knowledge can only be specified with some degree of uncertainty. We propose to explicitly model such uncertainty through probabilistic constraints over the parameter space. In contrast to hard parameter constraints, our approach is effective also when the domain knowledge is inaccurate and generally results in superior modeling accuracy. We focus on generative and conditional modeling where the parameters are assigned a Dirichlet or Gaussian prior and demonstrate the framework with experiments on both synthetic and real-world data.

* Appears in Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI2009)

Via

Access Paper or Ask Questions

Linguistic Geometries for Unsupervised Dimensionality Reduction

Mar 02, 2010

Yi Mao, Krishnakumar Balasubramanian, Guy Lebanon

Figure 1 for Linguistic Geometries for Unsupervised Dimensionality Reduction

Figure 2 for Linguistic Geometries for Unsupervised Dimensionality Reduction

Figure 3 for Linguistic Geometries for Unsupervised Dimensionality Reduction

Figure 4 for Linguistic Geometries for Unsupervised Dimensionality Reduction

Abstract:Text documents are complex high dimensional objects. To effectively visualize such data it is important to reduce its dimensionality and visualize the low dimensional embedding as a 2-D or 3-D scatter plot. In this paper we explore dimensionality reduction methods that draw upon domain knowledge in order to achieve a better low dimensional embedding and visualization of documents. We consider the use of geometries specified manually by an expert, geometries derived automatically from corpus statistics, and geometries computed from linguistic resources.

* 13 pages, 15 figures

Via

Access Paper or Ask Questions