Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Shohei Shimizu

Combining Linear Non-Gaussian Acyclic Model with Logistic Regression Model for Estimating Causal Structure from Mixed Continuous and Discrete Data

Feb 16, 2018
Chao Li, Shohei Shimizu

Figure 1 for Combining Linear Non-Gaussian Acyclic Model with Logistic Regression Model for Estimating Causal Structure from Mixed Continuous and Discrete Data

Figure 2 for Combining Linear Non-Gaussian Acyclic Model with Logistic Regression Model for Estimating Causal Structure from Mixed Continuous and Discrete Data

Figure 3 for Combining Linear Non-Gaussian Acyclic Model with Logistic Regression Model for Estimating Causal Structure from Mixed Continuous and Discrete Data

Estimating causal models from observational data is a crucial task in data analysis. For continuous-valued data, Shimizu et al. have proposed a linear acyclic non-Gaussian model to understand the data generating process, and have shown that their model is identifiable when the number of data is sufficiently large. However, situations in which continuous and discrete variables coexist in the same problem are common in practice. Most existing causal discovery methods either ignore the discrete data and apply a continuous-valued algorithm or discretize all the continuous data and then apply a discrete Bayesian network approach. These methods possibly loss important information when we ignore discrete data or introduce the approximation error due to discretization. In this paper, we define a novel hybrid causal model which consists of both continuous and discrete variables. The model assumes: (1) the value of a continuous variable is a linear function of its parent variables plus a non-Gaussian noise, and (2) each discrete variable is a logistic variable whose distribution parameters depend on the values of its parent variables. In addition, we derive the BIC scoring function for model selection. The new discovery algorithm can learn causal structures from mixed continuous and discrete data without discretization. We empirically demonstrate the power of our method through thorough simulations.

Via

Access Paper or Ask Questions

Estimation of interventional effects of features on prediction

Sep 03, 2017
Patrick Blöbaum, Shohei Shimizu

Figure 1 for Estimation of interventional effects of features on prediction

Figure 2 for Estimation of interventional effects of features on prediction

The interpretability of prediction mechanisms with respect to the underlying prediction problem is often unclear. While several studies have focused on developing prediction models with meaningful parameters, the causal relationships between the predictors and the actual prediction have not been considered. Here, we connect the underlying causal structure of a data generation process and the causal structure of a prediction mechanism. To achieve this, we propose a framework that identifies the feature with the greatest causal influence on the prediction and estimates the necessary causal intervention of a feature such that a desired prediction is obtained. The general concept of the framework has no restrictions regarding data linearity; however, we focus on an implementation for linear data here. The framework applicability is evaluated using artificial data and demonstrated using real-world data.

* To appear in Proc. IEEE International Workshop on Machine Learning for Signal Processing (MLSP2017)

Via

Access Paper or Ask Questions

Error Asymmetry in Causal and Anticausal Regression

Apr 17, 2017
Patrick Blöbaum, Takashi Washio, Shohei Shimizu

Figure 1 for Error Asymmetry in Causal and Anticausal Regression

Figure 2 for Error Asymmetry in Causal and Anticausal Regression

Figure 3 for Error Asymmetry in Causal and Anticausal Regression

Figure 4 for Error Asymmetry in Causal and Anticausal Regression

It is generally difficult to make any statements about the expected prediction error in an univariate setting without further knowledge about how the data were generated. Recent work showed that knowledge about the real underlying causal structure of a data generation process has implications for various machine learning settings. Assuming an additive noise and an independence between data generating mechanism and its input, we draw a novel connection between the intrinsic causal relationship of two variables and the expected prediction error. We formulate the theorem that the expected error of the true data generating function as prediction model is generally smaller when the effect is predicted from its cause and, on the contrary, greater when the cause is predicted from its effect. The theorem implies an asymmetry in the error depending on the prediction direction. This is further corroborated with empirical evaluations in artificial and real-world data sets.

* Behaviormetrika, 2017, 10.1007/s41237-017-0022-z

Via

Access Paper or Ask Questions

Learning Instrumental Variables with Non-Gaussianity Assumptions: Theoretical Limitations and Practical Algorithms

Nov 09, 2015
Ricardo Silva, Shohei Shimizu

Figure 1 for Learning Instrumental Variables with Non-Gaussianity Assumptions: Theoretical Limitations and Practical Algorithms

Figure 2 for Learning Instrumental Variables with Non-Gaussianity Assumptions: Theoretical Limitations and Practical Algorithms

Figure 3 for Learning Instrumental Variables with Non-Gaussianity Assumptions: Theoretical Limitations and Practical Algorithms

Figure 4 for Learning Instrumental Variables with Non-Gaussianity Assumptions: Theoretical Limitations and Practical Algorithms

Learning a causal effect from observational data is not straightforward, as this is not possible without further assumptions. If hidden common causes between treatment $X$ and outcome $Y$ cannot be blocked by other measurements, one possibility is to use an instrumental variable. In principle, it is possible under some assumptions to discover whether a variable is structurally instrumental to a target causal effect $X \rightarrow Y$, but current frameworks are somewhat lacking on how general these assumptions can be. A instrumental variable discovery problem is challenging, as no variable can be tested as an instrument in isolation but only in groups, but different variables might require different conditions to be considered an instrument. Moreover, identification constraints might be hard to detect statistically. In this paper, we give a theoretical characterization of instrumental variable discovery, highlighting identifiability problems and solutions, the need for non-Gaussianity assumptions, and how they fit within existing methods.

* 12 pages, 4 figures

Via

Access Paper or Ask Questions

A direct method for estimating a causal ordering in a linear non-Gaussian acyclic model

Aug 09, 2014
Shohei Shimizu, Aapo Hyvarinen, Yoshinobu Kawahara

Figure 1 for A direct method for estimating a causal ordering in a linear non-Gaussian acyclic model

Figure 2 for A direct method for estimating a causal ordering in a linear non-Gaussian acyclic model

Figure 3 for A direct method for estimating a causal ordering in a linear non-Gaussian acyclic model

Figure 4 for A direct method for estimating a causal ordering in a linear non-Gaussian acyclic model

Structural equation models and Bayesian networks have been widely used to analyze causal relations between continuous variables. In such frameworks, linear acyclic models are typically used to model the datagenerating process of variables. Recently, it was shown that use of non-Gaussianity identifies a causal ordering of variables in a linear acyclic model without using any prior knowledge on the network structure, which is not the case with conventional methods. However, existing estimation methods are based on iterative search algorithms and may not converge to a correct solution in a finite number of steps. In this paper, we propose a new direct method to estimate a causal ordering based on non-Gaussianity. In contrast to the previous methods, our algorithm requires no algorithmic parameters and is guaranteed to converge to the right solution within a small fixed number of steps if the data strictly follows the model.

* Appears in Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI2009)

Via

Access Paper or Ask Questions

A Bayesian estimation approach to analyze non-Gaussian data-generating processes with latent classes

Aug 02, 2014
Naoki Tanaka, Shohei Shimizu, Takashi Washio

Figure 1 for A Bayesian estimation approach to analyze non-Gaussian data-generating processes with latent classes

Figure 2 for A Bayesian estimation approach to analyze non-Gaussian data-generating processes with latent classes

A large amount of observational data has been accumulated in various fields in recent times, and there is a growing need to estimate the generating processes of these data. A linear non-Gaussian acyclic model (LiNGAM) based on the non-Gaussianity of external influences has been proposed to estimate the data-generating processes of variables. However, the results of the estimation can be biased if there are latent classes. In this paper, we first review LiNGAM, its extended model, as well as the estimation procedure for LiNGAM in a Bayesian framework. We then propose a new Bayesian estimation procedure that solves the problem.

* 10 pages, 1 figures

Via

Access Paper or Ask Questions

Bayesian estimation of possible causal direction in the presence of latent confounders using a linear non-Gaussian acyclic structural equation model with individual-specific effects

May 20, 2014
Shohei Shimizu, Kenneth Bollen

Figure 1 for Bayesian estimation of possible causal direction in the presence of latent confounders using a linear non-Gaussian acyclic structural equation model with individual-specific effects

Figure 2 for Bayesian estimation of possible causal direction in the presence of latent confounders using a linear non-Gaussian acyclic structural equation model with individual-specific effects

Figure 3 for Bayesian estimation of possible causal direction in the presence of latent confounders using a linear non-Gaussian acyclic structural equation model with individual-specific effects

Figure 4 for Bayesian estimation of possible causal direction in the presence of latent confounders using a linear non-Gaussian acyclic structural equation model with individual-specific effects

We consider learning the possible causal direction of two observed variables in the presence of latent confounding variables. Several existing methods have been shown to consistently estimate causal direction assuming linear or some type of nonlinear relationship and no latent confounders. However, the estimation results could be distorted if either assumption is actually violated. In this paper, we first propose a new linear non-Gaussian acyclic structural equation model with individual-specific effects that allows latent confounders to be considered. We then propose an empirical Bayesian approach for estimating possible causal direction using the new model. We demonstrate the effectiveness of our method using artificial and real-world data.

* 21 pages, 4 figures. A revised version was accepted at Journal of Machine Learning Research

Via

Access Paper or Ask Questions

Causal Discovery in a Binary Exclusive-or Skew Acyclic Model: BExSAM

Jan 22, 2014
Takanori Inazumi, Takashi Washio, Shohei Shimizu, Joe Suzuki, Akihiro Yamamoto, Yoshinobu Kawahara

Figure 1 for Causal Discovery in a Binary Exclusive-or Skew Acyclic Model: BExSAM

Figure 2 for Causal Discovery in a Binary Exclusive-or Skew Acyclic Model: BExSAM

Figure 3 for Causal Discovery in a Binary Exclusive-or Skew Acyclic Model: BExSAM

Figure 4 for Causal Discovery in a Binary Exclusive-or Skew Acyclic Model: BExSAM

Discovering causal relations among observed variables in a given data set is a major objective in studies of statistics and artificial intelligence. Recently, some techniques to discover a unique causal model have been explored based on non-Gaussianity of the observed data distribution. However, most of these are limited to continuous data. In this paper, we present a novel causal model for binary data and propose an efficient new approach to deriving the unique causal model governing a given binary data set under skew distributions of external binary noises. Experimental evaluation shows excellent performance for both artificial and real world data sets.

* 10 pages. A longer version of our UAI2011 paper (Inazumi et al., 2011). arXiv admin note: text overlap with arXiv:1202.3736

Via

Access Paper or Ask Questions

Identifiability of an Integer Modular Acyclic Additive Noise Model and its Causal Structure Discovery

Jan 22, 2014
Joe Suzuki, Takanori Inazumi, Takashi Washio, Shohei Shimizu

Figure 1 for Identifiability of an Integer Modular Acyclic Additive Noise Model and its Causal Structure Discovery

Figure 2 for Identifiability of an Integer Modular Acyclic Additive Noise Model and its Causal Structure Discovery

Figure 3 for Identifiability of an Integer Modular Acyclic Additive Noise Model and its Causal Structure Discovery

Figure 4 for Identifiability of an Integer Modular Acyclic Additive Noise Model and its Causal Structure Discovery

The notion of causality is used in many situations dealing with uncertainty. We consider the problem whether causality can be identified given data set generated by discrete random variables rather than continuous ones. In particular, for non-binary data, thus far it was only known that causality can be identified except rare cases. In this paper, we present necessary and sufficient condition for an integer modular acyclic additive noise (IMAN) of two variables. In addition, we relate bivariate and multivariate causal identifiability in a more explicit manner, and develop a practical algorithm to find the order of variables and their parent sets. We demonstrate its performance in applications to artificial data and real world body motion data with comparisons to conventional methods.

* 30 pages, 4 figures

Via

Access Paper or Ask Questions

ParceLiNGAM: A causal ordering method robust against latent confounders

Jul 29, 2013
Tatsuya Tashiro, Shohei Shimizu, Aapo Hyvarinen, Takashi Washio

We consider learning a causal ordering of variables in a linear non-Gaussian acyclic model called LiNGAM. Several existing methods have been shown to consistently estimate a causal ordering assuming that all the model assumptions are correct. But, the estimation results could be distorted if some assumptions actually are violated. In this paper, we propose a new algorithm for learning causal orders that is robust against one typical violation of the model assumptions: latent confounders. The key idea is to detect latent confounders by testing independence between estimated external influences and find subsets (parcels) that include variables that are not affected by latent confounders. We demonstrate the effectiveness of our method using artificial data and simulated brain imaging data.

* A revised version of this was accepted in Neural Computation. 18 pages and 5 figures. arXiv admin note: substantial text overlap with arXiv:1204.1795

Via

Access Paper or Ask Questions