Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ricardo Silva

Causal discovery under a confounder blanket

May 11, 2022

David Watson, Ricardo Silva

Figure 1 for Causal discovery under a confounder blanket

Figure 2 for Causal discovery under a confounder blanket

Figure 3 for Causal discovery under a confounder blanket

Figure 4 for Causal discovery under a confounder blanket

Abstract:Inferring causal relationships from observational data is rarely straightforward, but the problem is especially difficult in high dimensions. For these applications, causal discovery algorithms typically require parametric restrictions or extreme sparsity constraints. We relax these assumptions and focus on an important but more specialized problem, namely recovering a directed acyclic subgraph of variables known to be causally descended from some (possibly large) set of confounding covariates, i.e. a $\textit{confounder blanket}$. This is useful in many settings, for example when studying a dynamic biomolecular subsystem with genetic data providing causally relevant background information. Under a structural assumption that, we argue, must be satisfied in practice if informative answers are to be found, our method accommodates graphs of low or high sparsity while maintaining polynomial time complexity. We derive a sound and complete algorithm for identifying causal relationships under these conditions and implement testing procedures with provable error control for linear and nonlinear systems. We demonstrate our approach on a range of simulation settings.

Via

Access Paper or Ask Questions

The Causal Marginal Polytope for Bounding Treatment Effects

Feb 28, 2022

Jakob Zeitler, Ricardo Silva

Figure 1 for The Causal Marginal Polytope for Bounding Treatment Effects

Figure 2 for The Causal Marginal Polytope for Bounding Treatment Effects

Figure 3 for The Causal Marginal Polytope for Bounding Treatment Effects

Figure 4 for The Causal Marginal Polytope for Bounding Treatment Effects

Abstract:Due to unmeasured confounding, it is often not possible to identify causal effects from a postulated model. Nevertheless, we can ask for partial identification, which usually boils down to finding upper and lower bounds of a causal quantity of interest derived from all solutions compatible with the encoded structural assumptions. One appealing way to derive such bounds is by casting it in terms of a constrained optimization method that searches over all causal models compatible with evidence, as introduced in the classic work of Balke and Pearl (1994) for discrete data. Although by construction this guarantees tight bounds, it poses a formidable computational challenge. To cope with this issue, alternatives include algorithms that are not guaranteed to be tight, or by introducing restrictions on the class of models. In this paper, we introduce a novel alternative: inspired by ideas coming from belief propagation, we enforce compatibility between marginals of a causal model and data, without constructing a global causal model. We call this collection of locally consistent marginals the causal marginal polytope. As global independence constraints disappear when considering small dimensional tractable marginals, this also leads to a rethinking of how to elicit and express causal knowledge. We provide an explicit algorithm and implementation of this idea, and assess its practicality with numerical experiments.

Via

Access Paper or Ask Questions

Stochastic Causal Programming for Bounding Treatment Effects

Feb 22, 2022

Kirtan Padh, Jakob Zeitler, David Watson, Matt Kusner, Ricardo Silva, Niki Kilbertus

Figure 1 for Stochastic Causal Programming for Bounding Treatment Effects

Figure 2 for Stochastic Causal Programming for Bounding Treatment Effects

Figure 3 for Stochastic Causal Programming for Bounding Treatment Effects

Figure 4 for Stochastic Causal Programming for Bounding Treatment Effects

Abstract:Causal effect estimation is important for numerous tasks in the natural and social sciences. However, identifying effects is impossible from observational data without making strong, often untestable assumptions. We consider algorithms for the partial identification problem, bounding treatment effects from multivariate, continuous treatments over multiple possible causal models when unmeasured confounding makes identification impossible. We consider a framework where observable evidence is matched to the implications of constraints encoded in a causal model by norm-based criteria. This generalizes classical approaches based purely on generative models. Casting causal effects as objective functions in a constrained optimization problem, we combine flexible learning algorithms with Monte Carlo methods to implement a family of solutions under the name of stochastic causal programming. In particular, we present ways by which such constrained optimization problems can be parameterized without likelihood functions for the causal or the observed data model, reducing the computational and statistical complexity of the task.

Via

Access Paper or Ask Questions

Questions for Flat-Minima Optimization of Modern Neural Networks

Feb 02, 2022

Jean Kaddour, Linqing Liu, Ricardo Silva, Matt J. Kusner

Figure 1 for Questions for Flat-Minima Optimization of Modern Neural Networks

Figure 2 for Questions for Flat-Minima Optimization of Modern Neural Networks

Figure 3 for Questions for Flat-Minima Optimization of Modern Neural Networks

Figure 4 for Questions for Flat-Minima Optimization of Modern Neural Networks

Abstract:For training neural networks, flat-minima optimizers that seek to find parameters in neighborhoods having uniformly low loss (flat minima) have been shown to improve upon stochastic and adaptive gradient-based methods. Two methods for finding flat minima stand out: 1. Averaging methods (i.e., Stochastic Weight Averaging, SWA), and 2. Minimax methods (i.e., Sharpness Aware Minimization, SAM). However, despite similar motivations, there has been limited investigation into their properties and no comprehensive comparison between them. In this work, we investigate the loss surfaces from a systematic benchmarking of these approaches across computer vision, natural language processing, and graph learning tasks. The results lead to a simple hypothesis: since both approaches find different flat solutions, combining them should improve generalization even further. We verify this improves over either flat-minima approach in 39 out of 42 cases. When it does not, we investigate potential reasons. We hope our results across image, graph, and text data will help researchers to improve deep learning optimizers, and practitioners to pinpoint the optimizer for the problem at hand.

Via

Access Paper or Ask Questions

Graph Intervention Networks for Causal Effect Estimation

Jun 18, 2021

Jean Kaddour, Qi Liu, Yuchen Zhu, Matt J. Kusner, Ricardo Silva

Figure 1 for Graph Intervention Networks for Causal Effect Estimation

Figure 2 for Graph Intervention Networks for Causal Effect Estimation

Figure 3 for Graph Intervention Networks for Causal Effect Estimation

Figure 4 for Graph Intervention Networks for Causal Effect Estimation

Abstract:We address the estimation of conditional average treatment effects (CATEs) when treatments are graph-structured (e.g., molecular graphs of drugs). Given a weak condition on the effect, we propose a plug-in estimator that decomposes CATE estimation into separate, simpler optimization problems. Our estimator (a) isolates the causal estimands (reducing regularization bias), and (b) allows one to plug in arbitrary models for learning. In experiments with small-world and molecular graphs, we show that our approach outperforms prior approaches and is robust to varying selection biases. Our implementation is online.

Via

Access Paper or Ask Questions

Operationalizing Complex Causes: A Pragmatic View of Mediation

Jun 10, 2021

Limor Gultchin, David S. Watson, Matt J. Kusner, Ricardo Silva

Figure 1 for Operationalizing Complex Causes: A Pragmatic View of Mediation

Figure 2 for Operationalizing Complex Causes: A Pragmatic View of Mediation

Figure 3 for Operationalizing Complex Causes: A Pragmatic View of Mediation

Figure 4 for Operationalizing Complex Causes: A Pragmatic View of Mediation

Abstract:We examine the problem of causal response estimation for complex objects (e.g., text, images, genomics). In this setting, classical \emph{atomic} interventions are often not available (e.g., changes to characters, pixels, DNA base-pairs). Instead, we only have access to indirect or \emph{crude} interventions (e.g., enrolling in a writing program, modifying a scene, applying a gene therapy). In this work, we formalize this problem and provide an initial solution. Given a collection of candidate mediators, we propose (a) a two-step method for predicting the causal responses of crude interventions; and (b) a testing procedure to identify mediators of crude interventions. We demonstrate, on a range of simulated and real-world-inspired examples, that our approach allows us to efficiently estimate the effect of crude interventions with limited data from new treatment regimes.

* International Conference on Machine Learning 2021

Via

Access Paper or Ask Questions

Proximal Causal Learning with Kernels: Two-Stage Estimation and Moment Restriction

Jun 06, 2021

Afsaneh Mastouri, Yuchen Zhu, Limor Gultchin, Anna Korba, Ricardo Silva, Matt J. Kusner, Arthur Gretton, Krikamol Muandet

Figure 1 for Proximal Causal Learning with Kernels: Two-Stage Estimation and Moment Restriction

Figure 2 for Proximal Causal Learning with Kernels: Two-Stage Estimation and Moment Restriction

Figure 3 for Proximal Causal Learning with Kernels: Two-Stage Estimation and Moment Restriction

Figure 4 for Proximal Causal Learning with Kernels: Two-Stage Estimation and Moment Restriction

Abstract:We address the problem of causal effect estimation in the presence of unobserved confounding, but where proxies for the latent confounder(s) are observed. We propose two kernel-based methods for nonlinear causal effect estimation in this setting: (a) a two-stage regression approach, and (b) a maximum moment restriction approach. We focus on the proximal causal learning setting, but our methods can be used to solve a wider class of inverse problems characterised by a Fredholm integral equation. In particular, we provide a unifying view of two-stage and moment restriction approaches for solving this problem in a nonlinear setting. We provide consistency guarantees for each algorithm, and we demonstrate these approaches achieve competitive results on synthetic data and data simulating a real-world task. In particular, our approach outperforms earlier methods that are not suited to leveraging proxy variables.

Via

Access Paper or Ask Questions

Learning Joint Nonlinear Effects from Single-variable Interventions in the Presence of Hidden Confounders

Jun 16, 2020

Sorawit Saengkyongam, Ricardo Silva

Figure 1 for Learning Joint Nonlinear Effects from Single-variable Interventions in the Presence of Hidden Confounders

Figure 2 for Learning Joint Nonlinear Effects from Single-variable Interventions in the Presence of Hidden Confounders

Figure 3 for Learning Joint Nonlinear Effects from Single-variable Interventions in the Presence of Hidden Confounders

Figure 4 for Learning Joint Nonlinear Effects from Single-variable Interventions in the Presence of Hidden Confounders

Abstract:We propose an approach to estimate the effect of multiple simultaneous interventions in the presence of hidden confounders. To overcome the problem of hidden confounding, we consider the setting where we have access to not only the observational data but also sets of single-variable interventions in which each of the treatment variables is intervened on separately. We prove identifiability under the assumption that the data is generated from a nonlinear continuous structural causal model with additive Gaussian noise. In addition, we propose a simple parameter estimation method by pooling all the data from different regimes and jointly maximizing the combined likelihood. We also conduct comprehensive experiments to verify the identifiability result as well as to compare the performance of our approach against a baseline on both synthetic and real-world data.

* Accepted to The Conference on Uncertainty in Artificial Intelligence (UAI) 2020

Via

Access Paper or Ask Questions

A Class of Algorithms for General Instrumental Variable Models

Jun 11, 2020

Niki Kilbertus, Matt J. Kusner, Ricardo Silva

Figure 1 for A Class of Algorithms for General Instrumental Variable Models

Figure 2 for A Class of Algorithms for General Instrumental Variable Models

Figure 3 for A Class of Algorithms for General Instrumental Variable Models

Figure 4 for A Class of Algorithms for General Instrumental Variable Models

Abstract:Causal treatment effect estimation is a key problem that arises in a variety of real-world settings, from personalized medicine to governmental policy making. There has been a flurry of recent work in machine learning on estimating causal effects when one has access to an instrument. However, to achieve identifiability, they in general require one-size-fits-all assumptions such as an additive error model for the outcome. An alternative is partial identification, which provides bounds on the causal effect. Little exists in terms of bounding methods that can deal with the most general case, where the treatment itself can be continuous. Moreover, bounding methods generally do not allow for a continuum of assumptions on the shape of the causal effect that can smoothly trade off stronger background knowledge for more informative bounds. In this work, we provide a method for causal effect bounding in continuous distributions, leveraging recent advances in gradient-based methods for the optimization of computationally intractable objective functions. We demonstrate on a set of synthetic and real-world data that our bounds capture the causal effect when additive methods fail, providing a useful range of answers compatible with observation as opposed to relying on unwarranted structural assumptions.

* Code at https://github.com/nikikilbertus/general-iv-models

Via

Access Paper or Ask Questions

Differentiable Causal Backdoor Discovery

Mar 03, 2020

Limor Gultchin, Matt J. Kusner, Varun Kanade, Ricardo Silva

Figure 1 for Differentiable Causal Backdoor Discovery

Figure 2 for Differentiable Causal Backdoor Discovery

Figure 3 for Differentiable Causal Backdoor Discovery

Figure 4 for Differentiable Causal Backdoor Discovery

Abstract:Discovering the causal effect of a decision is critical to nearly all forms of decision-making. In particular, it is a key quantity in drug development, in crafting government policy, and when implementing a real-world machine learning system. Given only observational data, confounders often obscure the true causal effect. Luckily, in some cases, it is possible to recover the causal effect by using certain observed variables to adjust for the effects of confounders. However, without access to the true causal model, finding this adjustment requires brute-force search. In this work, we present an algorithm that exploits auxiliary variables, similar to instruments, in order to find an appropriate adjustment by a gradient-based optimization method. We demonstrate that it outperforms practical alternatives in estimating the true causal effect, without knowledge of the full causal graph.

* Published in the Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS) 2020, Palermo, Italy

Via

Access Paper or Ask Questions