Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Finale Doshi-Velez

Unsupervised Grammar Induction with Depth-bounded PCFG

Feb 26, 2018

Lifeng Jin, Finale Doshi-Velez, Timothy Miller, William Schuler, Lane Schwartz

Abstract:There has been recent interest in applying cognitively or empirically motivated bounds on recursion depth to limit the search space of grammar induction models (Ponvert et al., 2011; Noji and Johnson, 2016; Shain et al., 2016). This work extends this depth-bounding approach to probabilistic context-free grammar induction (DB-PCFG), which has a smaller parameter space than hierarchical sequence models, and therefore more fully exploits the space reductions of depth-bounding. Results for this model on grammar acquisition from transcribed child-directed speech and newswire text exceed or are competitive with those of other models when evaluated on parse accuracy. Moreover, gram- mars acquired from this model demonstrate a consistent use of category labels, something which has not been demonstrated by other acquisition models.

* Accepted by Transactions of the Association for Computational Linguistics

Via

Access Paper or Ask Questions

How do Humans Understand Explanations from Machine Learning Systems? An Evaluation of the Human-Interpretability of Explanation

Feb 02, 2018

Menaka Narayanan, Emily Chen, Jeffrey He, Been Kim, Sam Gershman, Finale Doshi-Velez

Figure 1 for How do Humans Understand Explanations from Machine Learning Systems? An Evaluation of the Human-Interpretability of Explanation

Figure 2 for How do Humans Understand Explanations from Machine Learning Systems? An Evaluation of the Human-Interpretability of Explanation

Figure 3 for How do Humans Understand Explanations from Machine Learning Systems? An Evaluation of the Human-Interpretability of Explanation

Figure 4 for How do Humans Understand Explanations from Machine Learning Systems? An Evaluation of the Human-Interpretability of Explanation

Abstract:Recent years have seen a boom in interest in machine learning systems that can provide a human-understandable rationale for their predictions or decisions. However, exactly what kinds of explanation are truly human-interpretable remains poorly understood. This work advances our understanding of what makes explanations interpretable in the specific context of verification. Suppose we have a machine learning system that predicts X, and we provide rationale for this prediction X. Given an input, an explanation, and an output, is the output consistent with the input and the supposed rationale? Via a series of user-studies, we identify what kinds of increases in complexity have the greatest effect on the time it takes for humans to verify the rationale, and which seem relatively insensitive.

Via

Access Paper or Ask Questions

Prediction-Constrained Topic Models for Antidepressant Recommendation

Dec 01, 2017

Michael C. Hughes, Gabriel Hope, Leah Weiner, Thomas H. McCoy, Roy H. Perlis, Erik B. Sudderth, Finale Doshi-Velez

Figure 1 for Prediction-Constrained Topic Models for Antidepressant Recommendation

Figure 2 for Prediction-Constrained Topic Models for Antidepressant Recommendation

Abstract:Supervisory signals can help topic models discover low-dimensional data representations that are more interpretable for clinical tasks. We propose a framework for training supervised latent Dirichlet allocation that balances two goals: faithful generative explanations of high-dimensional data and accurate prediction of associated class labels. Existing approaches fail to balance these goals by not properly handling a fundamental asymmetry: the intended task is always predicting labels from data, not data from labels. Our new prediction-constrained objective trains models that predict labels from heldout data well while also producing good generative likelihoods and interpretable topic-word parameters. In a case study on predicting depression medications from electronic health records, we demonstrate improved recommendations compared to previous supervised topic models and high- dimensional logistic regression from words alone.

* Accepted poster at NIPS 2017 Workshop on Machine Learning for Health (https://ml4health.github.io/2017/)

Via

Access Paper or Ask Questions

Improving the Adversarial Robustness and Interpretability of Deep Neural Networks by Regularizing their Input Gradients

Nov 26, 2017

Andrew Slavin Ross, Finale Doshi-Velez

Figure 1 for Improving the Adversarial Robustness and Interpretability of Deep Neural Networks by Regularizing their Input Gradients

Figure 2 for Improving the Adversarial Robustness and Interpretability of Deep Neural Networks by Regularizing their Input Gradients

Figure 3 for Improving the Adversarial Robustness and Interpretability of Deep Neural Networks by Regularizing their Input Gradients

Figure 4 for Improving the Adversarial Robustness and Interpretability of Deep Neural Networks by Regularizing their Input Gradients

Abstract:Deep neural networks have proven remarkably effective at solving many classification problems, but have been criticized recently for two major weaknesses: the reasons behind their predictions are uninterpretable, and the predictions themselves can often be fooled by small adversarial perturbations. These problems pose major obstacles for the adoption of neural networks in domains that require security or transparency. In this work, we evaluate the effectiveness of defenses that differentiably penalize the degree to which small changes in inputs can alter model predictions. Across multiple attacks, architectures, defenses, and datasets, we find that neural networks trained with this input gradient regularization exhibit robustness to transferred adversarial examples generated to fool all of the other models. We also find that adversarial examples generated to fool gradient-regularized models fool all other models equally well, and actually lead to more "legitimate," interpretable misclassifications as rated by people (which we confirm in a human subject experiment). Finally, we demonstrate that regularizing input gradients makes them more naturally interpretable as rationales for model predictions. We conclude by discussing this relationship between interpretability and robustness in deep neural networks.

* To appear in AAAI 2018

Via

Access Paper or Ask Questions

Accountability of AI Under the Law: The Role of Explanation

Nov 21, 2017

Finale Doshi-Velez, Mason Kortz, Ryan Budish, Chris Bavitz, Sam Gershman, David O'Brien, Stuart Schieber, James Waldo, David Weinberger, Alexandra Wood

Figure 1 for Accountability of AI Under the Law: The Role of Explanation

Figure 2 for Accountability of AI Under the Law: The Role of Explanation

Figure 3 for Accountability of AI Under the Law: The Role of Explanation

Abstract:The ubiquity of systems using artificial intelligence or "AI" has brought increasing attention to how those systems should be regulated. The choice of how to regulate AI systems will require care. AI systems have the potential to synthesize large amounts of data, allowing for greater levels of personalization and precision than ever before---applications range from clinical decision support to autonomous driving and predictive policing. That said, there exist legitimate concerns about the intentional and unintentional negative consequences of AI systems. There are many ways to hold AI systems accountable. In this work, we focus on one: explanation. Questions about a legal right to explanation from AI systems was recently debated in the EU General Data Protection Regulation, and thus thinking carefully about when and how explanation from AI systems might improve accountability is timely. In this work, we review contexts in which explanation is currently required under the law, and then list the technical considerations that must be considered if we desired AI systems that could provide kinds of explanations that are currently required of humans.

Via

Access Paper or Ask Questions

Beyond Sparsity: Tree Regularization of Deep Models for Interpretability

Nov 16, 2017

Mike Wu, Michael C. Hughes, Sonali Parbhoo, Maurizio Zazzi, Volker Roth, Finale Doshi-Velez

Figure 1 for Beyond Sparsity: Tree Regularization of Deep Models for Interpretability

Figure 2 for Beyond Sparsity: Tree Regularization of Deep Models for Interpretability

Figure 3 for Beyond Sparsity: Tree Regularization of Deep Models for Interpretability

Figure 4 for Beyond Sparsity: Tree Regularization of Deep Models for Interpretability

Abstract:The lack of interpretability remains a key barrier to the adoption of deep models in many applications. In this work, we explicitly regularize deep models so human users might step through the process behind their predictions in little time. Specifically, we train deep time-series models so their class-probability predictions have high accuracy while being closely modeled by decision trees with few nodes. Using intuitive toy examples as well as medical tasks for treating sepsis and HIV, we demonstrate that this new tree regularization yields models that are easier for humans to simulate than simpler L1 or L2 penalties without sacrificing predictive power.

* To appear in AAAI 2018. Contains 9-page main paper and appendix with supplementary material

Via

Access Paper or Ask Questions

Uncertainty Decomposition in Bayesian Neural Networks with Latent Variables

Nov 11, 2017

Stefan Depeweg, José Miguel Hernández-Lobato, Finale Doshi-Velez, Steffen Udluft

Figure 1 for Uncertainty Decomposition in Bayesian Neural Networks with Latent Variables

Figure 2 for Uncertainty Decomposition in Bayesian Neural Networks with Latent Variables

Figure 3 for Uncertainty Decomposition in Bayesian Neural Networks with Latent Variables

Figure 4 for Uncertainty Decomposition in Bayesian Neural Networks with Latent Variables

Abstract:Bayesian neural networks (BNNs) with latent variables are probabilistic models which can automatically identify complex stochastic patterns in the data. We describe and study in these models a decomposition of predictive uncertainty into its epistemic and aleatoric components. First, we show how such a decomposition arises naturally in a Bayesian active learning scenario by following an information theoretic approach. Second, we use a similar decomposition to develop a novel risk sensitive objective for safe reinforcement learning (RL). This objective minimizes the effect of model bias in environments whose stochastic dynamics are described by BNNs with latent variables. Our experiments illustrate the usefulness of the resulting decomposition in active learning and safe RL settings.

* This article is superseded by arXiv:1710.07283

Via

Access Paper or Ask Questions

Robust and Efficient Transfer Learning with Hidden-Parameter Markov Decision Processes

Oct 31, 2017

Taylor Killian, Samuel Daulton, George Konidaris, Finale Doshi-Velez

Figure 1 for Robust and Efficient Transfer Learning with Hidden-Parameter Markov Decision Processes

Figure 2 for Robust and Efficient Transfer Learning with Hidden-Parameter Markov Decision Processes

Figure 3 for Robust and Efficient Transfer Learning with Hidden-Parameter Markov Decision Processes

Figure 4 for Robust and Efficient Transfer Learning with Hidden-Parameter Markov Decision Processes

Abstract:We introduce a new formulation of the Hidden Parameter Markov Decision Process (HiP-MDP), a framework for modeling families of related tasks using low-dimensional latent embeddings. Our new framework correctly models the joint uncertainty in the latent parameters and the state space. We also replace the original Gaussian Process-based model with a Bayesian Neural Network, enabling more scalable inference. Thus, we expand the scope of the HiP-MDP to applications with higher dimensions and more complex dynamics.

* To appear at NIPS 2017, selected for an oral presentation. 17 pages (incl references and appendix). Example code can be found at http://github.com/dtak/hip-mdp-public

Via

Access Paper or Ask Questions

Weighted Tensor Decomposition for Learning Latent Variables with Partial Data

Oct 18, 2017

Omer Gottesman, Weiwei Pan, Finale Doshi-Velez

Figure 1 for Weighted Tensor Decomposition for Learning Latent Variables with Partial Data

Figure 2 for Weighted Tensor Decomposition for Learning Latent Variables with Partial Data

Abstract:Tensor decomposition methods are popular tools for learning latent variables given only lower-order moments of the data. However, the standard assumption is that we have sufficient data to estimate these moments to high accuracy. In this work, we consider the case in which certain dimensions of the data are not always observed---common in applied settings, where not all measurements may be taken for all observations---resulting in moment estimates of varying quality. We derive a weighted tensor decomposition approach that is computationally as efficient as the non-weighted approach, and demonstrate that it outperforms methods that do not appropriately leverage these less-observed dimensions.

Via

Access Paper or Ask Questions

Prior matters: simple and general methods for evaluating and improving topic quality in topic modeling

Oct 14, 2017

Angela Fan, Finale Doshi-Velez, Luke Miratrix

Figure 1 for Prior matters: simple and general methods for evaluating and improving topic quality in topic modeling

Figure 2 for Prior matters: simple and general methods for evaluating and improving topic quality in topic modeling

Figure 3 for Prior matters: simple and general methods for evaluating and improving topic quality in topic modeling

Figure 4 for Prior matters: simple and general methods for evaluating and improving topic quality in topic modeling

Abstract:Latent Dirichlet Allocation (LDA) models trained without stopword removal often produce topics with high posterior probabilities on uninformative words, obscuring the underlying corpus content. Even when canonical stopwords are manually removed, uninformative words common in that corpus will still dominate the most probable words in a topic. In this work, we first show how the standard topic quality measures of coherence and pointwise mutual information act counter-intuitively in the presence of common but irrelevant words, making it difficult to even quantitatively identify situations in which topics may be dominated by stopwords. We propose an additional topic quality metric that targets the stopword problem, and show that it, unlike the standard measures, correctly correlates with human judgements of quality. We also propose a simple-to-implement strategy for generating topics that are evaluated to be of much higher quality by both human assessment and our new metric. This approach, a collection of informative priors easily introduced into most LDA-style inference methods, automatically promotes terms with domain relevance and demotes domain-specific stop words. We demonstrate this approach's effectiveness in three very different domains: Department of Labor accident reports, online health forum posts, and NIPS abstracts. Overall we find that current practices thought to solve this problem do not do so adequately, and that our proposal offers a substantial improvement for those interested in interpreting their topics as objects in their own right.

Via

Access Paper or Ask Questions