Abstract: Your computer is continuously executing programs, but does it really understand them? Not in any meaningful sense. That burden falls upon human knowledge workers, who are increasingly asked to write and understand code. They would benefit greatly from intelligent tools that reveal the connections between their code and its subject matter. Towards this prospect, we develop an AI system that forms semantic representations of computer programs, using techniques from knowledge representation and program analysis. We focus on code written for data science, although our method is more generally applicable. The semantic representations are created through a novel algorithm for the semantic enrichment of dataflow graphs. This algorithm is undergirded by a new ontology language for modeling computer programs and a new ontology about data science, written in this language.
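As a toy illustration of what semantic enrichment of a dataflow graph can look like, the Python sketch below annotates a hand-built dataflow graph with concepts from a small made-up ontology; the node names, concept labels, and call-to-concept mapping are illustrative assumptions, not the paper's ontology language or enrichment algorithm.

# Illustrative sketch only: annotate a toy dataflow graph with concepts from a
# tiny, made-up ontology. Nothing here reflects the paper's actual ontology.

# Dataflow graph: node id -> (library call, upstream node ids)
dataflow = {
    "n1": ("pandas.read_csv", []),
    "n2": ("sklearn.linear_model.LinearRegression", []),
    "n3": ("sklearn.linear_model.LinearRegression.fit", ["n2", "n1"]),
}

# Hand-written ontology fragment: library call -> data-science concept
ontology = {
    "pandas.read_csv": "read tabular data source",
    "sklearn.linear_model.LinearRegression": "linear regression model",
    "sklearn.linear_model.LinearRegression.fit": "fit supervised model",
}

def enrich(graph, concepts):
    """Attach a semantic concept (or 'unknown') to every node of the graph."""
    return {
        node: {"call": call, "inputs": inputs,
               "concept": concepts.get(call, "unknown")}
        for node, (call, inputs) in graph.items()
    }

for node, info in enrich(dataflow, ontology).items():
    print(node, info["call"], "->", info["concept"])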
Abstract: This is the Proceedings of the 2018 ICML Workshop on Human Interpretability in Machine Learning (WHI 2018), which was held in Stockholm, Sweden, July 14, 2018. Invited speakers were Barbara Engelhardt, Cynthia Rudin, Fernanda Viégas, and Martin Wattenberg.
Abstract: As artificial intelligence is increasingly affecting all parts of society and life, there is growing recognition that human interpretability of machine learning models is important. It is often argued that accuracy or other similar generalization performance metrics must be sacrificed in order to gain interpretability. Such arguments, however, fail to acknowledge that the overall decision-making system is composed of two entities: the learned model and a human who fuses together model outputs with his or her own information. As such, the relevant performance criteria should be for the entire system, not just for the machine learning component. In this work, we characterize the performance of such two-node tandem data fusion systems using the theory of distributed detection. In doing so, we work in the population setting and model interpretable learned models as multi-level quantizers. We prove that under our abstraction, the overall system of a human with an interpretable classifier outperforms one with a black box classifier.
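To make the quantizer abstraction concrete, here is a small self-contained simulation in the spirit of the setup described above: the interpretable model is treated as a multi-level quantizer of its evidence, and the human fuses the quantized message with independent evidence of their own. The distributions, thresholds, and fusion rule are assumptions chosen for illustration, not the paper's construction or proof.

# Toy tandem fusion: compare a 2-level (decision-only) message with an
# 8-level (more interpretable) message from the model. All modeling choices
# below are placeholders for illustration.
import numpy as np

rng = np.random.default_rng(0)
n = 200_000
y = rng.integers(0, 2, n)                  # true hypothesis
x_model = y + rng.normal(0, 1, n)          # evidence seen by the model
x_human = y + rng.normal(0, 1, n)          # independent evidence seen by the human

def tandem_error(levels):
    # Model quantizes its evidence into `levels` equal-mass cells.
    edges = np.quantile(x_model, np.linspace(0, 1, levels + 1)[1:-1])
    cell = np.digitize(x_model, edges)
    # Human fuses: replace the model's evidence by its cell-conditional mean,
    # add own evidence, and threshold.
    cell_mean = np.array([x_model[cell == c].mean() for c in range(levels)])
    fused = cell_mean[cell] + x_human
    return np.mean((fused > 1.0) != (y == 1))

print("binary message error:", tandem_error(2))
print("8-level message error:", tandem_error(8))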
Abstract: We propose the labeled Čech complex, the plain labeled Vietoris-Rips complex, and the locally scaled labeled Vietoris-Rips complex to perform persistent homology inference of decision boundaries in classification tasks. We provide theoretical conditions and analysis for recovering the homology of a decision boundary from samples. Our main objective is quantification of deep neural network complexity to enable matching of datasets to pre-trained models; we report results for experiments using MNIST, FashionMNIST, and CIFAR10.
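The sketch below gives one rough, self-contained reading of the "labeled" idea at a single scale: only pairs of samples with different class labels that lie within a distance epsilon contribute edges, so the resulting complex concentrates near the decision boundary. The paper's actual labeled Čech and Vietoris-Rips complexes, and the full persistent homology computation over all scales, are defined more carefully than this toy.

# Toy single-scale version of a labeled Vietoris-Rips construction (my own
# simplified reading, not the paper's definition).
import numpy as np

rng = np.random.default_rng(1)
X = rng.uniform(-1, 1, size=(300, 2))
labels = (X[:, 0] ** 2 + X[:, 1] ** 2 > 0.5).astype(int)   # circular decision boundary

def mixed_label_edges(points, y, eps):
    # Edges only between points of different classes within distance eps.
    d = np.linalg.norm(points[:, None, :] - points[None, :, :], axis=-1)
    i, j = np.where((d < eps) & (y[:, None] != y[None, :]))
    return [(a, b) for a, b in zip(i, j) if a < b]

def count_components(n, edges, vertices):
    # Union-find to count connected components among the given vertices.
    parent = list(range(n))
    def find(a):
        while parent[a] != a:
            parent[a] = parent[parent[a]]
            a = parent[a]
        return a
    for a, b in edges:
        parent[find(a)] = find(b)
    return len({find(v) for v in vertices})

edges = mixed_label_edges(X, labels, eps=0.3)
boundary_vertices = {v for e in edges for v in e}
print(len(boundary_vertices), "vertices near the boundary,",
      count_components(len(X), edges, boundary_vertices), "connected component(s)")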
Abstract: In this paper, we introduce the Fairness GAN, an approach for generating a dataset that is plausibly similar to a given multimedia dataset, but is more fair with respect to protected attributes in allocative decision making. We propose a novel auxiliary classifier GAN that strives for demographic parity or equality of opportunity and show empirical results on several datasets, including the CelebFaces Attributes (CelebA) dataset, the Quick, Draw! dataset, and a dataset of soccer player images and the offenses they were called for. The proposed formulation is well-suited to absorbing unlabeled data; we leverage this to augment the soccer dataset with the much larger CelebA dataset. The methodology tends to improve demographic parity and equality of opportunity while generating plausible images.
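For reference, the two group-fairness criteria the Fairness GAN targets can be measured on predictions as below; these simplified gap functions are stand-ins for exposition and are not the paper's auxiliary classifier GAN loss terms.

# Standard group-fairness gaps for a binary predictor and binary protected attribute.
import numpy as np

def demographic_parity_gap(y_hat, a):
    """|P(y_hat=1 | a=0) - P(y_hat=1 | a=1)| for binary predictions."""
    return abs(y_hat[a == 0].mean() - y_hat[a == 1].mean())

def equal_opportunity_gap(y_hat, y_true, a):
    """Same gap restricted to the positive class, i.e., the true positive rates."""
    pos = y_true == 1
    return abs(y_hat[pos & (a == 0)].mean() - y_hat[pos & (a == 1)].mean())

# Toy usage with a deliberately biased predictor.
rng = np.random.default_rng(0)
a = rng.integers(0, 2, 1000)                 # protected attribute
y_true = rng.integers(0, 2, 1000)
y_hat = (rng.random(1000) < 0.4 + 0.2 * a).astype(int)
print(demographic_parity_gap(y_hat, a), equal_opportunity_gap(y_hat, y_true, a))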
Abstract: We consider the Granger causal structure learning problem from time series data. Granger causal algorithms predict a 'Granger causal effect' between two variables by testing whether the prediction error of one variable increases significantly when the other variable is removed from the predictor covariates. Almost all existing Granger causal algorithms condition on a large number of variables (all but two variables) to test for effects between a pair of variables. We propose a new structure learning algorithm called MMPC-p inspired by the well-known MMHC algorithm for non-time series data. We show that under some assumptions, the algorithm provides false discovery rate control. The algorithm is sound and complete when given access to perfect directed information testing oracles. We also outline a novel tester for the linear Gaussian case. We show through our extensive experiments that the MMPC-p algorithm scales to larger problems and has improved statistical power compared to existing state of the art for large sparse graphs. We also apply our algorithm to a global development dataset and validate our findings with subject matter experts.
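The textbook bivariate version of this test can be sketched as follows: fit a restricted autoregression of y on its own lags and a full regression that also includes lags of x, then F-test whether the extra lags significantly reduce the residual error. This is the generic pairwise test, not the MMPC-p algorithm or its directed information tester.

# Classical bivariate Granger test via nested least-squares regressions.
import numpy as np
from scipy import stats

def lag_matrix(v, lags):
    # Row for each time t = lags..n-1; column k holds v[t-1-k].
    return np.column_stack([v[lags - 1 - k : len(v) - 1 - k] for k in range(lags)])

def granger_pvalue(x, y, lags=2):
    """p-value for 'x Granger-causes y' via restricted vs. full AR regression."""
    target = y[lags:]
    restricted = np.column_stack([np.ones(len(target)), lag_matrix(y, lags)])
    full = np.column_stack([restricted, lag_matrix(x, lags)])
    rss_r = np.sum((target - restricted @ np.linalg.lstsq(restricted, target, rcond=None)[0]) ** 2)
    rss_f = np.sum((target - full @ np.linalg.lstsq(full, target, rcond=None)[0]) ** 2)
    dfn, dfd = lags, len(target) - full.shape[1]
    f_stat = ((rss_r - rss_f) / dfn) / (rss_f / dfd)
    return stats.f.sf(f_stat, dfn, dfd)

# Toy usage: x drives y with a one-step delay.
rng = np.random.default_rng(0)
x = rng.normal(size=500)
y = np.zeros(500)
for t in range(1, 500):
    y[t] = 0.5 * y[t - 1] + 0.8 * x[t - 1] + 0.1 * rng.normal()
print(granger_pvalue(x, y), granger_pvalue(y, x))  # expect tiny p for x->y, large for y->x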
Abstract: This essay examines how what is considered to be artificial intelligence (AI) has changed over time and come to intersect with the expertise of the author. Initially, AI developed on a separate trajectory, both topically and institutionally, from pattern recognition, neural information processing, decision and control systems, and allied topics by focusing on symbolic systems within computer science departments rather than on continuous systems in electrical engineering departments. The separate evolutions continued throughout the author's lifetime, with some crossover in reinforcement learning and graphical models, but were shocked into converging by the virality of deep learning, thus making an electrical engineer into an AI researcher. Now that this convergence has happened, opportunity exists to pursue an agenda that combines learning and reasoning bridged by interpretable machine learning models.
Abstract: Electroencephalography (EEG) is an extensively used and well-studied technique in the field of medical diagnostics and treatment for brain disorders, including epilepsy, migraines, and tumors. The analysis and interpretation of EEGs require physicians to have specialized training, which is not common even among most doctors in the developed world, let alone the developing world where physician shortages plague society. This problem can be addressed either by tele-EEG, which relies on remote EEG analysis by experts, or by local computer processing of EEGs. However, both of these options are prohibitively expensive, and the second option requires abundant computing resources and infrastructure, which is another concern in developing countries where there are resource constraints on capital and computing infrastructure. In this work, we present a cloud-based deep neural network approach to provide decision support for non-specialist physicians in EEG analysis and interpretation. Named 'neurology-as-a-service,' the approach requires almost no manual intervention in feature engineering and in the selection of an optimal architecture and hyperparameters of the neural network. In this study, we deploy a pipeline that includes moving EEG data to the cloud and getting optimal models for various classification tasks. Our initial prototype has been tested only in developed world environments to date, but our intention is to test it in developing world environments in future work. We demonstrate the performance of our proposed approach using the BCI2000 EEG MMI dataset, on which our service attains 63.4% accuracy for the task of classifying real vs. imaginary activity performed by the subject, which is significantly higher than what is obtained with a shallow approach such as support vector machines.
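As a hedged illustration of the kind of compact network such a service might search over for windowed multi-channel EEG, a minimal Keras model is sketched below; the window length, layer sizes, and hyperparameters are placeholders rather than the architecture actually selected by the paper's automated pipeline, and TensorFlow/Keras is assumed to be installed.

# Placeholder 1D CNN for binary classification of windowed EEG.
import tensorflow as tf

n_samples, n_channels = 640, 64          # e.g., 4 s windows at 160 Hz, 64 electrodes

model = tf.keras.Sequential([
    tf.keras.Input(shape=(n_samples, n_channels)),
    tf.keras.layers.Conv1D(32, kernel_size=7, activation="relu"),
    tf.keras.layers.MaxPooling1D(4),
    tf.keras.layers.Conv1D(64, kernel_size=5, activation="relu"),
    tf.keras.layers.GlobalAveragePooling1D(),
    tf.keras.layers.Dense(64, activation="relu"),
    tf.keras.layers.Dropout(0.5),
    tf.keras.layers.Dense(1, activation="sigmoid"),   # real vs. imagined activity
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
model.summary()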
Abstract: Preserving the privacy of individuals by protecting their sensitive attributes is an important consideration during microdata release. However, it is equally important to preserve the quality or utility of the data for at least some targeted workloads. We propose a novel framework for privacy preservation based on the k-anonymity model that is ideally suited for workloads that require preserving the probability distribution of the quasi-identifier variables in the data. Our framework combines the principles of distribution-preserving quantization and k-member clustering, and we specialize it to two variants that respectively use intra-cluster and Gaussian dithering of cluster centers to achieve distribution preservation. We perform theoretical analysis of the proposed schemes in terms of distribution preservation, and describe their utility in workloads such as covariate shift and transfer learning where such a property is necessary. Using extensive experiments on real-world Medical Expenditure Panel Survey data, we demonstrate the merits of our algorithms over standard k-anonymization for a hallmark health care application where an insurance company wishes to understand the risk in entering a new market. Furthermore, by empirically quantifying the reidentification risk, we also show that the proposed approaches indeed maintain k-anonymity.
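A much-simplified sketch of the Gaussian-dithering idea described above: records are grouped into clusters of k members, and each record is released as its cluster's center plus Gaussian noise matched to the within-cluster spread, so the released marginal distribution stays close to the original. The greedy one-dimensional grouping below is a crude stand-in for proper k-member clustering, and the exact dithering scheme is not the paper's.

# Toy distribution-preserving k-anonymization of two quasi-identifier columns.
import numpy as np

def anonymize(X, k, rng):
    # Assumes len(X) is a multiple of k for brevity.
    order = np.argsort(X[:, 0])        # crude 1-D grouping as a stand-in for k-member clustering
    released = np.empty_like(X, dtype=float)
    for start in range(0, len(X), k):
        idx = order[start:start + k]
        center = X[idx].mean(axis=0)
        spread = X[idx].std(axis=0)
        released[idx] = center + rng.normal(size=X[idx].shape) * spread
    return released

rng = np.random.default_rng(0)
X = rng.normal([30, 50_000], [8, 15_000], size=(1000, 2))   # toy age/income quasi-identifiers
Xk = anonymize(X, k=10, rng=rng)
print(X.mean(axis=0), Xk.mean(axis=0))   # released data roughly preserves the distribution
print(X.std(axis=0), Xk.std(axis=0))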
Abstract: Machine learning algorithms increasingly influence our decisions and interact with us in all parts of our daily lives. Therefore, just as we consider the safety of power plants, highways, and a variety of other engineered socio-technical systems, we must also take into account the safety of systems involving machine learning. Heretofore, the definition of safety has not been formalized in a machine learning context. In this paper, we do so by defining machine learning safety in terms of risk, epistemic uncertainty, and the harm incurred by unwanted outcomes. We then use this definition to examine safety in a broad range of applications in cyber-physical systems, decision sciences, and data products. We find that the foundational principle of modern statistical machine learning, empirical risk minimization, is not always a sufficient objective. Finally, we discuss how four different categories of strategies for achieving safety in engineering, namely inherently safe design, safety reserves, safe fail, and procedural safeguards, can be mapped to a machine learning context. We then discuss example techniques that can be adopted in each category, such as considering interpretability and causality of predictive models, objective functions beyond expected prediction accuracy, human involvement for labeling difficult or rare examples, and user experience design of software and open data.
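For reference, empirical risk minimization, mentioned above as the foundational principle, chooses the hypothesis that minimizes the average training loss as a surrogate for the expected (population) risk:

\[ R(h) = \mathbb{E}_{(X,Y) \sim P}\big[L(h(X), Y)\big], \qquad \hat{R}_n(h) = \frac{1}{n}\sum_{i=1}^{n} L\big(h(x_i), y_i\big), \qquad \hat{h} = \arg\min_{h \in \mathcal{H}} \hat{R}_n(h). \]

Because this objective averages a fixed loss L under an assumed distribution P, it does not by itself account for epistemic uncertainty about P or for unwanted outcomes whose harm far exceeds their loss value, which is the sense in which the abstract argues that empirical risk minimization alone is not always sufficient for safety.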