Mikhail Yurochkin

There is no trade-off: enforcing fairness can improve accuracy

Nov 06, 2020

Subha Maity, Debarghya Mukherjee, Mikhail Yurochkin, Yuekai Sun

Abstract: One of the main barriers to the broader adoption of algorithmic fairness in machine learning is the trade-off between fairness and performance of ML models: many practitioners are unwilling to sacrifice the performance of their ML model for fairness. In this paper, we show that this trade-off may not be necessary. If the algorithmic biases in an ML model are due to sampling biases in the training data, then enforcing algorithmic fairness may improve the performance of the ML model on unbiased test data. We study conditions under which enforcing algorithmic fairness helps practitioners learn the Bayes decision rule for (unbiased) test data from biased training data. We also demonstrate the practical implications of our theoretical results in real-world ML tasks.

Online Semi-Supervised Learning with Bandit Feedback

Oct 23, 2020

Sohini Upadhyay, Mikhail Yurochkin, Mayank Agarwal, Yasaman Khazaeni, Djallel Bouneffouf

Abstract: We formulate a new problem at the intersection of semi-supervised learning and contextual bandits, motivated by several applications including clinical trials and ad recommendations. We demonstrate how Graph Convolutional Network (GCN), a semi-supervised learning approach, can be adjusted to the new problem formulation. We also propose a variant of the linear contextual bandit with semi-supervised missing rewards imputation. We then take the best of both approaches to develop multi-GCN embedded contextual bandit. Our algorithms are verified on several real-world datasets.

Continuous Regularized Wasserstein Barycenters

Aug 28, 2020

Lingxiao Li, Aude Genevay, Mikhail Yurochkin, Justin Solomon

Abstract: Wasserstein barycenters provide a geometrically meaningful way to aggregate probability distributions, built on the theory of optimal transport. They are difficult to compute in practice, however, leading previous work to restrict their supports to finite sets of points. Leveraging a new dual formulation for the regularized Wasserstein barycenter problem, we introduce a stochastic algorithm that constructs a continuous approximation of the barycenter. We establish strong duality and use the corresponding primal-dual relationship to parametrize the barycenter implicitly using the dual potentials of regularized transport problems. The resulting problem can be solved with stochastic gradient descent, which yields an efficient online algorithm to approximate the barycenter of continuous distributions given sample access. We demonstrate the effectiveness of our approach and compare against previous work on synthetic examples and real-world applications.
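
To make the notion of a Wasserstein barycenter concrete, here is a minimal sketch of the one case with a simple closed form: unregularized 2-Wasserstein barycenters of one-dimensional empirical distributions with the same number of equally weighted points. In that setting the barycenter's points are the weighted averages of the sorted samples. This is not the paper's algorithm, which targets the regularized problem for continuous distributions; the function name `barycenter_1d` is hypothetical.

```python
def barycenter_1d(samples, weights=None):
    """W2 barycenter of 1-D empirical distributions with equal sample counts.

    `samples` is a list of equal-length lists of floats. Each list is treated
    as a uniform empirical distribution. Illustrative sketch only.
    """
    k = len(samples)
    n = len(samples[0])
    if any(len(s) != n for s in samples):
        raise ValueError("all sample sets must have the same size")
    if weights is None:
        weights = [1.0 / k] * k  # equal weight for every input distribution

    # In 1-D, optimal transport pairs points by rank, so sorting aligns them.
    sorted_samples = [sorted(s) for s in samples]

    # Average the i-th smallest points across all input distributions.
    return [
        sum(w * s[i] for w, s in zip(weights, sorted_samples))
        for i in range(n)
    ]


points = barycenter_1d([[0.0, 2.0, 1.0], [10.0, 12.0, 11.0]])
print(points)
```

Sorting is what makes the one-dimensional case easy; in higher dimensions no such rank alignment exists, which is part of why the general problem is hard to compute.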

IBM Federated Learning: an Enterprise Framework White Paper V0.1

Jul 22, 2020

Heiko Ludwig, Nathalie Baracaldo, Gegi Thomas, Yi Zhou, Ali Anwar, Shashank Rajamoni, Yuya Ong, Jayaram Radhakrishnan, Ashish Verma, Mathieu Sinn(+14 more)

Abstract: Federated Learning (FL) is an approach to conduct machine learning without centralizing training data in a single place, for reasons of privacy, confidentiality or data volume. However, solving federated machine learning problems raises issues above and beyond those of centralized machine learning. These issues include setting up communication infrastructure between parties, coordinating the learning process, integrating party results, understanding the characteristics of the training data sets of different participating parties, handling data heterogeneity, and operating with the absence of a verification data set. IBM Federated Learning provides infrastructure and coordination for federated learning. Data scientists can design and run federated learning jobs based on existing, centralized machine learning models and can provide high-level instructions on how to run the federation. The framework applies to Deep Neural Networks as well as "traditional" approaches for the most common machine learning libraries. IBM Federated Learning enables data scientists to expand their scope from centralized to federated machine learning, minimizing the learning curve at the outset while also providing the flexibility to deploy to different compute environments and design custom fusion algorithms.

* 17 pages

Model Fusion with Kullback--Leibler Divergence

Jul 13, 2020

Sebastian Claici, Mikhail Yurochkin, Soumya Ghosh, Justin Solomon

Abstract: We propose a method to fuse posterior distributions learned from heterogeneous datasets. Our algorithm relies on a mean field assumption for both the fused model and the individual dataset posteriors and proceeds using a simple assign-and-average approach. The components of the dataset posteriors are assigned to the proposed global model components by solving a regularized variant of the assignment problem. The global components are then updated based on these assignments by their mean under a KL divergence. For exponential family variational distributions, our formulation leads to an efficient non-parametric algorithm for computing the fused model. Our algorithm is easy to describe and implement, efficient, and competitive with state-of-the-art on motion capture analysis, topic modeling, and federated learning of Bayesian neural networks.

* ICML 2020

SenSeI: Sensitive Set Invariance for Enforcing Individual Fairness

Jun 25, 2020

Mikhail Yurochkin, Yuekai Sun

Abstract: In this paper, we cast fair machine learning as invariant machine learning. We first formulate a version of individual fairness that enforces invariance on certain sensitive sets. We then design a transport-based regularizer that enforces this version of individual fairness and develop an algorithm to minimize the regularizer efficiently. Our theoretical results guarantee the proposed approach trains certifiably fair ML models. Finally, in the experimental studies we demonstrate improved fairness metrics in comparison to several recent fair training procedures on three ML tasks that are susceptible to algorithmic bias.

Two Simple Ways to Learn Individual Fairness Metrics from Data

Jun 19, 2020

Debarghya Mukherjee, Mikhail Yurochkin, Moulinath Banerjee, Yuekai Sun

Abstract: Individual fairness is an intuitive definition of algorithmic fairness that addresses some of the drawbacks of group fairness. Despite its benefits, it depends on a task-specific fair metric that encodes our intuition of what is fair and unfair for the ML task at hand, and the lack of a widely accepted fair metric for many ML tasks is the main barrier to broader adoption of individual fairness. In this paper, we present two simple ways to learn fair metrics from a variety of data types. We show empirically that fair training with the learned metrics leads to improved fairness on three machine learning tasks susceptible to gender and racial biases. We also provide theoretical guarantees on the statistical performance of both approaches.

* To appear in ICML 2020

Auditing ML Models for Individual Bias and Unfairness

Mar 11, 2020

Songkai Xue, Mikhail Yurochkin, Yuekai Sun

Abstract: We consider the task of auditing ML models for individual bias/unfairness. We formalize the task in an optimization problem and develop a suite of inferential tools for the optimal value. Our tools permit us to obtain asymptotic confidence intervals and hypothesis tests that cover the target/control the Type I error rate exactly. To demonstrate the utility of our tools, we use them to reveal the gender and racial biases in Northpointe's COMPAS recidivism prediction instrument.

* In Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS) 2020

Federated Learning with Matched Averaging

Feb 15, 2020

Hongyi Wang, Mikhail Yurochkin, Yuekai Sun, Dimitris Papailiopoulos, Yasaman Khazaeni

Abstract: Federated learning allows edge devices to collaboratively learn a shared model while keeping the training data on device, decoupling the ability to do model training from the need to store the data in the cloud. We propose the Federated Matched Averaging (FedMA) algorithm, designed for federated learning of modern neural network architectures, e.g., convolutional neural networks (CNNs) and LSTMs. FedMA constructs the shared global model in a layer-wise manner by matching and averaging hidden elements (i.e., channels for convolution layers; hidden states for LSTM; neurons for fully connected layers) with similar feature extraction signatures. Our experiments indicate that FedMA not only outperforms popular state-of-the-art federated learning algorithms on deep CNN and LSTM architectures trained on real-world datasets, but also reduces the overall communication burden.

* Accepted by ICLR 2020
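
The idea of matching hidden elements before averaging can be illustrated with a toy sketch. The code below aligns the neurons of one fully connected layer from two clients by searching over permutations for the smallest total squared distance, then averages the aligned weights. This is an assumption-laden simplification, not the paper's matching procedure: the brute-force search only works for a handful of neurons, both clients are assumed to have the same layer width, and the function names are hypothetical.

```python
from itertools import permutations


def sq_dist(u, v):
    """Squared Euclidean distance between two weight vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def match_and_average(neurons_a, neurons_b):
    """Align client B's neurons to client A's, then average pairwise.

    Each argument is a list of weight vectors, one per neuron.
    Returns (permutation, averaged_neurons), where permutation[i] is the
    index of the client-B neuron matched to client-A neuron i.
    """
    n = len(neurons_a)
    if len(neurons_b) != n:
        raise ValueError("this sketch assumes equal layer widths")

    # Exhaustive search over all n! assignments (toy sizes only).
    best_perm = min(
        permutations(range(n)),
        key=lambda p: sum(
            sq_dist(neurons_a[i], neurons_b[p[i]]) for i in range(n)
        ),
    )

    averaged = [
        [(a + b) / 2 for a, b in zip(neurons_a[i], neurons_b[best_perm[i]])]
        for i in range(n)
    ]
    return list(best_perm), averaged


# Two clients learned the same two neurons, stored in opposite order.
client_a = [[1.0, 0.0], [0.0, 1.0]]
client_b = [[0.0, 1.0], [1.0, 0.0]]
perm, merged = match_and_average(client_a, client_b)
print(perm, merged)
```

Averaging `client_a` and `client_b` coordinate by coordinate without matching would give `[[0.5, 0.5], [0.5, 0.5]]`, erasing both neurons; aligning first recovers them.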

Alleviating Label Switching with Optimal Transport

Nov 10, 2019

Pierre Monteiller, Sebastian Claici, Edward Chien, Farzaneh Mirzazadeh, Justin Solomon, Mikhail Yurochkin

Abstract: Label switching is a phenomenon arising in mixture model posterior inference that prevents one from meaningfully assessing posterior statistics using standard Monte Carlo procedures. This issue arises due to invariance of the posterior under actions of a group; for example, permuting the ordering of mixture components has no effect on the likelihood. We propose a resolution to label switching that leverages machinery from optimal transport. Our algorithm efficiently computes posterior statistics in the quotient space of the symmetry group. We give conditions under which there is a meaningful solution to label switching and demonstrate advantages over alternative approaches on simulated and real data.

* 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada
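
The permutation invariance behind label switching is easy to check numerically. The sketch below, with made-up data and parameters, evaluates the log-likelihood of a two-component Gaussian mixture before and after swapping the component order, and then shows how naively averaging two hypothetical posterior draws that differ only in labeling gives a meaningless mean. It only illustrates the problem; the paper's optimal-transport resolution is not implemented here.

```python
import math


def normal_pdf(x, mean, std):
    """Density of a univariate normal distribution at x."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def mixture_log_likelihood(data, weights, means, stds):
    """Log-likelihood of data under a univariate Gaussian mixture."""
    total = 0.0
    for x in data:
        density = sum(
            w * normal_pdf(x, m, s) for w, m, s in zip(weights, means, stds)
        )
        total += math.log(density)
    return total


def permute(values, order):
    """Reorder a list of per-component parameters."""
    return [values[i] for i in order]


# Illustrative data and parameters (assumptions, not from the paper).
data = [-2.1, -1.9, 2.0, 2.2]
weights, means, stds = [0.3, 0.7], [-2.0, 2.0], [1.0, 0.5]
swap = [1, 0]

original = mixture_log_likelihood(data, weights, means, stds)
swapped = mixture_log_likelihood(
    data, permute(weights, swap), permute(means, swap), permute(stds, swap)
)
print(math.isclose(original, swapped))  # relabeling leaves the likelihood unchanged

# Two hypothetical posterior draws of the component means that describe the
# same mixture under different labelings.
draws = [[-2.0, 2.0], [2.0, -2.0]]
naive_mean = [sum(d[k] for d in draws) / len(draws) for k in range(2)]
print(naive_mean)  # both coordinates collapse to the midpoint
```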
