Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Latifur Khan

Adaptive Fairness-Aware Online Meta-Learning for Changing Environments

May 26, 2022

Chen Zhao, Feng Mi, Xintao Wu, Kai Jiang, Latifur Khan, Feng Chen

Figure 1 for Adaptive Fairness-Aware Online Meta-Learning for Changing Environments

Figure 2 for Adaptive Fairness-Aware Online Meta-Learning for Changing Environments

Figure 3 for Adaptive Fairness-Aware Online Meta-Learning for Changing Environments

Figure 4 for Adaptive Fairness-Aware Online Meta-Learning for Changing Environments

Abstract:The fairness-aware online learning framework has arisen as a powerful tool for the continual lifelong learning setting. The goal for the learner is to sequentially learn new tasks where they come one after another over time and the learner ensures the statistic parity of the new coming task across different protected sub-populations (e.g. race and gender). A major drawback of existing methods is that they make heavy use of the i.i.d assumption for data and hence provide static regret analysis for the framework. However, low static regret cannot imply a good performance in changing environments where tasks are sampled from heterogeneous distributions. To address the fairness-aware online learning problem in changing environments, in this paper, we first construct a novel regret metric FairSAR by adding long-term fairness constraints onto a strongly adapted loss regret. Furthermore, to determine a good model parameter at each round, we propose a novel adaptive fairness-aware online meta-learning algorithm, namely FairSAOML, which is able to adapt to changing environments in both bias control and model precision. The problem is formulated in the form of a bi-level convex-concave optimization with respect to the model's primal and dual parameters that are associated with the model's accuracy and fairness, respectively. The theoretic analysis provides sub-linear upper bounds for both loss regret and violation of cumulative fairness constraints. Our experimental evaluation on different real-world datasets with settings of changing environments suggests that the proposed FairSAOML significantly outperforms alternatives based on the best prior online learning approaches.

* Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2022. arXiv admin note: text overlap with arXiv:2108.09435

Via

Access Paper or Ask Questions

Uncertainty-Aware Reliable Text Classification

Jul 15, 2021

Yibo Hu, Latifur Khan

Figure 1 for Uncertainty-Aware Reliable Text Classification

Figure 2 for Uncertainty-Aware Reliable Text Classification

Figure 3 for Uncertainty-Aware Reliable Text Classification

Figure 4 for Uncertainty-Aware Reliable Text Classification

Abstract:Deep neural networks have significantly contributed to the success in predictive accuracy for classification tasks. However, they tend to make over-confident predictions in real-world settings, where domain shifting and out-of-distribution (OOD) examples exist. Most research on uncertainty estimation focuses on computer vision because it provides visual validation on uncertainty quality. However, few have been presented in the natural language process domain. Unlike Bayesian methods that indirectly infer uncertainty through weight uncertainties, current evidential uncertainty-based methods explicitly model the uncertainty of class probabilities through subjective opinions. They further consider inherent uncertainty in data with different root causes, vacuity (i.e., uncertainty due to a lack of evidence) and dissonance (i.e., uncertainty due to conflicting evidence). In our paper, we firstly apply evidential uncertainty in OOD detection for text classification tasks. We propose an inexpensive framework that adopts both auxiliary outliers and pseudo off-manifold samples to train the model with prior knowledge of a certain class, which has high vacuity for OOD samples. Extensive empirical experiments demonstrate that our model based on evidential uncertainty outperforms other counterparts for detecting OOD examples. Our approach can be easily deployed to traditional recurrent neural networks and fine-tuned pre-trained transformers.

* KDD 2021

Via

Access Paper or Ask Questions

Towards Self-Adaptive Metric Learning On the Fly

Apr 03, 2021

Yang Gao, Yi-Fan Li, Swarup Chandra, Latifur Khan, Bhavani Thuraisingham

Figure 1 for Towards Self-Adaptive Metric Learning On the Fly

Figure 2 for Towards Self-Adaptive Metric Learning On the Fly

Figure 3 for Towards Self-Adaptive Metric Learning On the Fly

Figure 4 for Towards Self-Adaptive Metric Learning On the Fly

Abstract:Good quality similarity metrics can significantly facilitate the performance of many large-scale, real-world applications. Existing studies have proposed various solutions to learn a Mahalanobis or bilinear metric in an online fashion by either restricting distances between similar (dissimilar) pairs to be smaller (larger) than a given lower (upper) bound or requiring similar instances to be separated from dissimilar instances with a given margin. However, these linear metrics learned by leveraging fixed bounds or margins may not perform well in real-world applications, especially when data distributions are complex. We aim to address the open challenge of "Online Adaptive Metric Learning" (OAML) for learning adaptive metric functions on the fly. Unlike traditional online metric learning methods, OAML is significantly more challenging since the learned metric could be non-linear and the model has to be self-adaptive as more instances are observed. In this paper, we present a new online metric learning framework that attempts to tackle the challenge by learning an ANN-based metric with adaptive model complexity from a stream of constraints. In particular, we propose a novel Adaptive-Bound Triplet Loss (ABTL) to effectively utilize the input constraints and present a novel Adaptive Hedge Update (AHU) method for online updating the model parameters. We empirically validate the effectiveness and efficacy of our framework on various applications such as real-world image classification, facial verification, and image retrieval.

* Accepted by WWW 2019 (Long Paper, Oral)

Via

Access Paper or Ask Questions

SetConv: A New Approach for Learning from Imbalanced Data

Apr 03, 2021

Yang Gao, Yi-Fan Li, Yu Lin, Charu Aggarwal, Latifur Khan

Figure 1 for SetConv: A New Approach for Learning from Imbalanced Data

Figure 2 for SetConv: A New Approach for Learning from Imbalanced Data

Figure 3 for SetConv: A New Approach for Learning from Imbalanced Data

Figure 4 for SetConv: A New Approach for Learning from Imbalanced Data

Abstract:For many real-world classification problems, e.g., sentiment classification, most existing machine learning methods are biased towards the majority class when the Imbalance Ratio (IR) is high. To address this problem, we propose a set convolution (SetConv) operation and an episodic training strategy to extract a single representative for each class, so that classifiers can later be trained on a balanced class distribution. We prove that our proposed algorithm is permutation-invariant despite the order of inputs, and experiments on multiple large-scale benchmark text datasets show the superiority of our proposed framework when compared to other SOTA methods.

* Accepted by EMNLP 2020 (11 pages, 9 figures)

Via

Access Paper or Ask Questions

PDFM: A Primal-Dual Fairness-Aware Framework for Meta-learning

Oct 05, 2020

Chen Zhao, Feng Chen, Zhuoyi Wang, Latifur Khan

Figure 1 for PDFM: A Primal-Dual Fairness-Aware Framework for Meta-learning

Figure 2 for PDFM: A Primal-Dual Fairness-Aware Framework for Meta-learning

Figure 3 for PDFM: A Primal-Dual Fairness-Aware Framework for Meta-learning

Figure 4 for PDFM: A Primal-Dual Fairness-Aware Framework for Meta-learning

Abstract:The problem of learning to generalize to unseen classes during training, known as few-shot classification, has attracted considerable attention. Initialization based methods, such as the gradient-based model agnostic meta-learning (MAML), tackle the few-shot learning problem by "learning to fine-tune". The goal of these approaches is to learn proper model initialization, so that the classifiers for new classes can be learned from a few labeled examples with a small number of gradient update steps. Few shot meta-learning is well-known with its fast-adapted capability and accuracy generalization onto unseen tasks. Learning fairly with unbiased outcomes is another significant hallmark of human intelligence, which is rarely touched in few-shot meta-learning. In this work, we propose a Primal-Dual Fair Meta-learning framework, namely PDFM, which learns to train fair machine learning models using only a few examples based on data from related tasks. The key idea is to learn a good initialization of a fair model's primal and dual parameters so that it can adapt to a new fair learning task via a few gradient update steps. Instead of manually tuning the dual parameters as hyperparameters via a grid search, PDFM optimizes the initialization of the primal and dual parameters jointly for fair meta-learning via a subgradient primal-dual approach. We further instantiate examples of bias controlling using mean difference and decision boundary covariance as fairness constraints to each task for supervised regression and classification, respectively. We demonstrate the versatility of our proposed approach by applying our approach to various real-world datasets. Our experiments show substantial improvements over the best prior work for this setting.

* submitted to IEEE Transactions on Knowledge and Data Engineering (TKDE)

Via

Access Paper or Ask Questions

Deep Learning on Knowledge Graph for Recommender System: A Survey

Mar 25, 2020

Yang Gao, Yi-Fan Li, Yu Lin, Hang Gao, Latifur Khan

Figure 1 for Deep Learning on Knowledge Graph for Recommender System: A Survey

Figure 2 for Deep Learning on Knowledge Graph for Recommender System: A Survey

Figure 3 for Deep Learning on Knowledge Graph for Recommender System: A Survey

Figure 4 for Deep Learning on Knowledge Graph for Recommender System: A Survey

Abstract:Recent advances in research have demonstrated the effectiveness of knowledge graphs (KG) in providing valuable external knowledge to improve recommendation systems (RS). A knowledge graph is capable of encoding high-order relations that connect two objects with one or multiple related attributes. With the help of the emerging Graph Neural Networks (GNN), it is possible to extract both object characteristics and relations from KG, which is an essential factor for successful recommendations. In this paper, we provide a comprehensive survey of the GNN-based knowledge-aware deep recommender systems. Specifically, we discuss the state-of-the-art frameworks with a focus on their core component, i.e., the graph embedding module, and how they address practical recommendation issues such as scalability, cold-start and so on. We further summarize the commonly-used benchmark datasets, evaluation metrics as well as open-source codes. Finally, we conclude the survey and propose potential research directions in this rapidly growing field.

* 6 figures, 5 tables

Via

Access Paper or Ask Questions

Co-Representation Learning For Classification and Novel Class Detection via Deep Networks

Nov 13, 2018

Zhuoyi Wang, Zelun Kong, Hemeng Tao, Swarup Chandra, Latifur Khan

Figure 1 for Co-Representation Learning For Classification and Novel Class Detection via Deep Networks

Figure 2 for Co-Representation Learning For Classification and Novel Class Detection via Deep Networks

Figure 3 for Co-Representation Learning For Classification and Novel Class Detection via Deep Networks

Figure 4 for Co-Representation Learning For Classification and Novel Class Detection via Deep Networks

Abstract:Deep Neural Network (DNN) has been largely demonstrated to be effective for closed-world classification problems where the total number of classes are known in advance. However, when the total number of classes that may occur during test time is unknown, DNNs notorious fail, i.e., DNN will make incorrect label prediction on instances from novel or unseen classes. This severely limits its utility in many real-world web applications, particularly when data occurs as a continuous stream. In this paper, we focus on addressing this key challenge by developing a two-channel DNN based co-representation learning framework that not only predicts instances from known classes, but also detects and adapts to the occurrence of novel class instances over time. Concretely, we propose a metric learning method using pairwise-constraint loss (PCL) function to learn a feature representation where intra-class compactness and inter-class separation is achieved. Moreover, we apply the temperature scaling scheme on the softmax function to replace traditional softmax output and design an open-world classifier. Our extensive empirical evaluation on benchmark datasets demonstrates the effectiveness of our framework compared to other competing techniques.

* 7 pages

Via

Access Paper or Ask Questions

Adaptive Image Stream Classification via Convolutional Neural Network with Intrinsic Similarity Metrics

Sep 27, 2018

Yang Gao, Swarup Chandra, Zhuoyi Wang, Latifur Khan

Figure 1 for Adaptive Image Stream Classification via Convolutional Neural Network with Intrinsic Similarity Metrics

Figure 2 for Adaptive Image Stream Classification via Convolutional Neural Network with Intrinsic Similarity Metrics

Figure 3 for Adaptive Image Stream Classification via Convolutional Neural Network with Intrinsic Similarity Metrics

Figure 4 for Adaptive Image Stream Classification via Convolutional Neural Network with Intrinsic Similarity Metrics

Abstract:When performing data classification over a stream of continuously occurring instances, a key challenge is to develop an open-world classifier that anticipates instances from an unknown class. Studies addressing this problem, typically called novel class detection, have considered classification methods that reactively adapt to such changes along the stream. Importantly, they rely on the property of cohesion and separation among instances in feature space. Instances belonging to the same class are assumed to be closer to each other (cohesion) than those belonging to different classes (separation). Unfortunately, this assumption may not have large support when dealing with high dimensional data such as images. In this paper, we address this key challenge by proposing a semisupervised multi-task learning framework called CSIM which aims to intrinsically search for a latent space suitable for detecting labels of instances from both known and unknown classes. Particularly, we utilize a convolution neural network layer that aids in the learning of a latent feature space suitable for novel class detection. We empirically measure the performance of CSIM over multiple realworld image datasets and demonstrate its superiority by comparing its performance with existing semi-supervised methods.

* 10 pages; KDD'18 Deep Learning Day, August 2018, London, UK

Via

Access Paper or Ask Questions

Cause Identification from Aviation Safety Incident Reports via Weakly Supervised Semantic Lexicon Construction

Jan 16, 2014

Muhammad Arshad Ul Abedin, Vincent Ng, Latifur Khan

Figure 1 for Cause Identification from Aviation Safety Incident Reports via Weakly Supervised Semantic Lexicon Construction

Figure 2 for Cause Identification from Aviation Safety Incident Reports via Weakly Supervised Semantic Lexicon Construction

Figure 3 for Cause Identification from Aviation Safety Incident Reports via Weakly Supervised Semantic Lexicon Construction

Figure 4 for Cause Identification from Aviation Safety Incident Reports via Weakly Supervised Semantic Lexicon Construction

Abstract:The Aviation Safety Reporting System collects voluntarily submitted reports on aviation safety incidents to facilitate research work aiming to reduce such incidents. To effectively reduce these incidents, it is vital to accurately identify why these incidents occurred. More precisely, given a set of possible causes, or shaping factors, this task of cause identification involves identifying all and only those shaping factors that are responsible for the incidents described in a report. We investigate two approaches to cause identification. Both approaches exploit information provided by a semantic lexicon, which is automatically constructed via Thelen and Riloffs Basilisk framework augmented with our linguistic and algorithmic modifications. The first approach labels a report using a simple heuristic, which looks for the words and phrases acquired during the semantic lexicon learning process in the report. The second approach recasts cause identification as a text classification problem, employing supervised and transductive text classification algorithms to learn models from incident reports labeled with shaping factors and using the models to label unseen reports. Our experiments show that both the heuristic-based approach and the learning-based approach (when given sufficient training data) outperform the baseline system significantly.

* Journal Of Artificial Intelligence Research, Volume 38, pages 569-631, 2010

Via

Access Paper or Ask Questions