Guangquan Zhang

Multi-class Classification with Fuzzy-feature Observations: Theory and Algorithms

Jun 09, 2022

Guangzhi Ma, Jie Lu, Feng Liu, Zhen Fang, Guangquan Zhang

Abstract: Theoretical analysis of multi-class classification has shown that existing methods can train a classifier with high accuracy on the test set when the instances in the training and test sets are precise and drawn from the same distribution, and when enough training instances can be collected. However, one limitation remains unsolved: how to improve classification accuracy when only imprecise observations are available. Hence, in this paper, we propose a novel framework to address a new realistic problem called multi-class classification with imprecise observations (MCIMO), where we need to train a classifier with fuzzy-feature observations. First, we give a theoretical analysis of the MCIMO problem based on fuzzy Rademacher complexity. Then, two practical algorithms based on support vector machines and neural networks are constructed to solve the proposed problem. Experiments on both synthetic and real-world datasets verify the rationality of our theoretical analysis and the efficacy of the proposed algorithms.

* This article has been accepted by IEEE Transactions on Cybernetics on June 4, 2022

Bayesian Transfer Learning: An Overview of Probabilistic Graphical Models for Transfer Learning

Sep 27, 2021

Junyu Xuan, Jie Lu, Guangquan Zhang

Abstract: Transfer learning, which extracts transferable knowledge from the source domain(s) and reuses this knowledge in the target domain, has become a research area of great interest in the field of artificial intelligence. Probabilistic graphical models (PGMs) have been recognized as a powerful tool for modeling complex systems with many advantages, e.g., the ability to handle uncertainty and good interpretability. Considering the success of these two research areas, it seems natural to apply PGMs to transfer learning. However, although there are already some excellent PGMs specific to transfer learning in the literature, the potential of PGMs for this problem is still grossly underestimated. This paper aims to boost the development of PGMs for transfer learning by 1) examining the pilot studies on PGMs specific to transfer learning, i.e., analyzing and summarizing the existing mechanisms particularly designed for knowledge transfer; 2) discussing examples of real-world transfer problems where existing PGMs have been successfully applied; and 3) exploring several potential research directions on transfer learning using PGMs.

Deep Bayesian Estimation for Dynamic Treatment Regimes with a Long Follow-up Time

Sep 20, 2021

Adi Lin, Jie Lu, Junyu Xuan, Fujin Zhu, Guangquan Zhang

Abstract: Causal effect estimation for dynamic treatment regimes (DTRs) contributes to sequential decision making. However, censoring and time-dependent confounding under DTRs are challenging: the amount of observational data declines over time as the sample size shrinks, while the feature dimension increases over time. Long-term follow-up compounds these challenges. Another challenge is the highly complex relationships between confounders, treatments, and outcomes, which cause the traditional and commonly used linear methods to fail. We combine outcome regression models with treatment models for high-dimensional features using uncensored subjects that are small in sample size, and we fit deep Bayesian models for the outcome regression models to reveal the complex relationships between confounders, treatments, and outcomes. The developed deep Bayesian models can also model uncertainty and output the prediction variance, which is essential for safety-aware applications such as self-driving cars and medical treatment design. The experimental results on medical simulations of HIV treatment show the ability of the proposed method to obtain stable and accurate dynamic causal effect estimation from observational data, especially with long-term follow-up. Our technique provides practical guidance for sequential decision making and policy-making.

Learning Bounds for Open-Set Learning

Jun 30, 2021

Zhen Fang, Jie Lu, Anjin Liu, Feng Liu, Guangquan Zhang

Abstract: Traditional supervised learning aims to train a classifier in the closed-set world, where training and test samples share the same label space. In this paper, we target a more challenging and realistic setting: open-set learning (OSL), where there exist test samples from classes that are unseen during training. Although researchers have designed many methods from the algorithmic perspective, few methods provide generalization guarantees on their ability to achieve consistent performance on different training samples drawn from the same distribution. Motivated by transfer learning and probably approximately correct (PAC) theory, we make a bold attempt to study OSL by proving its generalization error: given training samples of size n, the estimation error approaches order O_p(1/\sqrt{n}). This is the first study to provide a generalization bound for OSL, which we do by theoretically investigating the risk of the target classifier on unknown classes. Based on our theory, a novel algorithm called auxiliary open-set risk (AOSR) is proposed to address the OSL problem. Experiments verify the efficacy of AOSR. The code is available at github.com/Anjin-Liu/Openset_Learning_AOSR.

* Open-set Learning, Open-set Recognition, Machine Learning Theory

Automatic Learning to Detect Concept Drift

May 04, 2021

Hang Yu, Tianyu Liu, Jie Lu, Guangquan Zhang

Abstract: Many methods have been proposed to detect concept drift, i.e., a change in the distribution of streaming data, because concept drift causes a decrease in the prediction accuracy of algorithms. However, most current detection methods are based on assessing the degree of change in the data distribution and cannot identify the type of concept drift. In this paper, we propose Active Drift Detection with Meta learning (Meta-ADD), a novel framework that learns to classify concept drift by tracking the changing pattern of error rates. Specifically, in the training phase, we extract meta-features based on the error rates of various types of concept drift, after which a meta-detector is developed via a prototypical neural network by representing the various concept drift classes as corresponding prototypes. In the detection phase, the learned meta-detector is fine-tuned to adapt to the corresponding data stream via stream-based active learning. Hence, Meta-ADD uses machine learning to learn to detect concept drifts and identify their types automatically, which can directly support drift understanding. The experimental results verify the effectiveness of Meta-ADD.
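
The idea of detecting drift from the pattern of error rates can be illustrated with a much simpler detector than Meta-ADD. The sketch below is not the paper's method: it follows the spirit of the classical Drift Detection Method (DDM), which monitors the running error rate and flags drift when it rises well above its historical minimum. The class name, the `min_samples` default, and the 2x/3x thresholds are illustrative assumptions, and the detector reports only that a drift occurred, not its type.

```python
import math


class ErrorRateDriftDetector:
    """DDM-style detector (illustrative sketch, not Meta-ADD)."""

    def __init__(self, min_samples=30, warn_level=2.0, drift_level=3.0):
        self.min_samples = min_samples    # assumed warm-up length
        self.warn_level = warn_level      # assumed warning threshold multiplier
        self.drift_level = drift_level    # assumed drift threshold multiplier
        self.reset()

    def reset(self):
        self.n = 0
        self.errors = 0
        self.p_min = math.inf
        self.s_min = math.inf

    def update(self, error):
        """Feed 1 for a misclassified sample, 0 otherwise.

        Returns "stable", "warning", or "drift".
        """
        self.n += 1
        self.errors += int(error)
        p = self.errors / self.n                 # running error rate
        s = math.sqrt(p * (1.0 - p) / self.n)    # its standard deviation
        # Skip the warm-up phase and degenerate cases (p is exactly 0 or 1).
        if self.n < self.min_samples or s == 0.0:
            return "stable"
        # Track the lowest error level seen so far.
        if p + s < self.p_min + self.s_min:
            self.p_min, self.s_min = p, s
        if p + s >= self.p_min + self.drift_level * self.s_min:
            self.reset()                         # start over on the new concept
            return "drift"
        if p + s >= self.p_min + self.warn_level * self.s_min:
            return "warning"
        return "stable"
```

A caller would feed the detector one 0/1 error indicator per arriving sample and retrain or adapt the model when `update` returns `"drift"`.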

PAC-Bayes Bounds for Meta-learning with Data-Dependent Prior

Feb 07, 2021

Tianyu Liu, Jie Lu, Zheng Yan, Guangquan Zhang

Abstract: By leveraging experience from previous tasks, meta-learning algorithms can adapt quickly and effectively when encountering new tasks. However, it is unclear how the generalization property applies to new tasks. Probably approximately correct (PAC) Bayes bound theory provides a theoretical framework to analyze the generalization performance of meta-learning. We derive three novel generalization error bounds for meta-learning based on the PAC-Bayes relative entropy bound. Furthermore, using the empirical risk minimization (ERM) method, a PAC-Bayes bound for meta-learning with a data-dependent prior is developed. Experiments illustrate that the three proposed PAC-Bayes bounds for meta-learning provide competitive generalization guarantees, and that the extended PAC-Bayes bound with a data-dependent prior can converge rapidly.
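
For orientation, the single-task PAC-Bayes relative entropy bound that such results build on can be written as follows. This is the standard textbook form, not one of the paper's three meta-learning bounds, and the notation (prior P, posterior Q, sample S of size n, confidence level \delta, losses in [0, 1]) is assumed here rather than taken from the paper.

```latex
% Standard single-task PAC-Bayes relative entropy (kl) bound; notation is assumed.
% With probability at least 1 - \delta over an i.i.d. sample S of size n \ge 8,
% simultaneously for all posteriors Q:
\[
  \mathrm{kl}\!\left(\hat{L}_S(Q) \,\middle\|\, L(Q)\right)
  \;\le\;
  \frac{\mathrm{KL}(Q \,\|\, P) + \ln\frac{2\sqrt{n}}{\delta}}{n},
\]
% where \hat{L}_S(Q) is the empirical risk, L(Q) the expected risk, and
\[
  \mathrm{kl}(q \,\|\, p) = q \ln\frac{q}{p} + (1 - q)\ln\frac{1 - q}{1 - p}.
\]
```

The prior P must normally be chosen before seeing S; a data-dependent prior, as studied in the abstract above, relaxes that restriction at the cost of a more careful analysis.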

How does the Combined Risk Affect the Performance of Unsupervised Domain Adaptation Approaches?

Dec 30, 2020

Li Zhong, Zhen Fang, Feng Liu, Jie Lu, Bo Yuan, Guangquan Zhang

Abstract: Unsupervised domain adaptation (UDA) aims to train a target classifier with labeled samples from the source domain and unlabeled samples from the target domain. Classical UDA learning bounds show that the target risk is upper bounded by three terms: source risk, distribution discrepancy, and combined risk. Based on the assumption that the combined risk is a small fixed value, methods based on this bound train a target classifier by only minimizing estimators of the source risk and the distribution discrepancy. However, the combined risk may increase when minimizing both estimators, which makes the target risk uncontrollable. Hence the target classifier cannot achieve ideal performance if we fail to control the combined risk. The key challenge in controlling the combined risk lies in the unavailability of labeled samples in the target domain. To address this key challenge, we propose a method named E-MixNet. E-MixNet employs enhanced mixup, a generic vicinal distribution, on the labeled source samples and pseudo-labeled target samples to calculate a proxy of the combined risk. Experiments show that the proxy can effectively curb the increase of the combined risk when minimizing the source risk and distribution discrepancy. Furthermore, we show that if the proxy of the combined risk is added into the loss functions of four representative UDA methods, their performance is also improved.

* 9 pages, 3 figures, Accepted by Association for the Advancement of Artificial Intelligence 2021 (AAAI 2021)
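
As background for the mixup step mentioned in the abstract, here is a minimal sketch of vanilla mixup in NumPy. It is not the enhanced mixup used by E-MixNet, whose details are not given on this page; the function name `mixup_batch` and the default `alpha=0.2` are illustrative assumptions.

```python
import numpy as np


def mixup_batch(x, y_onehot, alpha=0.2, rng=None):
    """Vanilla mixup: convex combinations of randomly paired examples.

    x        : array of shape (batch, ...) with input features
    y_onehot : array of shape (batch, classes) with one-hot or soft labels
    alpha    : Beta distribution parameter (assumed default)
    """
    rng = np.random.default_rng() if rng is None else rng
    lam = rng.beta(alpha, alpha)        # mixing coefficient in [0, 1]
    perm = rng.permutation(len(x))      # partner index for each example
    x_mix = lam * x + (1.0 - lam) * x[perm]
    y_mix = lam * y_onehot + (1.0 - lam) * y_onehot[perm]
    return x_mix, y_mix, lam
```

In a UDA setting along the lines the abstract describes, `y_onehot` would hold true labels for source samples and pseudo-labels for target samples; how the mixed samples enter the combined-risk proxy is specific to the paper and not reproduced here.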

Concept Drift Detection: Dealing with Missing Values via Fuzzy Distance Estimations

Aug 09, 2020

Anjin Liu, Jie Lu, Guangquan Zhang

Abstract: In data streams, the data distribution of arriving observations at different time points may change - a phenomenon called concept drift. While detecting concept drift is a relatively mature area of study, solutions to the uncertainty introduced by observations with missing values have only been studied in isolation. No one has yet explored whether or how these solutions might impact drift detection performance. We, however, believe that data imputation methods may actually increase uncertainty in the data rather than reducing it. We also conjecture that imputation can introduce bias into the process of estimating distribution changes during drift detection, which can make it more difficult to train a learning model. Our idea is to focus on estimating the distance between observations rather than estimating the missing values, and to define membership functions that allocate observations to histogram bins according to the estimation errors. Our solution comprises a novel masked distance learning (MDL) algorithm to reduce the cumulative errors caused by iteratively estimating each missing value in an observation and a fuzzy-weighted frequency (FWF) method for identifying discrepancies in the data distribution. The concept drift detection algorithm proposed in this paper is a single, unified algorithm that can handle missing values, rather than an imputation algorithm combined with a concept drift detection algorithm. Experiments on both synthetic and real-world data sets demonstrate the advantages of this method and show its robustness in detecting drift in data with missing values. These findings reveal that missing values exert a profound impact on concept drift detection, but using fuzzy set theory to model observations can produce more reliable results than imputation.

* Accepted by IEEE Transactions on Fuzzy Systems

Learning from a Complementary-label Source Domain: Theory and Algorithms

Aug 04, 2020

Yiyang Zhang, Feng Liu, Zhen Fang, Bo Yuan, Guangquan Zhang, Jie Lu

Abstract: In unsupervised domain adaptation (UDA), a classifier for the target domain is trained with massive true-label data from the source domain and unlabeled data from the target domain. However, collecting fully-true-label data in the source domain is costly and sometimes impossible. In contrast to a true label, a complementary label specifies a class that a pattern does not belong to, hence collecting complementary labels is less laborious than collecting true labels. Thus, in this paper, we propose a novel setting in which the source domain is composed of complementary-label data, and a theoretical bound for it is first proved. We consider two cases of this setting: in one, the source domain only contains complementary-label data (completely complementary unsupervised domain adaptation, CC-UDA); in the other, the source domain has plenty of complementary-label data and a small amount of true-label data (partly complementary unsupervised domain adaptation, PC-UDA). To this end, a complementary label adversarial network (CLARINET) is proposed to solve CC-UDA and PC-UDA problems. CLARINET maintains two deep networks simultaneously, where one focuses on classifying complementary-label source data and the other takes care of source-to-target distributional adaptation. Experiments show that CLARINET significantly outperforms a series of competent baselines on handwritten-digit-recognition and object-recognition tasks.
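
To make the notion of a complementary label concrete, the sketch below shows one simple way to train on labels of the form "this sample is not class k": penalize the probability the model assigns to the excluded class. This is a generic illustrative loss, not the loss used by CLARINET, and the function names are assumptions.

```python
import numpy as np


def softmax(logits):
    """Row-wise softmax with the usual max-shift for numerical stability."""
    z = logits - logits.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)


def complementary_label_loss(logits, comp_labels, eps=1e-12):
    """Mean of -log(1 - p[excluded class]) over the batch.

    logits      : array of shape (batch, classes)
    comp_labels : integer array of shape (batch,), each entry a class the
                  sample is known NOT to belong to
    """
    p = softmax(logits)
    p_excluded = p[np.arange(len(comp_labels)), comp_labels]
    return float(np.mean(-np.log(1.0 - p_excluded + eps)))
```

The loss is near zero when the model puts little mass on the excluded class and grows as that mass approaches one, which is the only supervision a complementary label provides.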

Clarinet: A One-step Approach Towards Budget-friendly Unsupervised Domain Adaptation

Jul 29, 2020

Yiyang Zhang, Feng Liu, Zhen Fang, Bo Yuan, Guangquan Zhang, Jie Lu

Abstract: In unsupervised domain adaptation (UDA), classifiers for the target domain are trained with massive true-label data from the source domain and unlabeled data from the target domain. However, it may be difficult to collect fully-true-label data in a source domain given a limited budget. To mitigate this problem, we consider a novel problem setting, named budget-friendly UDA (BFUDA), where the classifier for the target domain has to be trained with complementary-label data from the source domain and unlabeled data from the target domain. The key benefit is that it is much less costly to collect complementary-label source data (required by BFUDA) than to collect true-label source data (required by ordinary UDA). To this end, the complementary label adversarial network (CLARINET) is proposed to solve the BFUDA problem. CLARINET maintains two deep networks simultaneously, where one focuses on classifying complementary-label source data and the other takes care of the source-to-target distributional adaptation. Experiments show that CLARINET significantly outperforms a series of competent baselines.

* This paper has been accepted by IJCAI-PRICAI 2020
