Yilong Yin

Towards Understanding Generalization of Macro-AUC in Multi-label Learning

May 09, 2023

Guoqiang Wu, Chongxuan Li, Yilong Yin

Abstract: Macro-AUC is the arithmetic mean of the class-wise AUCs in multi-label learning and is commonly used in practice. However, its theoretical understanding is largely lacking. Toward closing this gap, we characterize the generalization properties of various learning algorithms based on the corresponding surrogate losses w.r.t. Macro-AUC. We theoretically identify a critical factor of the dataset affecting the generalization bounds: the label-wise class imbalance. Our results on the imbalance-aware error bounds show that the widely-used univariate loss-based algorithm is more sensitive to the label-wise class imbalance than the proposed pairwise and reweighted loss-based ones, which probably implies its worse performance. Moreover, empirical results on various datasets corroborate our theoretical findings. Technically, to establish these results, we propose a new (and more general) McDiarmid-type concentration inequality, which may be of independent interest.

* Accepted in ICML 2023; Still in camera-ready stage
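
The definition in the abstract above (Macro-AUC as the arithmetic mean of the class-wise AUCs) can be illustrated with a minimal sketch. The function names `auc` and `macro_auc` and the toy data are assumptions made for illustration, not code from the paper; each per-label AUC is computed as the fraction of (positive, negative) pairs ranked correctly, with ties counted as one half.

```python
from typing import List, Sequence


def auc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """AUC for one label: fraction of (positive, negative) pairs ranked correctly."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC is undefined without both positive and negative instances")
    # A correctly ordered pair counts 1, a tie counts 0.5, a misordered pair counts 0.
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def macro_auc(score_matrix: List[List[float]], label_matrix: List[List[int]]) -> float:
    """Arithmetic mean of the per-label AUCs (rows are instances, columns are labels)."""
    num_labels = len(label_matrix[0])
    per_label = [
        auc([row[k] for row in score_matrix], [row[k] for row in label_matrix])
        for k in range(num_labels)
    ]
    return sum(per_label) / num_labels


# Toy example (hypothetical data): 4 instances, 2 labels.
scores = [[0.9, 0.2], [0.8, 0.7], [0.3, 0.6], [0.1, 0.4]]
labels = [[1, 0], [1, 1], [0, 0], [0, 1]]
print(macro_auc(scores, labels))  # 0.875 (label 0 has AUC 1.0, label 1 has AUC 0.75)
```

Because each label's AUC is averaged with equal weight, a label with very few positives contributes as much as a balanced one, which is one way to see why label-wise class imbalance matters for this metric.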

MetaViewer: Towards A Unified Multi-View Representation

Mar 11, 2023

Ren Wang, Haoliang Sun, Yuling Ma, Xiaoming Xi, Yilong Yin

Abstract: Existing multi-view representation learning methods typically follow a specific-to-uniform pipeline, extracting latent features from each view and then fusing or aligning them to obtain the unified object representation. However, manually pre-specified fusion functions and view-private redundant information mixed into the features can degrade the quality of the derived representation. To overcome these issues, we propose a novel bi-level-optimization-based multi-view learning framework, where the representation is learned in a uniform-to-specific manner. Specifically, we train a meta-learner, namely MetaViewer, to learn fusion and model the view-shared meta representation in the outer-level optimization. Starting from this meta representation, view-specific base-learners are then required to rapidly reconstruct the corresponding view at the inner level. MetaViewer is eventually updated by observing the reconstruction processes from uniform to specific over all views, and learns an optimal fusion scheme that separates and filters out view-private information. Extensive experimental results in downstream tasks such as classification and clustering demonstrate the effectiveness of our method.

* 8 pages, 5 figures, conference

Fine-Grained Classification with Noisy Labels

Mar 04, 2023

Qi Wei, Lei Feng, Haoliang Sun, Ren Wang, Chenhui Guo, Yilong Yin

Abstract: Learning with noisy labels (LNL) aims to ensure model generalization given a label-corrupted training set. In this work, we investigate a rarely studied scenario of LNL on fine-grained datasets (LNL-FG), which is more practical and challenging because large inter-class ambiguities among fine-grained classes cause more noisy labels. We empirically show that existing methods that work well for LNL fail to achieve satisfying performance for LNL-FG, raising the practical need for effective solutions for LNL-FG. To this end, we propose a novel framework called stochastic noise-tolerated supervised contrastive learning (SNSCL) that confronts label noise by encouraging distinguishable representations. Specifically, we design a noise-tolerated supervised contrastive learning loss that incorporates a weight-aware mechanism for noisy label correction and for selectively updating momentum queue lists. With this mechanism, we mitigate the effects of noisy anchors and avoid inserting noisy labels into the momentum-updated queue. Besides, to avoid manually defined augmentation strategies in contrastive learning, we propose an efficient stochastic module that samples feature embeddings from a generated distribution, which can also enhance the representation ability of deep models. SNSCL is general and compatible with prevailing robust LNL strategies to improve their performance for LNL-FG. Extensive experiments demonstrate the effectiveness of SNSCL.

* Accepted to CVPR 2023

Topological Structure Learning for Weakly-Supervised Out-of-Distribution Detection

Sep 16, 2022

Rundong He, Rongxue Li, Zhongyi Han, Yilong Yin

Abstract: Out-of-distribution (OOD) detection is the key to deploying models safely in the open world. For OOD detection, collecting sufficient in-distribution (ID) labeled data is usually more time-consuming and costly than collecting unlabeled data. When ID labeled data is limited, previous OOD detection methods are no longer superior due to their high dependence on the amount of ID labeled data. Based on limited ID labeled data and sufficient unlabeled data, we define a new setting called Weakly-Supervised Out-of-Distribution Detection (WSOOD). To solve the new problem, we propose an effective method called Topological Structure Learning (TSL). First, TSL uses a contrastive learning method to build the initial topological structure space for ID and OOD data. Second, TSL mines effective topological connections in the initial topological space. Finally, based on limited ID labeled data and mined topological connections, TSL reconstructs the topological structure in a new topological space to increase the separability of ID and OOD instances. Extensive studies on several representative datasets show that TSL remarkably outperforms the state-of-the-art, verifying the validity and robustness of our method in the new setting of WSOOD.

Self-Filtering: A Noise-Aware Sample Selection for Label Noise with Confidence Penalization

Aug 24, 2022

Qi Wei, Haoliang Sun, Xiankai Lu, Yilong Yin

Abstract: Sample selection is an effective strategy to mitigate the effect of label noise in robust learning. Typical strategies commonly apply the small-loss criterion to identify clean samples. However, samples lying near the decision boundary with large losses are often entangled with noisy examples and would be discarded under this criterion, leading to heavy degradation of generalization performance. In this paper, we propose a novel selection strategy, Self-Filtering (SFT), that utilizes the fluctuation of noisy examples in historical predictions to filter them, which can avoid the selection bias of the small-loss criterion for the boundary examples. Specifically, we introduce a memory bank module that stores the historical predictions of each example and is dynamically updated to support the selection for the subsequent learning iteration. Besides, to reduce the accumulated error of the sample selection bias of SFT, we devise a regularization term to penalize the confident output distribution. By increasing the weight of the misclassified categories with this term, the loss function is robust to label noise under mild conditions. We conduct extensive experiments on three benchmarks with various noise types and achieve new state-of-the-art results. Ablation studies and further analysis verify the virtue of SFT for sample selection in robust learning.

* European Conference on Computer Vision 2022
* 14 pages
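
The memory-bank idea described in the abstract above can be sketched roughly as follows. This is not the authors' implementation: the class name `MemoryBank`, the `history_len` parameter, and the specific fluctuation rule (an example counts as fluctuating if it matched its given label at one recorded epoch and mismatched it at the next) are all assumptions chosen for illustration. The abstract does not spell out the exact criterion.

```python
from collections import deque
from typing import List, Sequence


class MemoryBank:
    """Stores, per example, whether recent predictions matched the given label."""

    def __init__(self, num_samples: int, history_len: int = 3) -> None:
        # One bounded history per example; old entries are dropped automatically.
        self.history = [deque(maxlen=history_len) for _ in range(num_samples)]

    def update(self, predictions: Sequence[int], given_labels: Sequence[int]) -> None:
        """Record one epoch of predictions against the (possibly noisy) labels."""
        for i, (pred, label) in enumerate(zip(predictions, given_labels)):
            self.history[i].append(pred == label)

    def fluctuates(self, index: int) -> bool:
        """Assumed rule: correct at some recorded epoch, then wrong at the next one."""
        h = list(self.history[index])
        return any(prev and not cur for prev, cur in zip(h, h[1:]))

    def select(self) -> List[int]:
        """Indices of examples kept for the next iteration (no fluctuation seen)."""
        return [i for i in range(len(self.history)) if not self.fluctuates(i)]


# Toy run (hypothetical data): 3 examples, 3 epochs of predictions.
bank = MemoryBank(num_samples=3, history_len=3)
given_labels = [0, 1, 1]
for epoch_predictions in ([0, 1, 0], [0, 0, 0], [0, 1, 0]):
    bank.update(epoch_predictions, given_labels)
print(bank.select())  # [0, 2]: example 1 flipped from correct to wrong, so it is filtered
```

Note that under this simplified rule an example that is never predicted as its given label (example 2 here) is not filtered, since it never fluctuates; a real selection scheme would likely combine the fluctuation signal with other checks.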

DRNet: Decomposition and Reconstruction Network for Remote Physiological Measurement

Jun 20, 2022

Yuhang Dong, Gongping Yang, Yilong Yin

Abstract: Remote photoplethysmography (rPPG) based physiological measurement has great application value in affective computing, non-contact health monitoring, telehealth monitoring, etc., and has become increasingly important, especially during the COVID-19 pandemic. Existing methods are generally divided into two groups. The first focuses on mining the subtle blood volume pulse (BVP) signals from face videos, but seldom explicitly models the noise that dominates face video content. Such methods are susceptible to noise and may suffer from poor generalization ability in unseen scenarios. The second focuses on modeling noisy data directly, resulting in suboptimal performance due to the lack of regularity in this severe random noise. In this paper, we propose a Decomposition and Reconstruction Network (DRNet) focusing on the modeling of physiological features rather than noisy data. A novel cycle loss is proposed to constrain the periodicity of physiological information. Besides, a plug-and-play Spatial Attention Block (SAB) is proposed to enhance features along with the spatial location information. Furthermore, an efficient Patch Cropping (PC) augmentation strategy is proposed to synthesize augmented samples with different noise and features. Extensive experiments on different public datasets, as well as cross-database testing, demonstrate the effectiveness of our approach.

Neural Network Compression via Effective Filter Analysis and Hierarchical Pruning

Jun 07, 2022

Ziqi Zhou, Li Lian, Yilong Yin, Ze Wang

Abstract: Network compression is crucial to making deep networks more efficient, faster, and generalizable to low-end hardware. Current network compression methods have two open problems: first, a theoretical framework for estimating the maximum compression rate is lacking; second, some layers may get over-pruned, resulting in a significant drop in network performance. To solve these two problems, this study proposes a gradient-matrix singularity analysis-based method to estimate the maximum network redundancy. Guided by that maximum rate, a novel and efficient hierarchical network pruning algorithm is developed to maximally condense the neuronal network structure without sacrificing network performance. Substantial experiments are performed to demonstrate the efficacy of the new method for pruning several advanced convolutional neural network (CNN) architectures. Compared to existing pruning methods, the proposed pruning algorithm achieved state-of-the-art performance. At the same or similar compression ratio, the new method provided the highest network prediction accuracy among the compared methods.

Active Source Free Domain Adaptation

May 22, 2022

Fan Wang, Zhongyi Han, Zhiyan Zhang, Yilong Yin

Abstract: Source free domain adaptation (SFDA) aims to transfer a trained source model to the unlabeled target domain without accessing the source data. However, the SFDA setting faces an effectiveness bottleneck due to the absence of source data and target supervised information, as evidenced by the limited performance gains of the newest SFDA methods. In this paper, for the first time, we introduce a more practical scenario called active source free domain adaptation (ASFDA) that permits actively selecting a few target data to be labeled by experts. To achieve that, we first find that points that are neighbor-chaotic, individual-different, and target-like are the best ones to select, and we define them as the minimum happy (MH) points. We then propose minimum happy points learning (MHPL) to actively explore and exploit MH points. We design three unique strategies to explore the MH points: neighbor ambient uncertainty, neighbor diversity relaxation, and one-shot querying. Further, to fully exploit MH points in the learning process, we design a neighbor focal loss that assigns the weighted neighbor purity to the cross-entropy loss of MH points to make the model focus more on them. Extensive experiments verify that MHPL remarkably exceeds various types of baselines and achieves significant performance gains at a small labeling cost.

* 9 pages (not including references and checklist), 4 figures

Exploring Linear Feature Disentanglement For Neural Networks

Mar 22, 2022

Tiantian He, Zhibin Li, Yongshun Gong, Yazhou Yao, Xiushan Nie, Yilong Yin

Abstract: Non-linear activation functions, e.g., Sigmoid, ReLU, and Tanh, have achieved great success in neural networks (NNs). Due to the complex non-linear characteristics of samples, the objective of those activation functions is to project samples from their original feature space to a linearly separable feature space. This phenomenon ignites our interest in exploring whether all features need to be transformed by all non-linear functions in current typical NNs, i.e., whether some features reach the linearly separable feature space in the intermediate layers and require only an affine transformation rather than further non-linear variation. To validate the above hypothesis, we explore the problem of linear feature disentanglement for neural networks in this paper. Specifically, we devise a learnable mask module to distinguish between linear and non-linear features. Through our designed experiments, we find that some features reach the linearly separable space earlier than others and can be partly detached from the NNs. The explored method also provides a readily feasible pruning strategy that barely affects the performance of the original model. We conduct our experiments on four datasets and present promising results.

Series Photo Selection via Multi-view Graph Learning

Mar 18, 2022

Jin Huang, Lu Zhang, Yongshun Gong, Jian Zhang, Xiushan Nie, Yilong Yin

Abstract: Series photo selection (SPS) is an important branch of image aesthetics quality assessment, which focuses on finding the best one from a series of nearly identical photos. While great progress has been made, most existing SPS approaches concentrate solely on extracting features from the original image, neglecting that multiple views, e.g., saturation level, color histogram, and depth of field of the image, help reflect subtle aesthetic changes. Taking multiple views into consideration, we leverage a graph neural network to construct the relationships between multi-view features. Besides, multiple views are aggregated with an adaptive-weight self-attention module to verify the significance of each view. Finally, a siamese network is proposed to select the best one from a series of nearly identical photos. Experimental results demonstrate that our model achieves the highest success rates compared with competitive methods.
