Abstract:With the wide application of machine learning techniques in practice, privacy preservation has gained increasing attention. Protecting user privacy with minimal accuracy loss is a fundamental task in the data analysis and mining community. In this paper, we focus on regression tasks under $\varepsilon$-label differential privacy guarantees. Some existing methods for regression with $\varepsilon$-label differential privacy, such as the RR-on-Bins mechanism, discretize the output space into finite bins and then apply the randomized response (RR) algorithm. To determine these finite bins efficiently, the authors round the original responses down to integer values. However, such an operation does not align well with real-world scenarios. To overcome these limitations, we model both original and randomized responses as continuous random variables, avoiding discretization entirely. Our novel approach estimates an optimal interval for randomized responses and introduces new algorithms designed for scenarios where a prior is either known or unknown. Additionally, we prove that our algorithm, RPWithPrior, guarantees $\varepsilon$-label differential privacy. Numerical results demonstrate that our approach achieves better performance than the Gaussian, Laplace, Staircase, RR-on-Bins, and Unbiased mechanisms on the Communities and Crime, Criteo Sponsored Search Conversion Log, and California Housing datasets.
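For reference, a minimal sketch of the RR-on-Bins baseline described above; the function and argument names are ours, and the paper's continuous-response mechanism (RPWithPrior) is not reproduced here. Each true bin is kept with probability $e^{\varepsilon}/(e^{\varepsilon}+B-1)$, which is the standard $B$-ary randomized response and satisfies $\varepsilon$-label differential privacy.

```python
import numpy as np

def rr_on_bins(y, bin_edges, epsilon, rng=None):
    """B-ary randomized response over a discretized label range (a sketch
    of the RR-on-Bins baseline, not the paper's proposed mechanism)."""
    rng = np.random.default_rng() if rng is None else rng
    B = len(bin_edges) - 1
    true_bin = int(np.clip(np.digitize(y, bin_edges) - 1, 0, B - 1))
    # Keep the true bin with probability e^eps / (e^eps + B - 1); the
    # likelihood ratio between any two reported bins is then exactly e^eps.
    p_keep = np.exp(epsilon) / (np.exp(epsilon) + B - 1)
    if rng.random() < p_keep:
        out = true_bin
    else:
        # Report one of the other B - 1 bins uniformly at random.
        out = int(rng.integers(B - 1))
        out += (out >= true_bin)
    centers = 0.5 * (bin_edges[:-1] + bin_edges[1:])
    return centers[out]
```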
Abstract:Classifier-free guidance (CFG) is a widely used technique for controllable generation in diffusion and flow-based models. Despite its empirical success, CFG relies on a heuristic linear extrapolation that is often sensitive to the guidance scale. In this work, we provide a principled interpretation of CFG through the lens of optimization. We demonstrate that the velocity field in flow matching corresponds to the gradient of a sequence of smoothed distance functions, which guides latent variables toward the scaled target image set. This perspective reveals that the standard CFG formulation is an approximation of this gradient, where the prediction gap (the discrepancy between conditional and unconditional outputs) governs guidance sensitivity. Leveraging this insight, we reformulate CFG sampling as a homotopy optimization with a manifold constraint. This formulation necessitates a manifold projection step, which we implement via an incremental gradient descent scheme during sampling. To improve computational efficiency and stability, we further enhance this iterative process with Anderson Acceleration, without requiring additional model evaluations. Our proposed methods are training-free and consistently improve generation fidelity, prompt alignment, and robustness to the guidance scale. We validate their effectiveness across diverse benchmarks, demonstrating significant improvements on large-scale models such as DiT-XL-2-256, Flux, and Stable Diffusion 3.5.
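For concreteness, the standard CFG linear extrapolation that the paper reinterprets can be sketched as follows; the model is assumed to return a velocity field, and all names here are illustrative rather than the paper's API.

```python
import torch

def cfg_velocity(model, x_t, t, cond, uncond, scale):
    """Standard classifier-free guidance for a flow/velocity model:
    v_cfg = v_uncond + scale * (v_cond - v_uncond).

    The 'prediction gap' (v_cond - v_uncond) is the quantity whose
    magnitude governs sensitivity to the guidance scale."""
    v_cond = model(x_t, t, cond)      # conditional prediction
    v_uncond = model(x_t, t, uncond)  # unconditional prediction
    gap = v_cond - v_uncond           # prediction gap
    return v_uncond + scale * gap
```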
Abstract:Numerous studies have shown that label noise can lead to poor generalization performance, negatively affecting classification accuracy. Therefore, understanding the effectiveness of classifiers trained with deep neural networks in the presence of noisy labels is of considerable practical significance. In this paper, we focus on error bounds of the excess risk for classification problems with noisy labels within deep learning frameworks. We begin by exploring loss functions with noise-tolerant properties, ensuring that the empirical minimizer on noisy data aligns with that on the true data. Next, we estimate error bounds for the excess risk, expressed as a sum of statistical error and approximation error. We estimate the statistical error on a dependent (mixing) sequence, bounding it with the help of an associated independent block sequence. For the approximation error, we first express the classifiers as the composition of the softmax function and a continuous function from $[0,1]^d$ to $\mathbb{R}^K$; the main task is then to estimate the approximation error of this continuous function. Finally, we address the curse of dimensionality under a low-dimensional manifold assumption.
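For concreteness, the decomposition referred to above can be written schematically as follows; $\mathcal{E}$ and $\mathcal{F}$ are our notation for the excess risk and the network class, not necessarily the paper's.

```latex
% Schematic error decomposition: the excess risk of the empirical
% minimizer \hat f over a network class \mathcal{F} splits into a
% statistical term and an approximation term.
\[
  \mathcal{E}(\hat f)
  \;=\;
  \underbrace{\mathcal{E}(\hat f) - \inf_{f \in \mathcal{F}} \mathcal{E}(f)}_{\text{statistical error}}
  \;+\;
  \underbrace{\inf_{f \in \mathcal{F}} \mathcal{E}(f)}_{\text{approximation error}}
\]
```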




Abstract:Diffusion models have shown great promise in solving inverse problems in image processing. In this paper, we propose a novel, problem-agnostic diffusion method for inverse problems: maximum a posteriori (MAP)-based guided term estimation. Following Bayes' rule, we split the conditional score function into two terms: the unconditional score function and the guided term. The unconditional score function is approximated by an existing score network, while we design a MAP-based method to estimate the guided term. To this end, we rely on the assumption that the space of clean natural images is inherently smooth and introduce a MAP estimate of the $t$-th latent variable. Substituting this estimate into the expression of the inverse problem yields an approximation of the guided term. We evaluate our method extensively on super-resolution, inpainting, and denoising tasks, and demonstrate performance comparable to DDRM, DMPS, DPS, and $\Pi$GDM.
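For reference, the Bayes split described above can be written as follows; $y$ denotes the measurement and $x_t$ the $t$-th latent variable.

```latex
% Bayes' rule splits the conditional score into the unconditional score
% (approximated by a pretrained score network) and the guided term,
% which the paper estimates via a MAP estimate of the t-th latent.
\[
  \nabla_{x_t} \log p_t(x_t \mid y)
  = \underbrace{\nabla_{x_t} \log p_t(x_t)}_{\text{unconditional score}}
  + \underbrace{\nabla_{x_t} \log p_t(y \mid x_t)}_{\text{guided term}}
\]
```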




Abstract:We bridge fairness gaps from a statistical perspective by selectively using either conditional distance covariance or distance covariance statistics to assess the independence between predictions and sensitive attributes. We enhance fairness by incorporating the sample (conditional) distance covariance as a tractable penalty term into the machine learning objective. Additionally, we present the matrix form of the empirical (conditional) distance covariance for parallel computation, improving computational efficiency. Theoretically, we prove the convergence of the empirical (conditional) distance covariance to its population counterpart, establishing the necessary guarantees for batch computation. Through experiments on a range of real-world datasets, we demonstrate that our method effectively bridges the fairness gap in machine learning.
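A minimal sketch of the empirical distance covariance penalty in matrix form, assuming predictions and sensitive attributes arrive as 2-D batches; the conditional variant and the paper's exact estimator are not reproduced here.

```python
import torch

def distance_covariance(x, z):
    """Empirical (squared) distance covariance between predictions x
    and sensitive attributes z, both of shape (n, d), in the matrix
    form suited to batch computation: double-center the pairwise
    distance matrices and average their elementwise product."""
    a = torch.cdist(x, x)  # pairwise distances among predictions
    b = torch.cdist(z, z)  # pairwise distances among attributes
    A = a - a.mean(0, keepdim=True) - a.mean(1, keepdim=True) + a.mean()
    B = b - b.mean(0, keepdim=True) - b.mean(1, keepdim=True) + b.mean()
    return (A * B).mean()

# Illustrative usage: add the statistic as a fairness penalty,
# e.g. loss = task_loss + lam * distance_covariance(preds, sens).
```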




Abstract:Neural collapse, a recently identified phenomenon, describes a property of the solutions found during model training. In this paper, we explore neural collapse in the context of imbalanced data. We consider the $L$-extended unconstrained feature model with a bias term and provide a theoretical analysis of its global minimizers. Our findings include: (1) Features within the same class converge to their class mean, as in both the balanced case and the imbalanced case without bias. (2) The geometric structure is mainly determined by the left orthonormal transformation of the product of the $L$ linear classifiers and the right transformation of the class-mean matrix. (3) Some rows of the left orthonormal transformation of the product of the $L$ linear classifiers collapse to zero while the others are orthogonal; which rows collapse depends on the singular values of $\hat Y=(I_K-\frac{1}{N}\mathbf{n}1^\top_K)D$, where $K$ is the number of classes, $\mathbf{n}$ is the vector of per-class sample sizes, $N$ is the total sample size, and $D$ is the diagonal matrix whose diagonal entries are given by $\sqrt{\mathbf{n}}$. Similar results hold for the columns of the right orthonormal transformation of the product of the class-mean matrix and $D$. (4) The $i$-th row of the left orthonormal transformation of the product of the $L$ linear classifiers aligns with the $i$-th column of the right orthonormal transformation of the product of the class-mean matrix and $D$. (5) We provide estimates of the singular values of $\hat Y$. Our numerical experiments support these theoretical findings.
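For illustration, a short numeric sketch of $\hat Y$ and its singular values; the class sizes below are hypothetical, chosen only to show the imbalanced setting.

```python
import numpy as np

# \hat Y = (I_K - (1/N) n 1_K^T) D with D = diag(sqrt(n)), whose singular
# values determine which rows of the left transformation collapse to zero.
n = np.array([100.0, 50.0, 25.0, 25.0])  # illustrative per-class sizes
K, N = len(n), n.sum()
D = np.diag(np.sqrt(n))
Y_hat = (np.eye(K) - np.outer(n, np.ones(K)) / N) @ D
print(np.linalg.svd(Y_hat, compute_uv=False))  # singular values of \hat Y
```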
Abstract:This paper investigates how adjustments to deep learning architectures affect model performance in image classification. Small-scale experiments generate initial insights, although the observed trends are not always consistent with results on the entire dataset. Filtering operations in the image processing pipeline are crucial, with image filtering before pre-processing yielding better results. The choice and order of layers, as well as filter placement, significantly affect model performance. This study provides valuable insights into optimizing deep learning models, with potential avenues for future research including collaborative platforms.




Abstract:Cooperative coevolutionary algorithms (CCEAs) divide a given problem into a number of subproblems and use an evolutionary algorithm to solve each subproblem. This short paper is concerned with the scenario in which only a single, global fitness measure exists. By removing the typically used subproblem partnering mechanism, it is suggested that such CCEAs can be viewed as making use of a generalised version of the global crossover operator introduced in early Evolution Strategies. Using the well-known NK model of fitness landscapes, the effects of varying aspects of global crossover are explored with respect to the ruggedness of the underlying fitness landscape. Results suggest improvements over the most widely used form of CCEA, a finding further demonstrated on other well-known test functions.
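A minimal sketch, under our reading of the abstract, of an NK fitness evaluation and a generalised global (uniform) crossover; neighbourhood structure, parameters, and names are illustrative, not the paper's exact operators.

```python
import numpy as np

rng = np.random.default_rng(0)

def nk_fitness(genome, K, tables):
    """NK-model fitness: each locus contributes via a lookup table over
    itself and its K (here: consecutive, wrapping) neighbours."""
    N = len(genome)
    total = 0.0
    for i in range(N):
        neigh = tuple(genome[(i + j) % N] for j in range(K + 1))
        total += tables[i][neigh]
    return total / N

def global_crossover(parents):
    """Generalised global (uniform) crossover: each offspring gene is
    drawn from a randomly chosen parent, mimicking a CCEA with the
    partnering mechanism removed."""
    parents = np.asarray(parents)
    picks = rng.integers(len(parents), size=parents.shape[1])
    return parents[picks, np.arange(parents.shape[1])]

# Illustrative usage on a small NK landscape (N=8, K=2).
N, K = 8, 2
tables = [rng.random((2,) * (K + 1)) for _ in range(N)]
pop = rng.integers(2, size=(5, N))  # five binary parents
child = global_crossover(pop)
print(nk_fitness(child, K, tables))
```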




Abstract:Metrics for measuring citation sentiment changes are introduced. Citation sentiment changes can be observed from global citation sentiment sequences (GCSSs): with respect to a given cited paper, citation sentiments are analysed across a collection of citing papers ordered by publication time. To analyse GCSSs, the Eddy Dissipation Rate (EDR) is adopted, under the hypothesis that differences in GCSS patterns can be spotted by an EDR-based method. Preliminary evidence shows that the EDR-based method holds potential for analysing a publication's impact in a time-series fashion.
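A minimal sketch of constructing a GCSS for one cited paper, under our reading of the abstract; the column names and the sentiment-to-value mapping are illustrative assumptions, and the EDR-based analysis itself is not reproduced here.

```python
import pandas as pd

def build_gcss(citations):
    """Build a global citation sentiment sequence (GCSS): order the
    citing papers by publication time and map their sentiment labels
    to numeric values. Keys and mapping are illustrative assumptions."""
    sentiment_value = {"negative": -1, "neutral": 0, "positive": 1}
    df = pd.DataFrame(citations).sort_values("published")
    return df["sentiment"].map(sentiment_value).tolist()

# Example: three citing papers, ordered by publication date.
gcss = build_gcss([
    {"published": "2019-03", "sentiment": "positive"},
    {"published": "2018-07", "sentiment": "neutral"},
    {"published": "2020-01", "sentiment": "negative"},
])
print(gcss)  # [0, 1, -1]
```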




Abstract:Bronchoscopy, as a follow-up procedure to radiological imaging, plays a key role in lung disease diagnosis and in determining treatment plans for patients. Doctors need to decide in a timely manner whether to biopsy a patient when performing bronchoscopy. However, they also need to be very selective with biopsies, as biopsies may cause uncontrollable, life-threatening bleeding of the lung tissue. To help doctors be more selective with biopsies and to provide a second opinion on diagnosis, we propose in this work a computer-aided diagnosis (CAD) system for lung diseases, including cancers and tuberculosis (TB). The system is developed based on transfer learning, and we propose a novel transfer learning method: sentential fine-tuning. Compared with traditional fine-tuning methods, our method achieves the best performance. On a dataset of 81 normal cases, 76 tuberculosis cases, and 277 lung cancer cases, we obtain an overall accuracy of 77.0%, while the other traditional transfer learning methods achieve accuracies of 73% and 68%. The detection accuracies of our method for cancers, TB, and normal cases are 87%, 54%, and 91%, respectively. This indicates that the CAD system has the potential to improve the accuracy of lung disease diagnosis in bronchoscopy and might also help doctors be more selective with biopsies.
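For reference, a minimal sketch of the traditional fine-tuning baseline such a system is compared against; the backbone choice and freezing policy are illustrative assumptions, and the proposed sentential fine-tuning method is not specified in the abstract, so it is not reproduced.

```python
import torch.nn as nn
from torchvision import models

def make_baseline(num_classes=3, freeze_backbone=True):
    """Traditional transfer-learning baseline for 3-way classification
    (normal / TB / cancer): take an ImageNet-pretrained CNN, optionally
    freeze the backbone, and retrain a new classification head."""
    model = models.resnet18(weights=models.ResNet18_Weights.IMAGENET1K_V1)
    if freeze_backbone:
        for p in model.parameters():
            p.requires_grad = False  # keep pretrained features fixed
    # Replace the head; the new layer is trainable by default.
    model.fc = nn.Linear(model.fc.in_features, num_classes)
    return model
```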