Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Qizhen Ying

Missing Data Imputation by Reducing Mutual Information with Rectified Flows

May 16, 2025

Jiahao Yu, Qizhen Ying, Leyang Wang, Ziyue Jiang, Song Liu

Abstract:This paper introduces a novel iterative method for missing data imputation that sequentially reduces the mutual information between data and their corresponding missing mask. Inspired by GAN-based approaches, which train generators to decrease the predictability of missingness patterns, our method explicitly targets the reduction of mutual information. Specifically, our algorithm iteratively minimizes the KL divergence between the joint distribution of the imputed data and missing mask, and the product of their marginals from the previous iteration. We show that the optimal imputation under this framework corresponds to solving an ODE, whose velocity field minimizes a rectified flow training objective. We further illustrate that some existing imputation techniques can be interpreted as approximate special cases of our mutual-information-reducing framework. Comprehensive experiments on synthetic and real-world datasets validate the efficacy of our proposed approach, demonstrating superior imputation performance.

Via

Access Paper or Ask Questions

High-Dimensional Differential Parameter Inference in Exponential Family using Time Score Matching

Oct 14, 2024

Daniel J. Williams, Leyang Wang, Qizhen Ying, Song Liu, Mladen Kolar

Figure 1 for High-Dimensional Differential Parameter Inference in Exponential Family using Time Score Matching

Figure 2 for High-Dimensional Differential Parameter Inference in Exponential Family using Time Score Matching

Figure 3 for High-Dimensional Differential Parameter Inference in Exponential Family using Time Score Matching

Figure 4 for High-Dimensional Differential Parameter Inference in Exponential Family using Time Score Matching

Abstract:This paper addresses differential inference in time-varying parametric probabilistic models, like graphical models with changing structures. Instead of estimating a high-dimensional model at each time and inferring changes later, we directly learn the differential parameter, i.e., the time derivative of the parameter. The main idea is treating the time score function of an exponential family model as a linear model of the differential parameter for direct estimation. We use time score matching to estimate parameter derivatives. We prove the consistency of a regularized score matching objective and demonstrate the finite-sample normality of a debiased estimator in high-dimensional settings. Our methodology effectively infers differential structures in high-dimensional graphical models, verified on simulated and real-world datasets.

* Daniel J. Williams and Leyang Wang contributed equally to this work

Via

Access Paper or Ask Questions