Rajesh Shrestha

Taming Score-Based Denoisers in ADMM: A Convergent Plug-and-Play Framework

Mar 10, 2026

Rajesh Shrestha, Xiao Fu

Abstract: While score-based generative models have emerged as powerful priors for solving inverse problems, directly integrating them into optimization algorithms such as ADMM remains nontrivial. Two central challenges arise: i) the mismatch between the noisy data manifolds used to train the score functions and the geometry of ADMM iterates, especially due to the influence of dual variables, and ii) the lack of convergence understanding when ADMM is equipped with score-based denoisers. To address the manifold mismatch issue, we propose ADMM plug-and-play (ADMM-PnP) with the AC-DC denoiser, a new framework that embeds a three-stage denoiser into ADMM: (1) auto-correction (AC) via additive Gaussian noise, (2) directional correction (DC) using conditional Langevin dynamics, and (3) score-based denoising. In terms of convergence, we establish two results: first, under proper denoiser parameters, each ADMM iteration is a weakly nonexpansive operator, ensuring high-probability fixed-point $\textit{ball convergence}$ using a constant step size; second, under more relaxed conditions, the AC-DC denoiser is a bounded denoiser, which leads to convergence under an adaptive step size schedule. Experiments on a range of inverse problems demonstrate that our method consistently improves solution quality over a variety of baselines.

Distribution Matching via Generalized Consistency Models

Aug 17, 2025

Sagar Shrestha, Rajesh Shrestha, Tri Nguyen, Subash Timilsina

Abstract: Recent advances in generative models have demonstrated remarkable performance across various data modalities. Beyond their typical use in data synthesis, these models play a crucial role in distribution matching tasks such as latent variable modeling, domain translation, and domain adaptation. Generative Adversarial Networks (GANs) have emerged as the preferred method of distribution matching due to their efficacy in handling high-dimensional data and their flexibility in accommodating various constraints. However, GANs often encounter challenges in training due to their bi-level min-max optimization objective and susceptibility to mode collapse. In this work, we propose a novel approach for distribution matching inspired by the consistency models employed in Continuous Normalizing Flow (CNF). Our model inherits the advantages of CNF models, such as having a straightforward norm minimization objective, while remaining adaptable to different constraints similar to GANs. We provide theoretical validation of our proposed objective and demonstrate its performance through experiments on synthetic and real-world datasets.

Downlink MIMO Channel Estimation from Bits: Recoverability and Algorithm

Nov 25, 2024

Rajesh Shrestha, Mingjie Shao, Mingyi Hong, Wing-Kin Ma, Xiao Fu

Figures 1-4 for Downlink MIMO Channel Estimation from Bits: Recoverability and Algorithm

Abstract: In frequency division duplex (FDD) massive MIMO systems, a major challenge lies in acquiring the downlink channel state information (CSI) at the base station (BS) from limited feedback sent by the user equipment (UE). To tackle this fundamental task, our contribution is twofold: First, a simple feedback framework is proposed, where a compression and Gaussian dithering-based quantization strategy is adopted at the UE side, and then a maximum likelihood estimator (MLE) is formulated at the BS side. Recoverability of the MIMO channel under the widely used double directional model is established. Specifically, analyses are presented for two compression schemes -- showing that one is more overhead-economical and the other is computationally lighter at the UE side. Second, to realize the MLE, an alternating direction method of multipliers (ADMM) algorithm is proposed. The algorithm is carefully designed to integrate a sophisticated harmonic retrieval (HR) solver as a subroutine, which turns out to be the key to effectively tackling this hard MLE problem. Extensive numerical experiments are conducted to validate the efficacy of our approach.

Conditional Image Generation with Pretrained Generative Model

Dec 20, 2023

Rajesh Shrestha, Bowen Xie

Abstract: In recent years, diffusion models have gained popularity for their ability to generate higher-quality images than GAN models. However, like other large generative models, these models require a huge amount of data, computational resources, and meticulous tuning for successful training. This poses a significant challenge, rendering training infeasible for most individuals. As a result, the research community has devised methods to leverage pre-trained unconditional diffusion models with additional guidance for the purpose of conditional image generation. These methods enable conditional image generation on diverse inputs and, most importantly, circumvent the need for training the diffusion model. In this paper, our objective is to reduce the time required and the computational overhead introduced by the addition of guidance in diffusion models -- while maintaining comparable image quality. We propose a set of methods based on our empirical analysis, demonstrating a reduction in computation time by approximately threefold.

Natural Gradient Methods: Perspectives, Efficient-Scalable Approximations, and Analysis

Mar 06, 2023

Rajesh Shrestha

Figures 1-4 for Natural Gradient Methods: Perspectives, Efficient-Scalable Approximations, and Analysis

Abstract: Natural Gradient Descent, a second-order optimization method motivated by information geometry, makes use of the Fisher Information Matrix instead of the Hessian, which is typically used. However, in many cases, the Fisher Information Matrix is equivalent to the Generalized Gauss-Newton matrix, and both approximate the Hessian. It is an appealing alternative to stochastic gradient descent, potentially leading to faster convergence. However, being a second-order method makes it infeasible to use directly in problems with a huge number of parameters and data. This is evident from the deep learning community sticking with the stochastic gradient descent method since the beginning. In this paper, we look at the different perspectives on the natural gradient method, study the current developments on its efficient-scalable empirical approximations, and finally examine their performance with extensive experiments.

* 14 pages
