Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Locality in Image Diffusion Models Emerges from Data Statistics

Sep 11, 2025

Artem Lukoianov, Chenyang Yuan, Justin Solomon, Vincent Sitzmann

Figure 1 for Locality in Image Diffusion Models Emerges from Data Statistics

Figure 2 for Locality in Image Diffusion Models Emerges from Data Statistics

Figure 3 for Locality in Image Diffusion Models Emerges from Data Statistics

Figure 4 for Locality in Image Diffusion Models Emerges from Data Statistics

Share this with someone who'll enjoy it:

Abstract:Among generative models, diffusion models are uniquely intriguing due to the existence of a closed-form optimal minimizer of their training objective, often referred to as the optimal denoiser. However, diffusion using this optimal denoiser merely reproduces images in the training set and hence fails to capture the behavior of deep diffusion models. Recent work has attempted to characterize this gap between the optimal denoiser and deep diffusion models, proposing analytical, training-free models that can generate images that resemble those generated by a trained UNet. The best-performing method hypothesizes that shift equivariance and locality inductive biases of convolutional neural networks are the cause of the performance gap, hence incorporating these assumptions into its analytical model. In this work, we present evidence that the locality in deep diffusion models emerges as a statistical property of the image dataset, not due to the inductive bias of convolutional neural networks. Specifically, we demonstrate that an optimal parametric linear denoiser exhibits similar locality properties to the deep neural denoisers. We further show, both theoretically and experimentally, that this locality arises directly from the pixel correlations present in natural image datasets. Finally, we use these insights to craft an analytical denoiser that better matches scores predicted by a deep diffusion model than the prior expert-crafted alternative.

* 30 pages, 18 figures, 6 tables

View paper on

Share this with someone who'll enjoy it:

Title:Locality in Image Diffusion Models Emerges from Data Statistics

Paper and Code