Title: DIVE: Inverting Conditional Diffusion Models for Discriminative Tasks

Apr 24, 2025

Yinqi Li, Hong Chang, Ruibing Hou, Shiguang Shan, Xilin Chen

Abstract: Diffusion models have shown remarkable progress in various generative tasks such as image and video generation. This paper studies the problem of leveraging pretrained diffusion models for discriminative tasks. Specifically, we extend the discriminative capability of pretrained, frozen generative diffusion models from the classification task to the more complex object detection task by "inverting" a pretrained layout-to-image diffusion model. To this end, we propose a gradient-based discrete optimization approach that replaces the heavy prediction enumeration process, and a prior distribution model that makes more accurate use of Bayes' rule. Empirical results show that this method is on par with basic discriminative object detection baselines on the COCO dataset. In addition, our method greatly speeds up the previous diffusion-based method for classification without sacrificing accuracy. Code and models are available at https://github.com/LiYinqi/DIVE.
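As a rough illustration of the Bayes' rule step mentioned in the abstract, the sketch below scores a handful of candidate predictions by combining a likelihood term with a prior. It is not the authors' implementation: the function name `posterior_from_errors`, the toy numbers, and the assumption that the log-likelihood of the image under each candidate is approximated (up to a constant) by the negative denoising error are all illustrative. It also enumerates candidates explicitly, which is the costly step the paper says it replaces with gradient-based optimization.

```python
import math

def posterior_from_errors(denoise_errors, prior):
    """Return p(c | x) for each candidate c via Bayes' rule.

    Assumption (illustrative): log p(x | c) is approximated, up to an
    additive constant, by the negative denoising error for candidate c.
    """
    # log p(x | c) + log p(c), up to a constant shared by all candidates
    log_joint = [-e + math.log(p) for e, p in zip(denoise_errors, prior)]
    # Subtract the maximum before exponentiating for numerical stability
    m = max(log_joint)
    weights = [math.exp(v - m) for v in log_joint]
    total = sum(weights)
    return [w / total for w in weights]

# Hypothetical denoising errors for three candidate labels
errors = [0.90, 0.40, 0.75]

# With a uniform prior, the candidate with the lowest error wins
uniform_post = posterior_from_errors(errors, [1 / 3, 1 / 3, 1 / 3])

# A strongly skewed prior can change which candidate is most probable
skewed_post = posterior_from_errors(errors, [0.98, 0.01, 0.01])
```

The second call shows why the choice of prior matters: the same likelihood terms lead to a different top candidate once the prior is far from uniform.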

* Accepted by IEEE Transactions on Multimedia
