Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Xiangkun Chen

DiffHash: Text-Guided Targeted Attack via Diffusion Models against Deep Hashing Image Retrieval

Sep 16, 2025

Zechao Liu, Zheng Zhou, Xiangkun Chen, Tao Liang, Dapeng Lang

Figure 1 for DiffHash: Text-Guided Targeted Attack via Diffusion Models against Deep Hashing Image Retrieval

Figure 2 for DiffHash: Text-Guided Targeted Attack via Diffusion Models against Deep Hashing Image Retrieval

Figure 3 for DiffHash: Text-Guided Targeted Attack via Diffusion Models against Deep Hashing Image Retrieval

Figure 4 for DiffHash: Text-Guided Targeted Attack via Diffusion Models against Deep Hashing Image Retrieval

Abstract:Deep hashing models have been widely adopted to tackle the challenges of large-scale image retrieval. However, these approaches face serious security risks due to their vulnerability to adversarial examples. Despite the increasing exploration of targeted attacks on deep hashing models, existing approaches still suffer from a lack of multimodal guidance, reliance on labeling information and dependence on pixel-level operations for attacks. To address these limitations, we proposed DiffHash, a novel diffusion-based targeted attack for deep hashing. Unlike traditional pixel-based attacks that directly modify specific pixels and lack multimodal guidance, our approach focuses on optimizing the latent representations of images, guided by text information generated by a Large Language Model (LLM) for the target image. Furthermore, we designed a multi-space hash alignment network to align the high-dimension image space and text space to the low-dimension binary hash space. During reconstruction, we also incorporated text-guided attention mechanisms to refine adversarial examples, ensuring them aligned with the target semantics while maintaining visual plausibility. Extensive experiments have demonstrated that our method outperforms state-of-the-art (SOTA) targeted attack methods, achieving better black-box transferability and offering more excellent stability across datasets.

Via

Access Paper or Ask Questions