Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:CoAR: Concept Injection into Autoregressive Models for Personalized Text-to-Image Generation

Aug 10, 2025

Fangtai Wu, Mushui Liu, Weijie He, Wanggui He, Hao Jiang, Zhao Wang, Yunlong Yu

Figure 1 for CoAR: Concept Injection into Autoregressive Models for Personalized Text-to-Image Generation

Figure 2 for CoAR: Concept Injection into Autoregressive Models for Personalized Text-to-Image Generation

Figure 3 for CoAR: Concept Injection into Autoregressive Models for Personalized Text-to-Image Generation

Figure 4 for CoAR: Concept Injection into Autoregressive Models for Personalized Text-to-Image Generation

Share this with someone who'll enjoy it:

Abstract:The unified autoregressive (AR) model excels at multimodal understanding and generation, but its potential for customized image generation remains underexplored. Existing customized generation methods rely on full fine-tuning or adapters, making them costly and prone to overfitting or catastrophic forgetting. In this paper, we propose \textbf{CoAR}, a novel framework for injecting subject concepts into the unified AR models while keeping all pre-trained parameters completely frozen. CoAR learns effective, specific subject representations with only a minimal number of parameters using a Layerwise Multimodal Context Learning strategy. To address overfitting and language drift, we further introduce regularization that preserves the pre-trained distribution and anchors context tokens to improve subject fidelity and re-contextualization. Additionally, CoAR supports training-free subject customization in a user-provided style. Experiments demonstrate that CoAR achieves superior performance on both subject-driven personalization and style personalization, while delivering significant gains in computational and memory efficiency. Notably, CoAR tunes less than \textbf{0.05\%} of the parameters while achieving competitive performance compared to recent Proxy-Tuning. Code: https://github.com/KZF-kzf/CoAR

View paper on

Share this with someone who'll enjoy it:

Title:CoAR: Concept Injection into Autoregressive Models for Personalized Text-to-Image Generation

Paper and Code