Title: AR-GRPO: Training Autoregressive Image Generation Models via Reinforcement Learning

Aug 09, 2025

Shihao Yuan, Yahui Liu, Yang Yue, Jingyuan Zhang, Wangmeng Zuo, Qi Wang, Fuzheng Zhang, Guorui Zhou

Abstract: Inspired by the success of reinforcement learning (RL) in refining large language models (LLMs), we propose AR-GRPO, an approach that integrates online RL training into autoregressive (AR) image generation models. We adapt the Group Relative Policy Optimization (GRPO) algorithm to refine the outputs of vanilla AR models using carefully designed reward functions that evaluate generated images along multiple quality dimensions, including perceptual quality, realism, and semantic fidelity. We conduct comprehensive experiments on both class-conditional (i.e., class-to-image) and text-conditional (i.e., text-to-image) image generation tasks, demonstrating that our RL-enhanced framework significantly improves both the image quality and the human preference scores of generated images compared to standard AR baselines. Our results show consistent improvements across various evaluation metrics, establishing the viability of RL-based optimization for AR image generation and opening new avenues for controllable, high-quality image synthesis. The source code and models are available at: https://github.com/Kwai-Klear/AR-GRPO.

* 27 pages, 15 figures
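
The abstract names two ingredients: rewards that score each generated image along several quality dimensions, and GRPO, which compares samples within a group drawn for the same condition. The sketch below illustrates only the generic group-relative advantage step of GRPO, under stated assumptions. The function names, the reward dimension keys, the equal weights, the example scores, and the use of the population standard deviation are all hypothetical choices made for illustration; the abstract does not specify the paper's actual reward models or weighting.

```python
from statistics import mean, pstdev


def combine_rewards(scores, weights):
    """Weighted sum of per-dimension scores for one generated image.

    Both dictionaries are hypothetical; the paper's real reward
    functions and their weights are not given in the abstract.
    """
    return sum(weights[name] * scores[name] for name in weights)


def group_relative_advantages(rewards, eps=1e-6):
    """Normalize rewards within one group of samples.

    Each sample's advantage is its reward minus the group mean,
    divided by the group standard deviation (population std is an
    assumption here). eps avoids division by zero when all rewards
    in the group are equal.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# Assumed equal weights over the three dimensions named in the abstract.
weights = {"perceptual": 1.0, "realism": 1.0, "semantic": 1.0}

# Made-up scores for a group of four images sampled for one condition.
group_scores = [
    {"perceptual": 0.62, "realism": 0.55, "semantic": 0.70},
    {"perceptual": 0.48, "realism": 0.60, "semantic": 0.52},
    {"perceptual": 0.81, "realism": 0.74, "semantic": 0.77},
    {"perceptual": 0.35, "realism": 0.41, "semantic": 0.58},
]

rewards = [combine_rewards(s, weights) for s in group_scores]
advantages = group_relative_advantages(rewards)

for r, a in zip(rewards, advantages):
    print(f"reward={r:.2f}  advantage={a:+.3f}")
```

Samples that score above their group's mean receive a positive advantage and those below receive a negative one, so no separate learned value function is needed to form the baseline. In a full training loop these advantages would weight the policy-gradient update on the image tokens of each sample; that part is omitted here.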
