Picture for Yuhang Zang

Yuhang Zang

$\text{G}^2$RPO: Granular GRPO for Precise Reward in Flow Models

Add code
Oct 02, 2025
Viaarxiv icon

CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning

Add code
Sep 26, 2025
Viaarxiv icon

SPARK: Synergistic Policy And Reward Co-Evolving Framework

Add code
Sep 26, 2025
Figure 1 for SPARK: Synergistic Policy And Reward Co-Evolving Framework
Figure 2 for SPARK: Synergistic Policy And Reward Co-Evolving Framework
Figure 3 for SPARK: Synergistic Policy And Reward Co-Evolving Framework
Figure 4 for SPARK: Synergistic Policy And Reward Co-Evolving Framework
Viaarxiv icon

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Add code
Sep 26, 2025
Viaarxiv icon

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Add code
Aug 28, 2025
Viaarxiv icon

CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning

Add code
Aug 27, 2025
Viaarxiv icon

DiCache: Let Diffusion Model Determine Its Own Cache

Add code
Aug 24, 2025
Figure 1 for DiCache: Let Diffusion Model Determine Its Own Cache
Figure 2 for DiCache: Let Diffusion Model Determine Its Own Cache
Figure 3 for DiCache: Let Diffusion Model Determine Its Own Cache
Figure 4 for DiCache: Let Diffusion Model Determine Its Own Cache
Viaarxiv icon

SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience

Add code
Aug 06, 2025
Viaarxiv icon

Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models

Add code
Aug 01, 2025
Viaarxiv icon

Bootstrapping Grounded Chain-of-Thought in Multimodal LLMs for Data-Efficient Model Adaptation

Add code
Jul 03, 2025
Viaarxiv icon