Picture for Yuhang Zang

Yuhang Zang

Visual-ERM: Reward Modeling for Visual Equivalence

Add code
Mar 13, 2026
Viaarxiv icon

From Sparse to Dense: Multi-View GRPO for Flow Models via Augmented Condition Space

Add code
Mar 13, 2026
Viaarxiv icon

EndoCoT: Scaling Endogenous Chain-of-Thought Reasoning in Diffusion Models

Add code
Mar 12, 2026
Viaarxiv icon

Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation

Add code
Mar 12, 2026
Viaarxiv icon

Visual Self-Refine: A Pixel-Guided Paradigm for Accurate Chart Parsing

Add code
Feb 18, 2026
Viaarxiv icon

Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition

Add code
Feb 09, 2026
Viaarxiv icon

Unified Personalized Reward Model for Vision Generation

Add code
Feb 02, 2026
Viaarxiv icon

EMemBench: Interactive Benchmarking of Episodic Memory for VLM Agents

Add code
Jan 23, 2026
Viaarxiv icon

Think Visually, Reason Textually: Vision-Language Synergy in ARC

Add code
Nov 19, 2025
Figure 1 for Think Visually, Reason Textually: Vision-Language Synergy in ARC
Figure 2 for Think Visually, Reason Textually: Vision-Language Synergy in ARC
Figure 3 for Think Visually, Reason Textually: Vision-Language Synergy in ARC
Figure 4 for Think Visually, Reason Textually: Vision-Language Synergy in ARC
Viaarxiv icon

Generative Photographic Control for Scene-Consistent Video Cinematic Editing

Add code
Nov 17, 2025
Viaarxiv icon