Yuhang Zang

Pave-GRPO: Beyond Instantaneous Guidance through Principled Average Velocity Decomposition

Jun 01, 2026
Via arXiv

Skill-as-Pseudocode: Refactoring Skill Libraries to Pseudocode for LLM Agents

May 27, 2026
Via arXiv

ETCHR: Editing To Clarify and Harness Reasoning

May 22, 2026
Via arXiv

SetCon: Towards Open-Ended Referring Segmentation via Set-Level Concept Prediction

May 19, 2026
Via arXiv

WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation

May 11, 2026
Via arXiv

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Mar 26, 2026
Via arXiv

Visual-ERM: Reward Modeling for Visual Equivalence

Mar 13, 2026
Via arXiv

From Sparse to Dense: Multi-View GRPO for Flow Models via Augmented Condition Space

Mar 13, 2026
Via arXiv

Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation

Mar 12, 2026
Via arXiv

EndoCoT: Scaling Endogenous Chain-of-Thought Reasoning in Diffusion Models

Mar 12, 2026
Via arXiv