Yuhang Zang

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

Jun 17, 2026
Via arXiv

CapRL++: Unified Reinforcement Learning with Verifiable Rewards for Dense Image and Video Captioning

Jun 08, 2026
Via arXiv

AdaGRPO: A Capability-Aware Adaptive Enhancement for Flow-based GRPO

Jun 05, 2026
Via arXiv

OVO-S-Bench: A Hierarchical Benchmark for Streaming Spatial Intelligence in Multimodal LLMs

Jun 02, 2026
Via arXiv

Pave-GRPO: Beyond Instantaneous Guidance through Principled Average Velocity Decomposition

Jun 01, 2026
Via arXiv

Skill-as-Pseudocode: Refactoring Skill Libraries to Pseudocode for LLM Agents

May 27, 2026
Via arXiv

ETCHR: Editing To Clarify and Harness Reasoning

May 22, 2026
Via arXiv

SetCon: Towards Open-Ended Referring Segmentation via Set-Level Concept Prediction

May 19, 2026
Via arXiv

WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation

May 11, 2026
Via arXiv

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Mar 26, 2026
Via arXiv