Picture for Yizhou Wang

Yizhou Wang

Inline Critic Steers Image Editing

Add code
May 12, 2026
Viaarxiv icon

Beyond Thinking: Imagining in 360$^\circ$ for Humanoid Visual Search

Add code
May 09, 2026
Viaarxiv icon

Hierarchical Visual Agent: Managing Contexts in Joint Image-Text Space for Advanced Chart Reasoning

Add code
May 05, 2026
Viaarxiv icon

GazeVLA: Learning Human Intention for Robotic Manipulation

Add code
Apr 24, 2026
Viaarxiv icon

AdaTracker: Learning Adaptive In-Context Policy for Cross-Embodiment Active Visual Tracking

Add code
Apr 22, 2026
Viaarxiv icon

EgoSelf: From Memory to Personalized Egocentric Assistant

Add code
Apr 22, 2026
Viaarxiv icon

Distorted or Fabricated? A Survey on Hallucination in Video LLMs

Add code
Apr 14, 2026
Viaarxiv icon

Visually-grounded Humanoid Agents

Add code
Apr 09, 2026
Viaarxiv icon

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Add code
Mar 26, 2026
Viaarxiv icon

Less Data, Faster Convergence: Goal-Driven Data Optimization for Multimodal Instruction Tuning

Add code
Mar 12, 2026
Viaarxiv icon