Binhe Yu

VisualThink-VLA: Visual Intermediate Reasoning for Effective and Low-Latency Vision-Language-Action Policies

May 28, 2026
Via arXiv

LMMs Meet Object-Centric Vision: Understanding, Segmentation, Editing and Generation

Apr 13, 2026
Via arXiv

AnyMS: Bottom-up Attention Decoupling for Layout-guided and Training-free Multi-subject Customization

Dec 29, 2025
Via arXiv