Picture for Xuhong Huang

Xuhong Huang

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Add code
May 28, 2026
Viaarxiv icon

FineVLA: Fine-Grained Instruction Alignment for Steerable Vision-Language-Action Policies

Add code
May 26, 2026
Viaarxiv icon