Picture for Xintong Hu

Xintong Hu

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Add code
May 28, 2026
Viaarxiv icon

FineVLA: Fine-Grained Instruction Alignment for Steerable Vision-Language-Action Policies

Add code
May 26, 2026
Viaarxiv icon

Tree of Preferences for Diversified Recommendation

Add code
Dec 24, 2025
Viaarxiv icon

Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis

Add code
Jul 08, 2025
Viaarxiv icon

Guiding LLM-based Smart Contract Generation with Finite State Machine

Add code
May 13, 2025
Viaarxiv icon