Picture for Quanxin Shou

Quanxin Shou

OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents

Add code
May 06, 2026
Viaarxiv icon

Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis

Add code
Apr 01, 2026
Viaarxiv icon

HALO: A Unified Vision-Language-Action Model for Embodied Multimodal Chain-of-Thought Reasoning

Add code
Feb 24, 2026
Viaarxiv icon

WMPO: World Model-based Policy Optimization for Vision-Language-Action Models

Add code
Nov 12, 2025
Figure 1 for WMPO: World Model-based Policy Optimization for Vision-Language-Action Models
Figure 2 for WMPO: World Model-based Policy Optimization for Vision-Language-Action Models
Figure 3 for WMPO: World Model-based Policy Optimization for Vision-Language-Action Models
Figure 4 for WMPO: World Model-based Policy Optimization for Vision-Language-Action Models
Viaarxiv icon