Yueting Zhuang

VisualThink-VLA: Visual Intermediate Reasoning for Effective and Low-Latency Vision-Language-Action Policies

May 28, 2026
Via arXiv

MAIGO: Mitigating Lost-in-Conversation with History-Cleaned On-Policy Self-Distillation

May 26, 2026
Via arXiv

InstructSAM: Segment Any Instance with Any Instructions

May 25, 2026
Via arXiv

CrossView Suite: Harnessing Cross-view Spatial Intelligence of MLLMs with Dataset, Model and Benchmark

May 18, 2026
Via arXiv

Self-Distilled Agentic Reinforcement Learning

May 14, 2026
Via arXiv

Milestone-Guided Policy Learning for Long-Horizon Language Agents

May 07, 2026
Via arXiv

SpatialEvo: Self-Evolving Spatial Intelligence via Deterministic Geometric Environments

Apr 15, 2026
Via arXiv

UI-Zoomer: Uncertainty-Driven Adaptive Zoom-In for GUI Grounding

Apr 15, 2026
Via arXiv

UI-Copilot: Advancing Long-Horizon GUI Automation via Tool-Integrated Policy Optimization

Apr 15, 2026
Via arXiv

ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents

Apr 13, 2026
Via arXiv