Yucheng Hu

Veo-Act: How Far Can Frontier Video Models Advance Generalizable Robot Manipulation?

Apr 06, 2026
Via arXiv

Realtime-VLA V2: Learning to Run VLAs Fast, Smooth, and Accurate

Mar 27, 2026
Via arXiv

BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation

Feb 11, 2026
Via arXiv

DyTopo: Dynamic Topology Routing for Multi-Agent Reasoning via Semantic Matching

Feb 05, 2026
Via arXiv

CLM-Bench: Benchmarking and Analyzing Cross-lingual Misalignment of LLMs in Knowledge Editing

Jan 24, 2026
Via arXiv

VLM4VLA: Revisiting Vision-Language-Models in Vision-Language-Action Models

Jan 06, 2026
Via arXiv

Rethinking Agent Design: From Top-Down Workflows to Bottom-Up Skill Evolution

May 23, 2025
Via arXiv

UP-VLA: A Unified Understanding and Prediction Model for Embodied Agent

Jan 31, 2025
Via arXiv

Improving Vision-Language-Action Model with Online Reinforcement Learning

Jan 28, 2025
Via arXiv

Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations

Dec 19, 2024
Via arXiv