Picture for Yucheng Zhao

Yucheng Zhao

Robotic Scene Cloning:Advancing Zero-Shot Robotic Scene Adaptation in Manipulation via Visual Prompt Editing

Add code
Mar 10, 2026
Viaarxiv icon

DM0: An Embodied-Native Vision-Language-Action Model towards Physical AI

Add code
Feb 16, 2026
Viaarxiv icon

IntentionVLA: Generalizable and Efficient Embodied Intention Reasoning for Human-Robot Interaction

Add code
Oct 09, 2025
Figure 1 for IntentionVLA: Generalizable and Efficient Embodied Intention Reasoning for Human-Robot Interaction
Figure 2 for IntentionVLA: Generalizable and Efficient Embodied Intention Reasoning for Human-Robot Interaction
Figure 3 for IntentionVLA: Generalizable and Efficient Embodied Intention Reasoning for Human-Robot Interaction
Figure 4 for IntentionVLA: Generalizable and Efficient Embodied Intention Reasoning for Human-Robot Interaction
Viaarxiv icon

LLaDA-VLA: Vision Language Diffusion Action Models

Add code
Sep 10, 2025
Viaarxiv icon

Holistic Tokenizer for Autoregressive Image Generation

Add code
Jul 03, 2025
Viaarxiv icon

ROSA: Harnessing Robot States for Vision-Language and Action Alignment

Add code
Jun 16, 2025
Viaarxiv icon

PADriver: Towards Personalized Autonomous Driving

Add code
May 08, 2025
Viaarxiv icon

Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness

Add code
Apr 02, 2025
Figure 1 for Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness
Figure 2 for Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness
Figure 3 for Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness
Figure 4 for Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness
Viaarxiv icon

Reconstructive Visual Instruction Tuning

Add code
Oct 12, 2024
Viaarxiv icon

Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving?

Add code
May 28, 2024
Viaarxiv icon