Xiang Zhu

HARP-VLA: Human-Robot Aligned Representation Learning for Vision-Language-Action Model

May 29, 2026

Learning Generalizable Robot Policy with Human Demonstration Video as a Prompt

May 27, 2025

High-Quality Spatial Reconstruction and Orthoimage Generation Using Efficient 2D Gaussian Splatting

Mar 25, 2025

A Multi-Sensor Fusion Approach for Rapid Orthoimage Generation in Large-Scale UAV Mapping

Mar 05, 2025

Video Super-Resolution: All You Need is a Video Diffusion Model

Mar 05, 2025

UP-VLA: A Unified Understanding and Prediction Model for Embodied Agent

Jan 31, 2025

Advancing Humanoid Locomotion: Mastering Challenging Terrains with Denoising World Model Learning

Aug 26, 2024

RCDM: Enabling Robustness for Conditional Diffusion Model

Aug 05, 2024

Feature Re-Embedding: Towards Foundation Model-Level Performance in Computational Pathology

Feb 27, 2024

Stylized Table Tennis Robots Skill Learning with Incomplete Human Demonstrations

Sep 16, 2023