Zhuoyang Liu

Look Before Acting: Enhancing Vision Foundation Representations for Vision-Language-Action Models

Mar 17, 2026
Via arXiv

Learnable Template Matching Approach for Micro-Deformation Monitoring based on Integrated Sensing and Communication Platform

Mar 12, 2026
Via arXiv

TwinRL-VLA: Digital Twin-Driven Reinforcement Learning for Real-World Robotic Manipulation

Feb 09, 2026
Via arXiv

RoboMIND 2.0: A Multimodal, Bimanual Mobile Manipulation Dataset for Generalizable Embodied Intelligence

Dec 31, 2025
Via arXiv

Aerial Vision-Language Navigation with a Unified Framework for Spatial, Temporal and Embodied Reasoning

Dec 09, 2025
Via arXiv

MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation

Sep 30, 2025
Via arXiv

AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation

Jul 02, 2025
Via arXiv

H2R: A Human-to-Robot Data Augmentation for Robot Pre-training from Videos

May 17, 2025
Via arXiv

HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model

Mar 13, 2025
Via arXiv

Holographic Intelligence Surface Assisted Integrated Sensing and Communication

Jun 07, 2024
Via arXiv