Xiaoyu Chen

villa-X: Enhancing Latent Action Modeling in Vision-Language-Action Models

Jul 31, 2025
Via arXiv

PIG-Nav: Key Insights for Pretrained Image Goal Navigation Models

Jul 23, 2025
Via arXiv

The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability

Jun 11, 2025
Via arXiv

Automatic Evaluation Metrics for Document-level Translation: Overview, Challenges and Trends

Apr 21, 2025
Via arXiv

A temporal scale transformer framework for precise remaining useful life prediction in fuel cells

Apr 08, 2025
Via arXiv

Near-infrared Image Deblurring and Event Denoising with Synergistic Neuromorphic Imaging

Mar 05, 2025
Via arXiv

BP-GPT: Auditory Neural Decoding Using fMRI-prompted LLM

Feb 21, 2025
Via arXiv

UP-VLA: A Unified Understanding and Prediction Model for Embodied Agent

Jan 31, 2025
Via arXiv

Improving Vision-Language-Action Model with Online Reinforcement Learning

Jan 28, 2025
Via arXiv

Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations

Dec 19, 2024
Via arXiv