Jianyu Chen

villa-X: Enhancing Latent Action Modeling in Vision-Language-Action Models

Jul 31, 2025
Via arXiv

Learning Generalizable Robot Policy with Human Demonstration Video as a Prompt

May 27, 2025
Via arXiv

MARGE: Improving Math Reasoning for LLMs with Guided Exploration

May 18, 2025
Via arXiv

Whleaper: A 10-DOF Flexible Bipedal Wheeled Robot

Apr 30, 2025
Via arXiv

UP-VLA: A Unified Understanding and Prediction Model for Embodied Agent

Jan 31, 2025
Via arXiv

Improving Vision-Language-Action Model with Online Reinforcement Learning

Jan 28, 2025
Via arXiv

AIF-SFDA: Autonomous Information Filter-driven Source-Free Domain Adaptation for Medical Image Segmentation

Jan 06, 2025
Via arXiv

Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations

Dec 19, 2024
Via arXiv

Prediction with Action: Visual Policy Learning via Joint Denoising Process

Nov 27, 2024
Via arXiv

Advancing Humanoid Locomotion: Mastering Challenging Terrains with Denoising World Model Learning

Aug 26, 2024
Via arXiv