Picture for Jianyu Chen

Jianyu Chen

VLAW: Iterative Co-Improvement of Vision-Language-Action Policy and World Model

Add code
Feb 15, 2026
Viaarxiv icon

BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation

Add code
Feb 11, 2026
Viaarxiv icon

VLM4VLA: Revisiting Vision-Language-Models in Vision-Language-Action Models

Add code
Jan 06, 2026
Viaarxiv icon

villa-X: Enhancing Latent Action Modeling in Vision-Language-Action Models

Add code
Jul 31, 2025
Viaarxiv icon

Learning Generalizable Robot Policy with Human Demonstration Video as a Prompt

Add code
May 27, 2025
Figure 1 for Learning Generalizable Robot Policy with Human Demonstration Video as a Prompt
Figure 2 for Learning Generalizable Robot Policy with Human Demonstration Video as a Prompt
Figure 3 for Learning Generalizable Robot Policy with Human Demonstration Video as a Prompt
Figure 4 for Learning Generalizable Robot Policy with Human Demonstration Video as a Prompt
Viaarxiv icon

MARGE: Improving Math Reasoning for LLMs with Guided Exploration

Add code
May 18, 2025
Figure 1 for MARGE: Improving Math Reasoning for LLMs with Guided Exploration
Figure 2 for MARGE: Improving Math Reasoning for LLMs with Guided Exploration
Figure 3 for MARGE: Improving Math Reasoning for LLMs with Guided Exploration
Figure 4 for MARGE: Improving Math Reasoning for LLMs with Guided Exploration
Viaarxiv icon

Whleaper: A 10-DOF Flexible Bipedal Wheeled Robot

Add code
Apr 30, 2025
Figure 1 for Whleaper: A 10-DOF Flexible Bipedal Wheeled Robot
Figure 2 for Whleaper: A 10-DOF Flexible Bipedal Wheeled Robot
Figure 3 for Whleaper: A 10-DOF Flexible Bipedal Wheeled Robot
Figure 4 for Whleaper: A 10-DOF Flexible Bipedal Wheeled Robot
Viaarxiv icon

UP-VLA: A Unified Understanding and Prediction Model for Embodied Agent

Add code
Jan 31, 2025
Figure 1 for UP-VLA: A Unified Understanding and Prediction Model for Embodied Agent
Figure 2 for UP-VLA: A Unified Understanding and Prediction Model for Embodied Agent
Figure 3 for UP-VLA: A Unified Understanding and Prediction Model for Embodied Agent
Figure 4 for UP-VLA: A Unified Understanding and Prediction Model for Embodied Agent
Viaarxiv icon

Improving Vision-Language-Action Model with Online Reinforcement Learning

Add code
Jan 28, 2025
Figure 1 for Improving Vision-Language-Action Model with Online Reinforcement Learning
Figure 2 for Improving Vision-Language-Action Model with Online Reinforcement Learning
Figure 3 for Improving Vision-Language-Action Model with Online Reinforcement Learning
Figure 4 for Improving Vision-Language-Action Model with Online Reinforcement Learning
Viaarxiv icon

AIF-SFDA: Autonomous Information Filter-driven Source-Free Domain Adaptation for Medical Image Segmentation

Add code
Jan 06, 2025
Figure 1 for AIF-SFDA: Autonomous Information Filter-driven Source-Free Domain Adaptation for Medical Image Segmentation
Figure 2 for AIF-SFDA: Autonomous Information Filter-driven Source-Free Domain Adaptation for Medical Image Segmentation
Figure 3 for AIF-SFDA: Autonomous Information Filter-driven Source-Free Domain Adaptation for Medical Image Segmentation
Figure 4 for AIF-SFDA: Autonomous Information Filter-driven Source-Free Domain Adaptation for Medical Image Segmentation
Viaarxiv icon