Donglin Wang

Balancing Signal and Variance: Adaptive Offline RL Post-Training for VLA Flow Models

Sep 04, 2025

Long-VLA: Unleashing Long-Horizon Capability of Vision Language Action Model for Robot Manipulation

Aug 28, 2025

ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver

Aug 14, 2025

GCHR: Goal-Conditioned Hindsight Regularization for Sample-Efficient Reinforcement Learning

Aug 08, 2025

Multi-Task Multi-Agent Reinforcement Learning via Skill Graphs

Jul 09, 2025

CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding

Jun 16, 2025

RationalVLA: A Rational Vision-Language-Action Model with Dual System

Jun 12, 2025

Efficient Online RL Fine Tuning with Offline Pre-trained Policy Only

May 22, 2025

VARD: Efficient and Dense Fine-Tuning for Diffusion Models with Value-based RL

May 21, 2025

SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning

May 18, 2025