Picture for Xiaopeng Lin

Xiaopeng Lin

IntentVLA: Short-Horizon Intent Modeling for Aliased Robot Manipulation

Add code
May 14, 2026
Viaarxiv icon

FrameSkip: Learning from Fewer but More Informative Frames in VLA Training

Add code
May 13, 2026
Viaarxiv icon

MDN: Parallelizing Stepwise Momentum for Delta Linear Attention

Add code
May 07, 2026
Viaarxiv icon

3D-Mix for VLA: A Plug-and-Play Module for Integrating VGGT-based 3D Information into Vision-Language-Action Models

Add code
Mar 25, 2026
Viaarxiv icon

PEARL: Personalized Streaming Video Understanding Model

Add code
Mar 20, 2026
Viaarxiv icon

ScalSelect: Scalable Training-Free Multimodal Data Selection for Efficient Visual Instruction Tuning

Add code
Feb 12, 2026
Viaarxiv icon

AD-MIR: Bridging the Gap from Perception to Persuasion in Advertising Video Understanding via Structured Reasoning

Add code
Feb 07, 2026
Viaarxiv icon

LangForce: Bayesian Decomposition of Vision Language Action Models via Latent Action Queries

Add code
Jan 27, 2026
Viaarxiv icon

BayesianVLA: Bayesian Decomposition of Vision Language Action Models via Latent Action Queries

Add code
Jan 21, 2026
Viaarxiv icon

TwinBrainVLA: Unleashing the Potential of Generalist VLMs for Embodied Tasks via Asymmetric Mixture-of-Transformers

Add code
Jan 20, 2026
Viaarxiv icon