Xin Ye

Arizona State University

EmbodiedMidtrain: Bridging the Gap between Vision-Language Models and Vision-Language-Action Models via Mid-training

Apr 21, 2026

ExploreVLA: Dense World Modeling and Exploration for End-to-End Autonomous Driving

Apr 03, 2026

Reconstruction Matters: Learning Geometry-Aligned BEV Representation through 3D Gaussian Splatting

Mar 19, 2026

EvoPrune: Early-Stage Visual Token Pruning for Efficient MLLMs

Mar 04, 2026

ALPBench: A Benchmark for Attribution-level Long-term Personal Behavior Understanding

Feb 03, 2026

When to Invoke: Refining LLM Fairness with Toxicity Assessment

Jan 14, 2026

MoE-DisCo: Low Economy Cost Training Mixture-of-Experts Models

Jan 11, 2026

UniDrive-WM: Unified Understanding, Planning and Generation World Model For Autonomous Driving

Jan 07, 2026

Fast Quiet-STaR: Thinking Without Thought Tokens

May 23, 2025

LTDA-Drive: LLMs-guided Generative Models based Long-tail Data Augmentation for Autonomous Driving

May 21, 2025