Picture for Ruqi Zhang

Ruqi Zhang

Efficient and Explainable End-to-End Autonomous Driving via Masked Vision-Language-Action Diffusion

Add code
Feb 24, 2026
Viaarxiv icon

Why Any-Order Autoregressive Models Need Two-Stream Attention: A Structural-Semantic Tradeoff

Add code
Feb 17, 2026
Viaarxiv icon

Learning Self-Correction in Vision-Language Models via Rollout Augmentation

Add code
Feb 09, 2026
Viaarxiv icon

Modular Safety Guardrails Are Necessary for Foundation-Model-Enabled Robots in the Real World

Add code
Feb 03, 2026
Viaarxiv icon

CANDI: Hybrid Discrete-Continuous Diffusion Models

Add code
Oct 26, 2025
Viaarxiv icon

Reward-Shifted Speculative Sampling Is An Efficient Test-Time Weak-to-Strong Aligner

Add code
Aug 20, 2025
Viaarxiv icon

ViLaD: A Large Vision Language Diffusion Framework for End-to-End Autonomous Driving

Add code
Aug 18, 2025
Viaarxiv icon

Stacey: Promoting Stochastic Steepest Descent via Accelerated $\ell_p$-Smooth Nonconvex Optimization

Add code
Jun 07, 2025
Viaarxiv icon

Inference Acceleration of Autoregressive Normalizing Flows by Selective Jacobi Decoding

Add code
May 30, 2025
Viaarxiv icon

Sherlock: Self-Correcting Reasoning in Vision-Language Models

Add code
May 28, 2025
Viaarxiv icon