Xiaodan Liang

PhysEditWorld: A Large-Scale Dataset Toward Physics-Editable World Models

Jun 25, 2026
Via arXiv

Latent Visual States for Efficient Multimodal Reasoning

Jun 23, 2026
Via arXiv

Intend, Reflect, Refine: An Adaptive Multimodal Reflection Framework for Autonomous Driving

Jun 22, 2026
Via arXiv

iTryOn: Mastering Interactive Video Virtual Try-On with Spatial-Semantic Guidance

May 20, 2026
Via arXiv

SeePhys Pro: Diagnosing Modality Transfer and Blind-Training Effects in Multimodal RLVR for Physics Reasoning

May 10, 2026
Via arXiv

RePO-VLA: Recovery-Driven Policy Optimization for Vision-Language-Action Models

May 10, 2026
Via arXiv

A1: A Fully Transparent Open-Source, Adaptive and Efficient Truncated Vision-Language-Action Model

Apr 07, 2026
Via arXiv

ManipArena: Comprehensive Real-world Evaluation of Reasoning-Oriented Generalist Robot Manipulation

Mar 30, 2026
Via arXiv

From Pixels to Digital Agents: An Empirical Study on the Taxonomy and Technological Trends of Reinforcement Learning Environments

Mar 25, 2026
Via arXiv

MagicSeg: Open-World Segmentation Pretraining via Counterfactual Diffusion-Based Auto-Generation

Mar 20, 2026
Via arXiv