Picture for Xiaodan Liang

Xiaodan Liang

RePO-VLA: Recovery-Driven Policy Optimization for Vision-Language-Action Models

Add code
May 10, 2026
Viaarxiv icon

SeePhys Pro: Diagnosing Modality Transfer and Blind-Training Effects in Multimodal RLVR for Physics Reasoning

Add code
May 10, 2026
Viaarxiv icon

A1: A Fully Transparent Open-Source, Adaptive and Efficient Truncated Vision-Language-Action Model

Add code
Apr 07, 2026
Viaarxiv icon

ManipArena: Comprehensive Real-world Evaluation of Reasoning-Oriented Generalist Robot Manipulation

Add code
Mar 30, 2026
Viaarxiv icon

From Pixels to Digital Agents: An Empirical Study on the Taxonomy and Technological Trends of Reinforcement Learning Environments

Add code
Mar 25, 2026
Viaarxiv icon

MagicSeg: Open-World Segmentation Pretraining via Counterfactural Diffusion-Based Auto-Generation

Add code
Mar 20, 2026
Viaarxiv icon

AnyCrowd: Instance-Isolated Identity-Pose Binding for Arbitrary Multi-Character Animation

Add code
Mar 16, 2026
Viaarxiv icon

World2Act: Latent Action Post-Training via Skill-Compositional World Models

Add code
Mar 11, 2026
Viaarxiv icon

Implicit Geometry Representations for Vision-and-Language Navigation from Web Videos

Add code
Mar 10, 2026
Viaarxiv icon

Choose What to Observe: Task-Aware Semantic-Geometric Representations for Visuomotor Policy

Add code
Mar 09, 2026
Viaarxiv icon