Xiaodan Liang

CARE What Fails: Contrastive Anchored-REflection for Verifiable Multimodal

Dec 22, 2025

OmniGen: Unified Multimodal Sensor Generation for Autonomous Driving

Dec 16, 2025

GLaD: Geometric Latent Distillation for Vision-Language-Action Models

Dec 10, 2025

DirectSwap: Mask-Free Cross-Identity Training and Benchmarking for Expression-Consistent Video Head Swapping

Dec 10, 2025

SpatialDreamer: Incentivizing Spatial Reasoning via Active Mental Imagery

Dec 08, 2025

Video Spatial Reasoning with Object-Centric 3D Rollout

Nov 17, 2025

GRPO-Guard: Mitigating Implicit Over-Optimization in Flow Matching via Regulated Clipping

Oct 25, 2025

Aligning Perception, Reasoning, Modeling and Interaction: A Survey on Physical AI

Oct 06, 2025

Embodied Arena: A Comprehensive, Unified, and Evolving Evaluation Platform for Embodied AI

Sep 18, 2025

LaVieID: Local Autoregressive Diffusion Transformers for Identity-Preserving Video Creation

Aug 11, 2025