Picture for Ming-Yu Liu

Ming-Yu Liu

Causal-rCM: A Unified Teacher-Forcing and Self-Forcing Open Recipe for Autoregressive Diffusion Distillation in Streaming Video Generation and Interactive World Models

Add code
Jun 24, 2026
Viaarxiv icon

SC3-Eval: Evaluating Robot Foundation Models via Self-Consistent Video Generation

Add code
Jun 17, 2026
Viaarxiv icon

Cosmos 3: Omnimodal World Models for Physical AI

Add code
Jun 01, 2026
Viaarxiv icon

Benchmarking Single-Factor Physical Video-to-Audio Generation

Add code
May 28, 2026
Viaarxiv icon

Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music

Add code
Apr 13, 2026
Viaarxiv icon

AVO: Agentic Variation Operators for Autonomous Evolutionary Search

Add code
Mar 25, 2026
Viaarxiv icon

VLM-AutoDrive: Post-Training Vision-Language Models for Safety-Critical Autonomous Driving Events

Add code
Mar 18, 2026
Viaarxiv icon

LDA-1B: Scaling Latent Dynamics Action Model via Universal Embodied Data Ingestion

Add code
Feb 12, 2026
Viaarxiv icon

SAGE: Scalable Agentic 3D Scene Generation for Embodied AI

Add code
Feb 10, 2026
Viaarxiv icon

DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos

Add code
Feb 06, 2026
Viaarxiv icon