Picture for Ming-Yu Liu

Ming-Yu Liu

Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music

Add code
Apr 13, 2026
Viaarxiv icon

AVO: Agentic Variation Operators for Autonomous Evolutionary Search

Add code
Mar 25, 2026
Viaarxiv icon

VLM-AutoDrive: Post-Training Vision-Language Models for Safety-Critical Autonomous Driving Events

Add code
Mar 18, 2026
Viaarxiv icon

LDA-1B: Scaling Latent Dynamics Action Model via Universal Embodied Data Ingestion

Add code
Feb 12, 2026
Viaarxiv icon

SAGE: Scalable Agentic 3D Scene Generation for Embodied AI

Add code
Feb 10, 2026
Viaarxiv icon

DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos

Add code
Feb 06, 2026
Viaarxiv icon

DuoGen: Towards General Purpose Interleaved Multimodal Generation

Add code
Feb 03, 2026
Viaarxiv icon

Cosmos Policy: Fine-Tuning Video Models for Visuomotor Control and Planning

Add code
Jan 22, 2026
Viaarxiv icon

VibeTensor: System Software for Deep Learning, Fully Generated by AI Agents

Add code
Jan 21, 2026
Viaarxiv icon

PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation

Add code
Jan 07, 2026
Viaarxiv icon