Picture for Yufei Liu

Yufei Liu

OmniDrive: An LLM-Choreographed Multi-Agent World Model with Unified Latent Co-Compression for Multi-View Driving Video Generation

Add code
Jun 16, 2026
Viaarxiv icon

TRIDENT: Breaking the Hybrid-Safety-Physics Coupling for Provably Safe Multi-Agent Reinforcement Learning

Add code
Jun 16, 2026
Viaarxiv icon

ACE-Ego-0: Unifying Egocentric Human and Robotic Data for VLA Pretraining

Add code
Jun 15, 2026
Viaarxiv icon

ARGUS: Stacked Multi-View Identity Mosaic Injection for Subject-Preserving Video Generation

Add code
Jun 10, 2026
Viaarxiv icon

VTouch++: A Multimodal Dataset with Vision-Based Tactile Enhancement for Bimanual Manipulation

Add code
Apr 22, 2026
Viaarxiv icon

GeoLoco: Leveraging 3D Geometric Priors from Visual Foundation Model for Robust RGB-Only Humanoid Locomotion

Add code
Mar 08, 2026
Viaarxiv icon

Precision over Diversity: High-Precision Reward Generalizes to Robust Instruction Following

Add code
Jan 08, 2026
Viaarxiv icon

InternVLA-A1: Unifying Understanding, Generation and Action for Robotic Manipulation

Add code
Jan 05, 2026
Viaarxiv icon

Next Tokens Denoising for Speech Synthesis

Add code
Jul 30, 2025
Viaarxiv icon

SynPo: Boosting Training-Free Few-Shot Medical Segmentation via High-Quality Negative Prompts

Add code
Jun 18, 2025
Viaarxiv icon