Picture for Daili Hua

Daili Hua

LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning

Add code
May 21, 2026
Viaarxiv icon

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Add code
Apr 06, 2026
Viaarxiv icon

Research on World Models Is Not Merely Injecting World Knowledge into Specific Tasks

Add code
Feb 02, 2026
Viaarxiv icon

VABench: A Comprehensive Benchmark for Audio-Video Generation

Add code
Dec 10, 2025
Viaarxiv icon