Chuanhao Li

Yume-1.5: A Text-Controlled Interactive World Generation Model

Dec 26, 2025
Via arXiv

SVBench: Evaluation of Video Generation Models on Social Reasoning

Dec 25, 2025
Via arXiv

Code-in-the-Loop Forensics: Agentic Tool Use for Image Forgery Detection

Dec 18, 2025
Via arXiv

Composition-Incremental Learning for Compositional Generalization

Nov 12, 2025
Via arXiv

Dialogue as Discovery: Navigating Human Intent Through Principled Inquiry

Oct 31, 2025
Via arXiv

From Pixels to Paths: A Multi-Agent Framework for Editable Scientific Illustration

Oct 31, 2025
Via arXiv

MDK12-Bench: A Comprehensive Evaluation of Multimodal Large Language Models on Multidisciplinary Exams

Aug 09, 2025
Via arXiv

Yume: An Interactive World Generation Model

Jul 23, 2025
Via arXiv

Sekai: A Video Dataset towards World Exploration

Jun 18, 2025
Via arXiv

A High-Quality Dataset and Reliable Evaluation for Interleaved Image-Text Generation

Jun 11, 2025
Via arXiv