Picture for Chuanhao Li

Chuanhao Li

ProSoftArena: Benchmarking Hierarchical Capabilities of Multimodal Agents in Professional Software Environments

Add code
Dec 30, 2025
Viaarxiv icon

Yume-1.5: A Text-Controlled Interactive World Generation Model

Add code
Dec 26, 2025
Viaarxiv icon

SVBench: Evaluation of Video Generation Models on Social Reasoning

Add code
Dec 25, 2025
Viaarxiv icon

Code-in-the-Loop Forensics: Agentic Tool Use for Image Forgery Detection

Add code
Dec 18, 2025
Figure 1 for Code-in-the-Loop Forensics: Agentic Tool Use for Image Forgery Detection
Figure 2 for Code-in-the-Loop Forensics: Agentic Tool Use for Image Forgery Detection
Figure 3 for Code-in-the-Loop Forensics: Agentic Tool Use for Image Forgery Detection
Figure 4 for Code-in-the-Loop Forensics: Agentic Tool Use for Image Forgery Detection
Viaarxiv icon

Composition-Incremental Learning for Compositional Generalization

Add code
Nov 12, 2025
Viaarxiv icon

From Pixels to Paths: A Multi-Agent Framework for Editable Scientific Illustration

Add code
Oct 31, 2025
Viaarxiv icon

Dialogue as Discovery: Navigating Human Intent Through Principled Inquiry

Add code
Oct 31, 2025
Viaarxiv icon

MDK12-Bench: A Comprehensive Evaluation of Multimodal Large Language Models on Multidisciplinary Exams

Add code
Aug 09, 2025
Viaarxiv icon

Yume: An Interactive World Generation Model

Add code
Jul 23, 2025
Figure 1 for Yume: An Interactive World Generation Model
Figure 2 for Yume: An Interactive World Generation Model
Figure 3 for Yume: An Interactive World Generation Model
Figure 4 for Yume: An Interactive World Generation Model
Viaarxiv icon

Sekai: A Video Dataset towards World Exploration

Add code
Jun 18, 2025
Viaarxiv icon