Picture for Haodong Li

Haodong Li

PEARL: Personalized Streaming Video Understanding Model

Add code
Mar 20, 2026
Viaarxiv icon

DVD: Deterministic Video Depth Estimation with Generative Priors

Add code
Mar 12, 2026
Viaarxiv icon

WebVR: Benchmarking Multimodal LLMs for WebPage Recreation from Videos via Human-Aligned Visual Rubrics

Add code
Mar 11, 2026
Viaarxiv icon

CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation

Add code
Mar 09, 2026
Viaarxiv icon

UniM: A Unified Any-to-Any Interleaved Multimodal Benchmark

Add code
Mar 05, 2026
Viaarxiv icon

Proact-VL: A Proactive VideoLLM for Real-Time AI Companions

Add code
Mar 03, 2026
Viaarxiv icon

GENIUS: Generative Fluid Intelligence Evaluation Suite

Add code
Feb 11, 2026
Viaarxiv icon

Chain of Mindset: Reasoning with Adaptive Cognitive Modes

Add code
Feb 10, 2026
Viaarxiv icon

GEBench: Benchmarking Image Generation Models as GUI Environments

Add code
Feb 09, 2026
Viaarxiv icon

Rolling Sink: Bridging Limited-Horizon Training and Open-Ended Testing in Autoregressive Video Diffusion

Add code
Feb 08, 2026
Viaarxiv icon