Picture for Hao Wu

Hao Wu

Member, IEEE

ToolSelf: Unifying Task Execution and Self-Reconfiguration via Tool-Driven Intrinsic Adaptation

Add code
Feb 08, 2026
Viaarxiv icon

ViCA: Efficient Multimodal LLMs with Vision-Only Cross-Attention

Add code
Feb 07, 2026
Viaarxiv icon

WADEPre: A Wavelet-based Decomposition Model for Extreme Precipitation Nowcasting with Multi-Scale Learning

Add code
Feb 02, 2026
Viaarxiv icon

Do Models Hear Like Us? Probing the Representational Alignment of Audio LLMs and Naturalistic EEG

Add code
Jan 23, 2026
Viaarxiv icon

Zero-Permission Manipulation: Can We Trust Large Multimodal Model Powered GUI Agents?

Add code
Jan 18, 2026
Viaarxiv icon

Speak While Watching: Unleashing TRUE Real-Time Video Understanding Capability of Multimodal Large Language Models

Add code
Jan 11, 2026
Viaarxiv icon

FaST: Efficient and Effective Long-Horizon Forecasting for Large-Scale Spatial-Temporal Graphs via Mixture-of-Experts

Add code
Jan 08, 2026
Viaarxiv icon

MiMo-V2-Flash Technical Report

Add code
Jan 08, 2026
Viaarxiv icon

Advanced Global Wildfire Activity Modeling with Hierarchical Graph ODE

Add code
Jan 04, 2026
Viaarxiv icon

Bridging the Perception-Cognition Gap:Re-engineering SAM2 with Hilbert-Mamba for Robust VLM-based Medical Diagnosis

Add code
Dec 30, 2025
Viaarxiv icon