Picture for Ziwei Liu

Ziwei Liu

Nanyang Technological University

FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model

Add code
Jul 02, 2025
Viaarxiv icon

ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

Add code
Jun 26, 2025
Viaarxiv icon

MMSearch-R1: Incentivizing LMMs to Search

Add code
Jun 25, 2025
Viaarxiv icon

Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning

Add code
Jun 16, 2025
Viaarxiv icon

Branch, or Layer? Zeroth-Order Optimization for Continual Learning of Vision-Language Models

Add code
Jun 14, 2025
Viaarxiv icon

Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers

Add code
Jun 09, 2025
Viaarxiv icon

GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior

Add code
Jun 09, 2025
Viaarxiv icon

Video World Models with Long-term Spatial Memory

Add code
Jun 05, 2025
Viaarxiv icon

HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation

Add code
Jun 04, 2025
Viaarxiv icon

Research on feature fusion and multimodal patent text based on graph attention network

Add code
May 26, 2025
Viaarxiv icon