Picture for Ziwei Liu

Ziwei Liu

Nanyang Technological University

Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning

Add code
Jun 16, 2025
Viaarxiv icon

Branch, or Layer? Zeroth-Order Optimization for Continual Learning of Vision-Language Models

Add code
Jun 14, 2025
Viaarxiv icon

Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers

Add code
Jun 09, 2025
Viaarxiv icon

GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior

Add code
Jun 09, 2025
Viaarxiv icon

Video World Models with Long-term Spatial Memory

Add code
Jun 05, 2025
Viaarxiv icon

HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation

Add code
Jun 04, 2025
Viaarxiv icon

Research on feature fusion and multimodal patent text based on graph attention network

Add code
May 26, 2025
Viaarxiv icon

Streamline Without Sacrifice -- Squeeze out Computation Redundancy in LMM

Add code
May 21, 2025
Viaarxiv icon

3D Scene Generation: A Survey

Add code
May 08, 2025
Viaarxiv icon

GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography

Add code
Apr 10, 2025
Viaarxiv icon