Ziwei Liu

Nanyang Technological University

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

Oct 16, 2025

RealDPO: Real or Not Real, that is the Preference

Oct 16, 2025

VideoLucy: Deep Memory Backtracking for Long Video Understanding

Oct 14, 2025

VChain: Chain-of-Visual-Thought for Reasoning in Video Generation

Oct 06, 2025

A Lightweight Convolution and Vision Transformer integrated model with Multi-scale Self-attention Mechanism

Aug 23, 2025

Collaborative Multi-Modal Coding for High-Quality 3D Generation

Aug 21, 2025

Has GPT-5 Achieved Spatial Intelligence? An Empirical Study

Aug 18, 2025

4DNeX: Feed-Forward 4D Generative Modeling Made Easy

Aug 18, 2025

Cut2Next: Generating Next Shot via In-Context Tuning

Aug 12, 2025

Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity

Aug 07, 2025