Ziwei Liu

Nanyang Technological University

ArtHOI: Articulated Human-Object Interaction Synthesis by 4D Reconstruction from Video Priors

Mar 04, 2026
Via arXiv

UniG2U-Bench: Do Unified Models Advance Multimodal Understanding?

Mar 03, 2026
Via arXiv

BFA++: Hierarchical Best-Feature-Aware Token Prune for Multi-View Vision Language Action Model

Feb 24, 2026
Via arXiv

A Very Big Video Reasoning Suite

Feb 24, 2026
Via arXiv

JavisDiT++: Unified Modeling and Optimization for Joint Audio-Video Generation

Feb 22, 2026
Via arXiv

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

Feb 09, 2026
Via arXiv

Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition

Feb 09, 2026
Via arXiv

DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation

Jan 29, 2026
Via arXiv

Continual GUI Agents

Jan 29, 2026
Via arXiv

OnlineSI: Taming Large Language Model for Online 3D Understanding and Grounding

Jan 23, 2026
Via arXiv