Picture for Liang Pan

Liang Pan

SEE4D: Pose-Free 4D Generation via Auto-Regressive Video Inpainting

Add code
Oct 30, 2025
Viaarxiv icon

VideoLucy: Deep Memory Backtracking for Long Video Understanding

Add code
Oct 14, 2025
Figure 1 for VideoLucy: Deep Memory Backtracking for Long Video Understanding
Figure 2 for VideoLucy: Deep Memory Backtracking for Long Video Understanding
Figure 3 for VideoLucy: Deep Memory Backtracking for Long Video Understanding
Figure 4 for VideoLucy: Deep Memory Backtracking for Long Video Understanding
Viaarxiv icon

Collaborative Multi-Modal Coding for High-Quality 3D Generation

Add code
Aug 21, 2025
Viaarxiv icon

4DNeX: Feed-Forward 4D Generative Modeling Made Easy

Add code
Aug 18, 2025
Figure 1 for 4DNeX: Feed-Forward 4D Generative Modeling Made Easy
Figure 2 for 4DNeX: Feed-Forward 4D Generative Modeling Made Easy
Figure 3 for 4DNeX: Feed-Forward 4D Generative Modeling Made Easy
Figure 4 for 4DNeX: Feed-Forward 4D Generative Modeling Made Easy
Viaarxiv icon

Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity

Add code
Aug 07, 2025
Figure 1 for Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity
Figure 2 for Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity
Figure 3 for Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity
Figure 4 for Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity
Viaarxiv icon

MOSPA: Human Motion Generation Driven by Spatial Audio

Add code
Jul 16, 2025
Viaarxiv icon

Diff$^2$I2P: Differentiable Image-to-Point Cloud Registration with Diffusion Prior

Add code
Jul 09, 2025
Viaarxiv icon

ReID5o: Achieving Omni Multi-modal Person Re-identification in a Single Model

Add code
Jun 11, 2025
Figure 1 for ReID5o: Achieving Omni Multi-modal Person Re-identification in a Single Model
Figure 2 for ReID5o: Achieving Omni Multi-modal Person Re-identification in a Single Model
Figure 3 for ReID5o: Achieving Omni Multi-modal Person Re-identification in a Single Model
Figure 4 for ReID5o: Achieving Omni Multi-modal Person Re-identification in a Single Model
Viaarxiv icon

TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization

Add code
Mar 25, 2025
Viaarxiv icon

SuperFlow++: Enhanced Spatiotemporal Consistency for Cross-Modal Data Pretraining

Add code
Mar 25, 2025
Viaarxiv icon