Xingang Pan

From Sparse to Dense: Multi-View GRPO for Flow Models via Augmented Condition Space

Mar 13, 2026
Via arXiv

Hand2World: Autoregressive Egocentric Interaction Generation via Free-Space Hand Gestures

Feb 13, 2026
Via arXiv

4RC: 4D Reconstruction via Conditional Querying Anytime and Anywhere

Feb 10, 2026
Via arXiv

PnP-U3D: Plug-and-Play 3D Framework Bridging Autoregression and Diffusion for Unified Understanding and Generation

Feb 03, 2026
Via arXiv

PI-Light: Physics-Inspired Diffusion for Full-Image Relighting

Jan 29, 2026
Via arXiv

StoryMem: Multi-shot Long Video Storytelling with Memory

Dec 22, 2025
Via arXiv

Trainable Log-linear Sparse Attention for Efficient Diffusion Transformers

Dec 18, 2025
Via arXiv

BokehDepth: Enhancing Monocular Depth Estimation through Bokeh Generation

Dec 13, 2025
Via arXiv

FastMesh: Efficient Artistic Mesh Generation via Component Decoupling

Aug 27, 2025
Via arXiv

STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer

Aug 14, 2025
Via arXiv