Picture for Xinggang Wang

Xinggang Wang

TransLight: Image-Guided Customized Lighting Control with Generative Decoupling

Add code
Aug 20, 2025
Viaarxiv icon

Snap-Snap: Taking Two Images to Reconstruct 3D Human Gaussians in Milliseconds

Add code
Aug 20, 2025
Viaarxiv icon

LENS: Learning to Segment Anything with Unified Reinforced Reasoning

Add code
Aug 19, 2025
Viaarxiv icon

Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices

Add code
Aug 12, 2025
Viaarxiv icon

ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving

Add code
Jun 09, 2025
Viaarxiv icon

Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency

Add code
Jun 09, 2025
Viaarxiv icon

PixelHacker: Image Inpainting with Structural and Semantic Consistency

Add code
Apr 30, 2025
Viaarxiv icon

STP4D: Spatio-Temporal-Prompt Consistent Modeling for Text-to-4D Gaussian Splatting

Add code
Apr 25, 2025
Viaarxiv icon

MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling

Add code
Mar 17, 2025
Viaarxiv icon

Towards Fast, Memory-based and Data-Efficient Vision-Language Policy

Add code
Mar 13, 2025
Figure 1 for Towards Fast, Memory-based and Data-Efficient Vision-Language Policy
Figure 2 for Towards Fast, Memory-based and Data-Efficient Vision-Language Policy
Figure 3 for Towards Fast, Memory-based and Data-Efficient Vision-Language Policy
Figure 4 for Towards Fast, Memory-based and Data-Efficient Vision-Language Policy
Viaarxiv icon