Hengshuang Zhao

BOOD: Boundary-based Out-Of-Distribution Data Generation

Aug 01, 2025
Via arXiv

VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

Jul 17, 2025
Via arXiv

Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching

Jul 03, 2025
Via arXiv

LiteReality: Graphics-Ready 3D Scene Reconstruction from RGB-D Scans

Jul 03, 2025
Via arXiv

DreamComposer++: Empowering Diffusion Models with Multi-View Conditions for 3D Content Generation

Jul 03, 2025
Via arXiv

TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization

Jun 17, 2025
Via arXiv

FocalClick-XL: Towards Unified and High-quality Interactive Segmentation

Jun 17, 2025
Via arXiv

PlayerOne: Egocentric World Simulator

Jun 11, 2025
Via arXiv

LayerFlow: A Unified Model for Layer-aware Video Generation

Jun 04, 2025
Via arXiv

GenSpace: Benchmarking Spatially-Aware Image Generation

May 30, 2025
Via arXiv