Hengshuang Zhao

DiffCamera: Arbitrary Refocusing on Images

Sep 30, 2025
Via arXiv

Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search

Sep 09, 2025
Via arXiv

ROSE: Remove Objects with Side Effects in Videos

Aug 26, 2025
Via arXiv

Train Once, Deploy Anywhere: Realize Data-Efficient Dynamic Object Manipulation

Aug 19, 2025
Via arXiv

BOOD: Boundary-based Out-Of-Distribution Data Generation

Aug 01, 2025
Via arXiv

VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

Jul 17, 2025
Via arXiv

DreamComposer++: Empowering Diffusion Models with Multi-View Conditions for 3D Content Generation

Jul 03, 2025
Via arXiv

Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching

Jul 03, 2025
Via arXiv

LiteReality: Graphics-Ready 3D Scene Reconstruction from RGB-D Scans

Jul 03, 2025
Via arXiv

FocalClick-XL: Towards Unified and High-quality Interactive Segmentation

Jun 17, 2025
Via arXiv