Picture for Hang Zhao

Hang Zhao

FocusDiff: Advancing Fine-Grained Text-Image Alignment for Autoregressive Visual Generation through RL

Add code
Jun 05, 2025
Viaarxiv icon

DiffDecompose: Layer-Wise Decomposition of Alpha-Composited Images via Diffusion Transformers

Add code
May 30, 2025
Viaarxiv icon

Impromptu VLA: Open Weights and Open Data for Driving Vision-Language-Action Models

Add code
May 29, 2025
Viaarxiv icon

Diffusion-Based Generative Models for 3D Occupancy Prediction in Autonomous Driving

Add code
May 29, 2025
Viaarxiv icon

Designing Pin-pression Gripper and Learning its Dexterous Grasping with Online In-hand Adjustment

Add code
May 25, 2025
Viaarxiv icon

Challenger: Affordable Adversarial Driving Video Generation

Add code
May 21, 2025
Viaarxiv icon

Conditioning Matters: Training Diffusion Policies is Faster Than You Think

Add code
May 16, 2025
Viaarxiv icon

PIN-WM: Learning Physics-INformed World Models for Non-Prehensile Manipulation

Add code
Apr 23, 2025
Viaarxiv icon

Deliberate Planning of 3D Bin Packing on Packing Configuration Trees

Add code
Apr 06, 2025
Viaarxiv icon

Towards Reliable Time Series Forecasting under Future Uncertainty: Ambiguity and Novelty Rejection Mechanisms

Add code
Mar 25, 2025
Viaarxiv icon