Picture for Hang Zhao

Hang Zhao

DriveAgent-R1: Advancing VLM-based Autonomous Driving with Hybrid Thinking and Active Perception

Add code
Jul 28, 2025
Viaarxiv icon

Reusing Attention for One-stage Lane Topology Understanding

Add code
Jul 23, 2025
Viaarxiv icon

Morpheus: A Neural-driven Animatronic Face with Hybrid Actuation and Diverse Emotion Control

Add code
Jul 22, 2025
Viaarxiv icon

FocusDiff: Advancing Fine-Grained Text-Image Alignment for Autoregressive Visual Generation through RL

Add code
Jun 05, 2025
Viaarxiv icon

DiffDecompose: Layer-Wise Decomposition of Alpha-Composited Images via Diffusion Transformers

Add code
May 30, 2025
Viaarxiv icon

Impromptu VLA: Open Weights and Open Data for Driving Vision-Language-Action Models

Add code
May 29, 2025
Viaarxiv icon

Diffusion-Based Generative Models for 3D Occupancy Prediction in Autonomous Driving

Add code
May 29, 2025
Viaarxiv icon

Designing Pin-pression Gripper and Learning its Dexterous Grasping with Online In-hand Adjustment

Add code
May 25, 2025
Viaarxiv icon

Challenger: Affordable Adversarial Driving Video Generation

Add code
May 21, 2025
Viaarxiv icon

Conditioning Matters: Training Diffusion Policies is Faster Than You Think

Add code
May 16, 2025
Viaarxiv icon