Hanwang Zhang

Reasoning Physical Video Generation with Diffusion Timestep Tokens via Reinforcement Learning

Apr 22, 2025
Via arXiv

Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens

Apr 20, 2025
Via arXiv

VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models

Apr 17, 2025
Via arXiv

Generalized Visual Relation Detection with Diffusion Models

Apr 16, 2025
Via arXiv

Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene

Mar 19, 2025
Via arXiv

Project-Probe-Aggregate: Efficient Fine-Tuning for Group Robustness

Mar 12, 2025
Via arXiv

Generalized Kullback-Leibler Divergence Loss

Mar 11, 2025
Via arXiv

Seeing World Dynamics in a Nutshell

Feb 05, 2025
Via arXiv

Nautilus: Locality-aware Autoencoder for Scalable Mesh Generation

Jan 27, 2025
Via arXiv

Pushing Rendering Boundaries: Hard Gaussian Splatting

Dec 06, 2024
Via arXiv