Haoyu Wu

Oedipus and the Sphinx: Benchmarking and Improving Visual Language Models for Complex Graphic Reasoning

Aug 01, 2025
Via arXiv

Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling

Jul 10, 2025
Via arXiv

DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations

May 26, 2025
Via arXiv

MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft

Apr 11, 2025
Via arXiv

Fast Autoregressive Video Generation with Diagonal Decoding

Mar 18, 2025
Via arXiv

CoheDancers: Enhancing Interactive Group Dance Generation through Music-Driven Coherence Decomposition

Dec 26, 2024
Via arXiv

VideoDPO: Omni-Preference Alignment for Video Diffusion Generation

Dec 18, 2024
Via arXiv

MLI-NeRF: Multi-Light Intrinsic-Aware Neural Radiance Fields

Nov 26, 2024
Via arXiv

Importance-based Token Merging for Diffusion Models

Nov 23, 2024
Via arXiv

Direct and Explicit 3D Generation from a Single Image

Nov 17, 2024
Via arXiv