Picture for Han Yan

Han Yan

I3DM: Implicit 3D-aware Memory Retrieval and Injection for Consistent Video Scene Generation

Add code
Mar 24, 2026
Viaarxiv icon

MWM: Mobile World Models for Action-Conditioned Consistent Prediction

Add code
Mar 08, 2026
Viaarxiv icon

BachVid: Training-Free Video Generation with Consistent Background and Character

Add code
Oct 24, 2025
Viaarxiv icon

BAG: Body-Aligned 3D Wearable Asset Generation

Add code
Jan 27, 2025
Figure 1 for BAG: Body-Aligned 3D Wearable Asset Generation
Figure 2 for BAG: Body-Aligned 3D Wearable Asset Generation
Figure 3 for BAG: Body-Aligned 3D Wearable Asset Generation
Figure 4 for BAG: Body-Aligned 3D Wearable Asset Generation
Viaarxiv icon

PhyCAGE: Physically Plausible Compositional 3D Asset Generation from a Single Image

Add code
Nov 27, 2024
Figure 1 for PhyCAGE: Physically Plausible Compositional 3D Asset Generation from a Single Image
Figure 2 for PhyCAGE: Physically Plausible Compositional 3D Asset Generation from a Single Image
Figure 3 for PhyCAGE: Physically Plausible Compositional 3D Asset Generation from a Single Image
Figure 4 for PhyCAGE: Physically Plausible Compositional 3D Asset Generation from a Single Image
Viaarxiv icon

NeuSDFusion: A Spatial-Aware Generative Model for 3D Shape Completion, Reconstruction, and Generation

Add code
Mar 27, 2024
Figure 1 for NeuSDFusion: A Spatial-Aware Generative Model for 3D Shape Completion, Reconstruction, and Generation
Figure 2 for NeuSDFusion: A Spatial-Aware Generative Model for 3D Shape Completion, Reconstruction, and Generation
Figure 3 for NeuSDFusion: A Spatial-Aware Generative Model for 3D Shape Completion, Reconstruction, and Generation
Figure 4 for NeuSDFusion: A Spatial-Aware Generative Model for 3D Shape Completion, Reconstruction, and Generation
Viaarxiv icon

Frankenstein: Generating Semantic-Compositional 3D Scenes in One Tri-Plane

Add code
Mar 24, 2024
Figure 1 for Frankenstein: Generating Semantic-Compositional 3D Scenes in One Tri-Plane
Figure 2 for Frankenstein: Generating Semantic-Compositional 3D Scenes in One Tri-Plane
Figure 3 for Frankenstein: Generating Semantic-Compositional 3D Scenes in One Tri-Plane
Figure 4 for Frankenstein: Generating Semantic-Compositional 3D Scenes in One Tri-Plane
Viaarxiv icon

RIS-Enabled Joint Near-Field 3D Localization and Synchronization in SISO Multipath Environments

Add code
Mar 11, 2024
Figure 1 for RIS-Enabled Joint Near-Field 3D Localization and Synchronization in SISO Multipath Environments
Figure 2 for RIS-Enabled Joint Near-Field 3D Localization and Synchronization in SISO Multipath Environments
Figure 3 for RIS-Enabled Joint Near-Field 3D Localization and Synchronization in SISO Multipath Environments
Figure 4 for RIS-Enabled Joint Near-Field 3D Localization and Synchronization in SISO Multipath Environments
Viaarxiv icon

BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation

Add code
Jan 31, 2024
Figure 1 for BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation
Figure 2 for BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation
Figure 3 for BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation
Figure 4 for BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation
Viaarxiv icon

Rethinking Cross-Attention for Infrared and Visible Image Fusion

Add code
Jan 22, 2024
Viaarxiv icon