Zishan Liu

SpatialForge: Bootstrapping 3D-Aware Spatial Reasoning from Open-World 2D Images

May 12, 2026

Imagine a City: CityGenAgent for Procedural 3D City Generation

Feb 05, 2026

More than Vanilla Fusion: a Simple, Decoupling-free, Attention Module for Multimodal Fusion Based on Signal Theory

Dec 12, 2023

Learning Audio-Visual embedding for Wild Person Verification

Sep 09, 2022