Siyu Zhu

AnimateAnything: Fine-Grained Open Domain Image Animation with Motion Guidance

Dec 04, 2023

Improving Adversarial Transferability by Stable Diffusion

Nov 18, 2023

Fine-grained Text-Video Retrieval with Frozen Image Encoders

Jul 14, 2023

UVOSAM: A Mask-free Paradigm for Unsupervised Video Object Segmentation via Segment Anything Model

May 22, 2023

Monocular Scene Reconstruction with 3D SDF Transformers

Jan 31, 2023

Towards Robust Video Instance Segmentation with Temporal-Aware Transformer

Jan 20, 2023

Learning Aligned Cross-modal Representations for Referring Image Segmentation

Jan 16, 2023

RenderNet: Visual Relocalization Using Virtual Viewpoints in Large-Scale Indoor Environments

Jul 26, 2022

RCP: Recurrent Closest Point for Scene Flow Estimation on 3D Point Clouds

May 24, 2022

NeW CRFs: Neural Window Fully-connected CRFs for Monocular Depth Estimation

Mar 03, 2022