Siyu Zhu

Fine-grained Text-Video Retrieval with Frozen Image Encoders

Jul 14, 2023
Zuozhuo Dai, Fangtao Shao, Qingkun Su, Zilong Dong, Siyu Zhu

UVOSAM: A Mask-free Paradigm for Unsupervised Video Object Segmentation via Segment Anything Model

May 22, 2023
Zhenghao Zhang, Zhichao Wei, Shengfan Zhang, Zuozhuo Dai, Siyu Zhu

Monocular Scene Reconstruction with 3D SDF Transformers

Jan 31, 2023
Weihao Yuan, Xiaodong Gu, Heng Li, Zilong Dong, Siyu Zhu

Towards Robust Video Instance Segmentation with Temporal-Aware Transformer

Jan 20, 2023
Zhenghao Zhang, Fangtao Shao, Zuozhuo Dai, Siyu Zhu

Learning Aligned Cross-modal Representations for Referring Image Segmentation

Jan 16, 2023
Zhichao Wei, Xiaohao Chen, Mingqiang Chen, Siyu Zhu

RenderNet: Visual Relocalization Using Virtual Viewpoints in Large-Scale Indoor Environments

Jul 26, 2022
Jiahui Zhang, Shitao Tang, Kejie Qiu, Rui Huang, Chuan Fang, Le Cui, Zilong Dong, Siyu Zhu, Ping Tan

RCP: Recurrent Closest Point for Scene Flow Estimation on 3D Point Clouds

May 24, 2022
Xiaodong Gu, Chengzhou Tang, Weihao Yuan, Zuozhuo Dai, Siyu Zhu, Ping Tan

NeW CRFs: Neural Window Fully-connected CRFs for Monocular Depth Estimation

Mar 03, 2022
Weihao Yuan, Xiaodong Gu, Zuozhuo Dai, Siyu Zhu, Ping Tan

QuadTree Attention for Vision Transformers

Jan 08, 2022
Shitao Tang, Jiahui Zhang, Siyu Zhu, Ping Tan

GB-CosFace: Rethinking Softmax-based Face Recognition from the Perspective of Open Set Classification

Nov 22, 2021
Lizhe Liu, Mingqiang Chen, Xiaohao Chen, Siyu Zhu, Ping Tan
