Ziyun Zeng

GMMFormer: Gaussian-Mixture-Model based Transformer for Efficient Partially Relevant Video Retrieval

Oct 08, 2023
Yuting Wang, Jinpeng Wang, Bin Chen, Ziyun Zeng, Shu-Tao Xia

Making LLaMA SEE and Draw with SEED Tokenizer

Oct 02, 2023
Yuying Ge, Sijie Zhao, Ziyun Zeng, Yixiao Ge, Chen Li, Xintao Wang, Ying Shan

VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation

Aug 28, 2023
Xudong Wang, Ishan Misra, Ziyun Zeng, Rohit Girdhar, Trevor Darrell

MISSRec: Pre-training and Transferring Multi-modal Interest-aware Sequence Representation for Recommendation

Aug 22, 2023
Jinpeng Wang, Ziyun Zeng, Yunxiao Wang, Yuting Wang, Xingyu Lu, Tianxiang Li, Jun Yuan, Rui Zhang, Hai-Tao Zheng, Shu-Tao Xia

Planting a SEED of Vision in Large Language Model

Jul 16, 2023
Yuying Ge, Yixiao Ge, Ziyun Zeng, Xintao Wang, Ying Shan

TVTSv2: Learning Out-of-the-box Spatiotemporal Visual Representations at Scale

May 23, 2023
Ziyun Zeng, Yixiao Ge, Zhan Tong, Xihui Liu, Shu-Tao Xia, Ying Shan

Contrastive Masked Autoencoders for Self-Supervised Video Hashing

Nov 23, 2022
Yuting Wang, Jinpeng Wang, Bin Chen, Ziyun Zeng, Shutao Xia

Learning Transferable Spatiotemporal Representations from Natural Script Knowledge

Sep 30, 2022
Ziyun Zeng, Yuying Ge, Xihui Liu, Bin Chen, Ping Luo, Shu-Tao Xia, Yixiao Ge

Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval

Feb 10, 2022
Jinpeng Wang, Bin Chen, Dongliang Liao, Ziyun Zeng, Gongfu Li, Shu-Tao Xia, Jin Xu
