Alert button
Picture for Yuying Ge

Yuying Ge

Alert button

Planting a SEED of Vision in Large Language Model

Add code
Bookmark button
Alert button
Jul 16, 2023
Yuying Ge, Yixiao Ge, Ziyun Zeng, Xintao Wang, Ying Shan

Figure 1 for Planting a SEED of Vision in Large Language Model
Figure 2 for Planting a SEED of Vision in Large Language Model
Figure 3 for Planting a SEED of Vision in Large Language Model
Figure 4 for Planting a SEED of Vision in Large Language Model
Viaarxiv icon

JourneyDB: A Benchmark for Generative Image Understanding

Add code
Bookmark button
Alert button
Jul 03, 2023
Junting Pan, Keqiang Sun, Yuying Ge, Hao Li, Haodong Duan, Xiaoshi Wu, Renrui Zhang, Aojun Zhou, Zipeng Qin, Yi Wang, Jifeng Dai, Yu Qiao, Hongsheng Li

Figure 1 for JourneyDB: A Benchmark for Generative Image Understanding
Figure 2 for JourneyDB: A Benchmark for Generative Image Understanding
Figure 3 for JourneyDB: A Benchmark for Generative Image Understanding
Figure 4 for JourneyDB: A Benchmark for Generative Image Understanding
Viaarxiv icon

Align, Adapt and Inject: Sound-guided Unified Image Generation

Add code
Bookmark button
Alert button
Jun 20, 2023
Yue Yang, Kaipeng Zhang, Yuying Ge, Wenqi Shao, Zeyue Xue, Yu Qiao, Ping Luo

Figure 1 for Align, Adapt and Inject: Sound-guided Unified Image Generation
Figure 2 for Align, Adapt and Inject: Sound-guided Unified Image Generation
Figure 3 for Align, Adapt and Inject: Sound-guided Unified Image Generation
Figure 4 for Align, Adapt and Inject: Sound-guided Unified Image Generation
Viaarxiv icon

Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models

Add code
Bookmark button
Alert button
Jun 15, 2023
Junting Pan, Ziyi Lin, Yuying Ge, Xiatian Zhu, Renrui Zhang, Yi Wang, Yu Qiao, Hongsheng Li

Figure 1 for Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models
Figure 2 for Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models
Figure 3 for Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models
Figure 4 for Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models
Viaarxiv icon

Self-Play and Self-Describe: Policy Adaptation with Vision-Language Foundation Models

Add code
Bookmark button
Alert button
Dec 14, 2022
Yuying Ge, Annabella Macaluso, Li Erran Li, Ping Luo, Xiaolong Wang

Figure 1 for Self-Play and Self-Describe: Policy Adaptation with Vision-Language Foundation Models
Figure 2 for Self-Play and Self-Describe: Policy Adaptation with Vision-Language Foundation Models
Figure 3 for Self-Play and Self-Describe: Policy Adaptation with Vision-Language Foundation Models
Figure 4 for Self-Play and Self-Describe: Policy Adaptation with Vision-Language Foundation Models
Viaarxiv icon

Learning Transferable Spatiotemporal Representations from Natural Script Knowledge

Add code
Bookmark button
Alert button
Sep 30, 2022
Ziyun Zeng, Yuying Ge, Xihui Liu, Bin Chen, Ping Luo, Shu-Tao Xia, Yixiao Ge

Figure 1 for Learning Transferable Spatiotemporal Representations from Natural Script Knowledge
Figure 2 for Learning Transferable Spatiotemporal Representations from Natural Script Knowledge
Figure 3 for Learning Transferable Spatiotemporal Representations from Natural Script Knowledge
Figure 4 for Learning Transferable Spatiotemporal Representations from Natural Script Knowledge
Viaarxiv icon

MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval

Add code
Bookmark button
Alert button
Apr 26, 2022
Yuying Ge, Yixiao Ge, Xihui Liu, Alex Jinpeng Wang, Jianping Wu, Ying Shan, Xiaohu Qie, Ping Luo

Figure 1 for MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval
Figure 2 for MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval
Figure 3 for MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval
Figure 4 for MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval
Viaarxiv icon

All in One: Exploring Unified Video-Language Pre-training

Add code
Bookmark button
Alert button
Mar 14, 2022
Alex Jinpeng Wang, Yixiao Ge, Rui Yan, Yuying Ge, Xudong Lin, Guanyu Cai, Jianping Wu, Ying Shan, Xiaohu Qie, Mike Zheng Shou

Figure 1 for All in One: Exploring Unified Video-Language Pre-training
Figure 2 for All in One: Exploring Unified Video-Language Pre-training
Figure 3 for All in One: Exploring Unified Video-Language Pre-training
Figure 4 for All in One: Exploring Unified Video-Language Pre-training
Viaarxiv icon

MetaDance: Few-shot Dancing Video Retargeting via Temporal-aware Meta-learning

Add code
Bookmark button
Alert button
Jan 13, 2022
Yuying Ge, Yibing Song, Ruimao Zhang, Ping Luo

Figure 1 for MetaDance: Few-shot Dancing Video Retargeting via Temporal-aware Meta-learning
Figure 2 for MetaDance: Few-shot Dancing Video Retargeting via Temporal-aware Meta-learning
Figure 3 for MetaDance: Few-shot Dancing Video Retargeting via Temporal-aware Meta-learning
Figure 4 for MetaDance: Few-shot Dancing Video Retargeting via Temporal-aware Meta-learning
Viaarxiv icon

BridgeFormer: Bridging Video-text Retrieval with Multiple Choice Questions

Add code
Bookmark button
Alert button
Jan 13, 2022
Yuying Ge, Yixiao Ge, Xihui Liu, Dian Li, Ying Shan, Xiaohu Qie, Ping Luo

Figure 1 for BridgeFormer: Bridging Video-text Retrieval with Multiple Choice Questions
Figure 2 for BridgeFormer: Bridging Video-text Retrieval with Multiple Choice Questions
Figure 3 for BridgeFormer: Bridging Video-text Retrieval with Multiple Choice Questions
Figure 4 for BridgeFormer: Bridging Video-text Retrieval with Multiple Choice Questions
Viaarxiv icon