Deyao Zhu

MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens

Apr 04, 2024
Kirolos Ataallah, Xiaoqian Shen, Eslam Abdelrahman, Essam Sleiman, Deyao Zhu, Jian Ding, Mohamed Elhoseiny

Via arXiv

MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning

Oct 26, 2023
Jun Chen, Deyao Zhu, Xiaoqian Shen, Xiang Li, Zechun Liu, Pengchuan Zhang, Raghuraman Krishnamoorthi, Vikas Chandra, Yunyang Xiong, Mohamed Elhoseiny

Via arXiv

Exploring Open-Vocabulary Semantic Segmentation without Human Labels

Jun 01, 2023
Jun Chen, Deyao Zhu, Guocheng Qian, Bernard Ghanem, Zhicheng Yan, Chenchen Zhu, Fanyi Xiao, Mohamed Elhoseiny, Sean Chang Culatana

Via arXiv

MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models

Apr 20, 2023
Deyao Zhu, Jun Chen, Xiaoqian Shen, Xiang Li, Mohamed Elhoseiny

Via arXiv

Video ChatCaptioner: Towards Enriched Spatiotemporal Descriptions

Apr 13, 2023
Jun Chen, Deyao Zhu, Kilichbek Haydarov, Xiang Li, Mohamed Elhoseiny

Via arXiv

ChatGPT Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions

Mar 12, 2023
Deyao Zhu, Jun Chen, Kilichbek Haydarov, Xiaoqian Shen, Wenxuan Zhang, Mohamed Elhoseiny

Via arXiv

Guiding Online Reinforcement Learning with Action-Free Offline Pretraining

Jan 30, 2023
Deyao Zhu, Yuhui Wang, Jürgen Schmidhuber, Mohamed Elhoseiny

Via arXiv

Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement Learning

Jun 09, 2022
Deyao Zhu, Li Erran Li, Mohamed Elhoseiny

Via arXiv

Social-Implicit: Rethinking Trajectory Prediction Evaluation and The Effectiveness of Implicit Maximum Likelihood Estimation

Mar 06, 2022
Abduallah Mohamed, Deyao Zhu, Warren Vu, Mohamed Elhoseiny, Christian Claudel

Via arXiv