Bin Zhu

MoE-LLaVA: Mixture of Experts for Large Vision-Language Models

Feb 04, 2024
Bin Lin, Zhenyu Tang, Yang Ye, Jiaxi Cui, Bin Zhu, Peng Jin, Jinfa Huang, Junwu Zhang, Munan Ning, Li Yuan

Video Editing for Video Retrieval

Feb 04, 2024
Bin Zhu, Kevin Flanagan, Adriano Fragomeni, Michael Wray, Dima Damen

FoodLMM: A Versatile Food Assistant using Large Multi-modal Model

Dec 22, 2023
Yuehao Yin, Huiyan Qi, Bin Zhu, Jingjing Chen, Yu-Gang Jiang, Chong-Wah Ngo

Benchmarking and Defending Against Indirect Prompt Injection Attacks on Large Language Models

Dec 21, 2023
Jingwei Yi, Yueqi Xie, Bin Zhu, Keegan Hines, Emre Kiciman, Guangzhong Sun, Xing Xie, Fangzhao Wu

CAR: Consolidation, Augmentation and Regulation for Recipe Retrieval

Dec 08, 2023
Fangzhou Song, Bin Zhu, Yanbin Hao, Shuo Wang, Xiangnan He

Video-Bench: A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models

Nov 28, 2023
Munan Ning, Bin Zhu, Yujia Xie, Bin Lin, Jiaxi Cui, Lu Yuan, Dongdong Chen, Li Yuan

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Nov 21, 2023
Bin Lin, Yang Ye, Bin Zhu, Jiaxi Cui, Munan Ning, Peng Jin, Li Yuan

LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Oct 14, 2023
Bin Zhu, Bin Lin, Munan Ning, Yang Yan, Jiaxi Cui, HongFa Wang, Yatian Pang, Wenhao Jiang, Junwu Zhang, Zongwei Li, Wancai Zhang, Zhifeng Li, Wei Liu, Li Yuan

X-Transfer: A Transfer Learning-Based Framework for Robust GAN-Generated Fake Image Detection

Oct 07, 2023
Lei Zhang, Hao Chen, Shu Hu, Bin Zhu, Xi Wu, Jinrong Hu, Xin Wang
