Alert button
Picture for Zejun Li

Zejun Li

Alert button

DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning

Add code
Bookmark button
Alert button
Apr 02, 2024
Mengfei Du, Binhao Wu, Jiwen Zhang, Zhihao Fan, Zejun Li, Ruipu Luo, Xuanjing Huang, Zhongyu Wei

Viaarxiv icon

ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks

Add code
Bookmark button
Alert button
Oct 17, 2023
Zejun Li, Ye Wang, Mengfei Du, Qingwen Liu, Binhao Wu, Jiwen Zhang, Chengxing Zhou, Zhihao Fan, Jie Fu, Jingjing Chen, Xuanjing Huang, Zhongyu Wei

Figure 1 for ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks
Figure 2 for ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks
Figure 3 for ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks
Figure 4 for ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks
Viaarxiv icon

A Unified Continuous Learning Framework for Multi-modal Knowledge Discovery and Pre-training

Add code
Bookmark button
Alert button
Jun 11, 2022
Zhihao Fan, Zhongyu Wei, Jingjing Chen, Siyuan Wang, Zejun Li, Jiarong Xu, Xuanjing Huang

Figure 1 for A Unified Continuous Learning Framework for Multi-modal Knowledge Discovery and Pre-training
Figure 2 for A Unified Continuous Learning Framework for Multi-modal Knowledge Discovery and Pre-training
Figure 3 for A Unified Continuous Learning Framework for Multi-modal Knowledge Discovery and Pre-training
Figure 4 for A Unified Continuous Learning Framework for Multi-modal Knowledge Discovery and Pre-training
Viaarxiv icon

MVP: Multi-Stage Vision-Language Pre-Training via Multi-Level Semantic Alignment

Add code
Bookmark button
Alert button
Jan 29, 2022
Zejun Li, Zhihao Fan, Huaixiao Tou, Zhongyu Wei

Figure 1 for MVP: Multi-Stage Vision-Language Pre-Training via Multi-Level Semantic Alignment
Figure 2 for MVP: Multi-Stage Vision-Language Pre-Training via Multi-Level Semantic Alignment
Figure 3 for MVP: Multi-Stage Vision-Language Pre-Training via Multi-Level Semantic Alignment
Figure 4 for MVP: Multi-Stage Vision-Language Pre-Training via Multi-Level Semantic Alignment
Viaarxiv icon

Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval

Add code
Bookmark button
Alert button
Nov 05, 2021
Zhihao Fan, Zhongyu Wei, Zejun Li, Siyuan Wang, Jianqing Fan

Figure 1 for Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval
Figure 2 for Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval
Figure 3 for Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval
Figure 4 for Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval
Viaarxiv icon

Constructing Phrase-level Semantic Labels to Form Multi-Grained Supervision for Image-Text Retrieval

Add code
Bookmark button
Alert button
Sep 12, 2021
Zhihao Fan, Zhongyu Wei, Zejun Li, Siyuan Wang, Haijun Shan, Xuanjing Huang, Jianqing Fan

Figure 1 for Constructing Phrase-level Semantic Labels to Form Multi-Grained Supervision for Image-Text Retrieval
Figure 2 for Constructing Phrase-level Semantic Labels to Form Multi-Grained Supervision for Image-Text Retrieval
Figure 3 for Constructing Phrase-level Semantic Labels to Form Multi-Grained Supervision for Image-Text Retrieval
Figure 4 for Constructing Phrase-level Semantic Labels to Form Multi-Grained Supervision for Image-Text Retrieval
Viaarxiv icon

TCIC: Theme Concepts Learning Cross Language and Vision for Image Captioning

Add code
Bookmark button
Alert button
Jun 21, 2021
Zhihao Fan, Zhongyu Wei, Siyuan Wang, Ruize Wang, Zejun Li, Haijun Shan, Xuanjing Huang

Figure 1 for TCIC: Theme Concepts Learning Cross Language and Vision for Image Captioning
Figure 2 for TCIC: Theme Concepts Learning Cross Language and Vision for Image Captioning
Figure 3 for TCIC: Theme Concepts Learning Cross Language and Vision for Image Captioning
Figure 4 for TCIC: Theme Concepts Learning Cross Language and Vision for Image Captioning
Viaarxiv icon

An Unsupervised Sampling Approach for Image-Sentence Matching Using Document-Level Structural Information

Add code
Bookmark button
Alert button
Mar 21, 2021
Zejun Li, Zhongyu Wei, Zhihao Fan, Haijun Shan, Xuanjing Huang

Figure 1 for An Unsupervised Sampling Approach for Image-Sentence Matching Using Document-Level Structural Information
Figure 2 for An Unsupervised Sampling Approach for Image-Sentence Matching Using Document-Level Structural Information
Figure 3 for An Unsupervised Sampling Approach for Image-Sentence Matching Using Document-Level Structural Information
Figure 4 for An Unsupervised Sampling Approach for Image-Sentence Matching Using Document-Level Structural Information
Viaarxiv icon

AdaDNNs: Adaptive Ensemble of Deep Neural Networks for Scene Text Recognition

Add code
Bookmark button
Alert button
Oct 10, 2017
Chun Yang, Xu-Cheng Yin, Zejun Li, Jianwei Wu, Chunchao Guo, Hongfa Wang, Lei Xiao

Figure 1 for AdaDNNs: Adaptive Ensemble of Deep Neural Networks for Scene Text Recognition
Figure 2 for AdaDNNs: Adaptive Ensemble of Deep Neural Networks for Scene Text Recognition
Figure 3 for AdaDNNs: Adaptive Ensemble of Deep Neural Networks for Scene Text Recognition
Figure 4 for AdaDNNs: Adaptive Ensemble of Deep Neural Networks for Scene Text Recognition
Viaarxiv icon