Picture for Yueze Wang

Yueze Wang

DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception

Add code
Jul 11, 2024
Viaarxiv icon

Unveiling Encoder-Free Vision-Language Models

Add code
Jun 17, 2024
Figure 1 for Unveiling Encoder-Free Vision-Language Models
Figure 2 for Unveiling Encoder-Free Vision-Language Models
Figure 3 for Unveiling Encoder-Free Vision-Language Models
Figure 4 for Unveiling Encoder-Free Vision-Language Models
Viaarxiv icon

Seeing Clearly, Answering Incorrectly: A Multimodal Robustness Benchmark for Evaluating MLLMs on Leading Questions

Add code
Jun 15, 2024
Figure 1 for Seeing Clearly, Answering Incorrectly: A Multimodal Robustness Benchmark for Evaluating MLLMs on Leading Questions
Figure 2 for Seeing Clearly, Answering Incorrectly: A Multimodal Robustness Benchmark for Evaluating MLLMs on Leading Questions
Figure 3 for Seeing Clearly, Answering Incorrectly: A Multimodal Robustness Benchmark for Evaluating MLLMs on Leading Questions
Figure 4 for Seeing Clearly, Answering Incorrectly: A Multimodal Robustness Benchmark for Evaluating MLLMs on Leading Questions
Viaarxiv icon

Efficient Multimodal Learning from Data-centric Perspective

Add code
Feb 18, 2024
Viaarxiv icon

Universal Prompt Optimizer for Safe Text-to-Image Generation

Add code
Feb 16, 2024
Viaarxiv icon

Generative Multimodal Models are In-Context Learners

Add code
Dec 20, 2023
Figure 1 for Generative Multimodal Models are In-Context Learners
Figure 2 for Generative Multimodal Models are In-Context Learners
Figure 3 for Generative Multimodal Models are In-Context Learners
Figure 4 for Generative Multimodal Models are In-Context Learners
Viaarxiv icon

Generative Pretraining in Multimodality

Add code
Jul 11, 2023
Figure 1 for Generative Pretraining in Multimodality
Figure 2 for Generative Pretraining in Multimodality
Figure 3 for Generative Pretraining in Multimodality
Figure 4 for Generative Pretraining in Multimodality
Viaarxiv icon

Fine-Grained Visual Prompting

Add code
Jun 07, 2023
Figure 1 for Fine-Grained Visual Prompting
Figure 2 for Fine-Grained Visual Prompting
Figure 3 for Fine-Grained Visual Prompting
Figure 4 for Fine-Grained Visual Prompting
Viaarxiv icon