Picture for Chunyuan Li

Chunyuan Li

LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models

Add code
Jul 17, 2024
Viaarxiv icon

LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models

Add code
Jul 10, 2024
Viaarxiv icon

Long Context Transfer from Language to Vision

Add code
Jun 24, 2024
Viaarxiv icon

Beyond Raw Videos: Understanding Edited Videos with Large Multimodal Model

Add code
Jun 15, 2024
Viaarxiv icon

MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding

Add code
Jun 13, 2024
Figure 1 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Figure 2 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Figure 3 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Figure 4 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Viaarxiv icon

Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment

Add code
May 28, 2024
Figure 1 for Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment
Figure 2 for Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment
Figure 3 for Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment
Figure 4 for Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment
Viaarxiv icon

Graphic Design with Large Multimodal Model

Add code
Apr 22, 2024
Figure 1 for Graphic Design with Large Multimodal Model
Figure 2 for Graphic Design with Large Multimodal Model
Figure 3 for Graphic Design with Large Multimodal Model
Figure 4 for Graphic Design with Large Multimodal Model
Viaarxiv icon

Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward

Add code
Apr 02, 2024
Figure 1 for Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward
Figure 2 for Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward
Figure 3 for Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward
Figure 4 for Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward
Viaarxiv icon

Training Small Multimodal Models to Bridge Biomedical Competency Gap: A Case Study in Radiology Imaging

Add code
Mar 20, 2024
Figure 1 for Training Small Multimodal Models to Bridge Biomedical Competency Gap: A Case Study in Radiology Imaging
Figure 2 for Training Small Multimodal Models to Bridge Biomedical Competency Gap: A Case Study in Radiology Imaging
Figure 3 for Training Small Multimodal Models to Bridge Biomedical Competency Gap: A Case Study in Radiology Imaging
Figure 4 for Training Small Multimodal Models to Bridge Biomedical Competency Gap: A Case Study in Radiology Imaging
Viaarxiv icon

TrustLLM: Trustworthiness in Large Language Models

Add code
Jan 25, 2024
Figure 1 for TrustLLM: Trustworthiness in Large Language Models
Figure 2 for TrustLLM: Trustworthiness in Large Language Models
Figure 3 for TrustLLM: Trustworthiness in Large Language Models
Figure 4 for TrustLLM: Trustworthiness in Large Language Models
Viaarxiv icon