Picture for Boming Chen

Boming Chen

Multimodal Large Language Models for Text-rich Image Understanding: A Comprehensive Review

Add code
Feb 23, 2025
Figure 1 for Multimodal Large Language Models for Text-rich Image Understanding: A Comprehensive Review
Figure 2 for Multimodal Large Language Models for Text-rich Image Understanding: A Comprehensive Review
Figure 3 for Multimodal Large Language Models for Text-rich Image Understanding: A Comprehensive Review
Figure 4 for Multimodal Large Language Models for Text-rich Image Understanding: A Comprehensive Review
Viaarxiv icon