Picture for Longfang Zhao

Longfang Zhao

Lumos : Empowering Multimodal LLMs with Scene Text Recognition

Add code
Feb 12, 2024
Figure 1 for Lumos : Empowering Multimodal LLMs with Scene Text Recognition
Figure 2 for Lumos : Empowering Multimodal LLMs with Scene Text Recognition
Figure 3 for Lumos : Empowering Multimodal LLMs with Scene Text Recognition
Figure 4 for Lumos : Empowering Multimodal LLMs with Scene Text Recognition
Viaarxiv icon