Alert button

Lumos : Empowering Multimodal LLMs with Scene Text Recognition

Feb 12, 2024
Ashish Shenoy, Yichao Lu, Srihari Jayakumar, Debojeet Chatterjee, Mohsen Moslehpour, Pierce Chuang, Abhay Harpale, Vikas Bhardwaj, Di Xu, Shicong Zhao, Longfang Zhao, Ankit Ramchandani, Xin Luna Dong, Anuj Kumar

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: