Alert button

Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models

Add code
Bookmark button
Alert button
Apr 19, 2024
Chuofan Ma, Yi Jiang, Jiannan Wu, Zehuan Yuan, Xiaojuan Qi

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: