Alert button

Enhancing Multimodal Understanding with CLIP-Based Image-to-Text Transformation

Jan 02, 2024
Chang Che, Qunwei Lin, Xinyu Zhao, Jiaxin Huang, Liqiang Yu

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: