Alert button

Enhancing Multimodal Large Language Models with Vision Detection Models: An Empirical Study

Jan 31, 2024
Qirui Jiao, Daoyuan Chen, Yilun Huang, Yaliang Li, Ying Shen

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: