Picture for Yin Hu

Yin Hu

Design as Desired: Utilizing Visual Question Answering for Multimodal Pre-training

Add code
Apr 08, 2024
Viaarxiv icon