Picture for Oscar Lo

Oscar Lo

BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions

Add code
Nov 12, 2024
Figure 1 for BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions
Figure 2 for BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions
Figure 3 for BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions
Figure 4 for BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions
Viaarxiv icon

MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens

Add code
Jun 17, 2024
Figure 1 for MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens
Figure 2 for MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens
Figure 3 for MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens
Figure 4 for MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens
Viaarxiv icon