WIT: Wikipedia-based Image Text Dataset for Multimodal Multilingual Machine Learning: Dataset

The authors of WIT: Wikipedia-based Image Text Dataset for Multimodal Multilingual Machine Learning have not yet publicly listed the dataset.

