The authors of WIT: Wikipedia-based Image Text Dataset for Multimodal Multilingual Machine Learning have not publicly listed the dataset (just yet!)
To easily request data directly from the authors, please
click here ✉️
(OR if you have data to share with the community, please let us know ✉️😊🙏)