The authors of "The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset" have not publicly listed the dataset (just yet!).
To request data directly from the authors, please click here ✉️
(Or, if you have data to share with the community, please let us know ✉️😊🙏)