Get our free extension to see links to code for papers anywhere online!


The authors of The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset have not publicly listed the dataset (just yet!)

To easily request data directly from the authors, please click here ✉️



(OR if you have data to share with the community, please let us know ✉️😊🙏)