Alert button

Decoding Data Quality via Synthetic Corruptions: Embedding-guided Pruning of Code Data

Dec 05, 2023
Yu Yang, Aaditya K. Singh, Mostafa Elhoushi, Anas Mahmoud, Kushal Tirumala, Fabian Gloeckle, Baptiste Rozière, Carole-Jean Wu, Ari S. Morcos, Newsha Ardalani

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: