QuaDMix: Quality-Diversity Balanced Data Selection for Efficient LLM Pretraining

Apr 23, 2025


View paper on arXiv
