Picture for Nguyen Tai

Nguyen Tai

DataDecide: How to Predict Best Pretraining Data with Small Experiments

Add code
Apr 15, 2025
Figure 1 for DataDecide: How to Predict Best Pretraining Data with Small Experiments
Figure 2 for DataDecide: How to Predict Best Pretraining Data with Small Experiments
Figure 3 for DataDecide: How to Predict Best Pretraining Data with Small Experiments
Figure 4 for DataDecide: How to Predict Best Pretraining Data with Small Experiments
Viaarxiv icon

MMTEB: Massive Multilingual Text Embedding Benchmark

Add code
Feb 19, 2025
Viaarxiv icon