Picture for Sara Bourbour Hosseinbeigi

Sara Bourbour Hosseinbeigi

Matina: A Large-Scale 73B Token Persian Text Corpus

Add code
Feb 13, 2025
Figure 1 for Matina: A Large-Scale 73B Token Persian Text Corpus
Figure 2 for Matina: A Large-Scale 73B Token Persian Text Corpus
Figure 3 for Matina: A Large-Scale 73B Token Persian Text Corpus
Figure 4 for Matina: A Large-Scale 73B Token Persian Text Corpus
Viaarxiv icon

Advancing Retrieval-Augmented Generation for Persian: Development of Language Models, Comprehensive Benchmarks, and Best Practices for Optimization

Add code
Jan 08, 2025
Viaarxiv icon