Hrishikesh Garud

From English-Centric to Effective Bilingual: LLMs with Custom Tokenizers for Underrepresented Languages

Oct 24, 2024
Via arXiv