Picture for Amanda Myntti

Amanda Myntti

Register Always Matters: Analysis of LLM Pretraining Data Through the Lens of Language Variation

Add code
Apr 02, 2025
Figure 1 for Register Always Matters: Analysis of LLM Pretraining Data Through the Lens of Language Variation
Figure 2 for Register Always Matters: Analysis of LLM Pretraining Data Through the Lens of Language Variation
Figure 3 for Register Always Matters: Analysis of LLM Pretraining Data Through the Lens of Language Variation
Figure 4 for Register Always Matters: Analysis of LLM Pretraining Data Through the Lens of Language Variation
Viaarxiv icon

Untangling the Unrestricted Web: Automatic Identification of Multilingual Registers

Add code
Jun 28, 2024
Figure 1 for Untangling the Unrestricted Web: Automatic Identification of Multilingual Registers
Figure 2 for Untangling the Unrestricted Web: Automatic Identification of Multilingual Registers
Figure 3 for Untangling the Unrestricted Web: Automatic Identification of Multilingual Registers
Figure 4 for Untangling the Unrestricted Web: Automatic Identification of Multilingual Registers
Viaarxiv icon

Explaining Classes through Word Attribution

Add code
Aug 31, 2021
Figure 1 for Explaining Classes through Word Attribution
Figure 2 for Explaining Classes through Word Attribution
Viaarxiv icon