Picture for Sina Ahmadi

Sina Ahmadi

Translation Asymmetry in LLMs as a Data Augmentation Factor: A Case Study for 6 Romansh Language Varieties

Add code
Mar 26, 2026
Viaarxiv icon

Robust Language Identification for Romansh Varieties

Add code
Mar 16, 2026
Viaarxiv icon

Language Models Are Borrowing-Blind: A Multilingual Evaluation of Loanword Identification across 10 Languages

Add code
Oct 30, 2025
Viaarxiv icon

Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization

Add code
Aug 06, 2025
Figure 1 for Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization
Figure 2 for Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization
Figure 3 for Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization
Figure 4 for Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization
Viaarxiv icon

Conversational Lexicography: Querying Lexicographic Data on Knowledge Graphs with SPARQL through Natural Language

Add code
May 26, 2025
Viaarxiv icon

SwiLTra-Bench: The Swiss Legal Translation Benchmark

Add code
Mar 03, 2025
Viaarxiv icon

Language and Speech Technology for Central Kurdish Varieties

Add code
Mar 04, 2024
Figure 1 for Language and Speech Technology for Central Kurdish Varieties
Figure 2 for Language and Speech Technology for Central Kurdish Varieties
Figure 3 for Language and Speech Technology for Central Kurdish Varieties
Figure 4 for Language and Speech Technology for Central Kurdish Varieties
Viaarxiv icon

A Morphologically-Aware Dictionary-based Data Augmentation Technique for Machine Translation of Under-Represented Languages

Add code
Feb 02, 2024
Viaarxiv icon

CODET: A Benchmark for Contrastive Dialectal Evaluation of Machine Translation

Add code
May 26, 2023
Figure 1 for CODET: A Benchmark for Contrastive Dialectal Evaluation of Machine Translation
Figure 2 for CODET: A Benchmark for Contrastive Dialectal Evaluation of Machine Translation
Figure 3 for CODET: A Benchmark for Contrastive Dialectal Evaluation of Machine Translation
Figure 4 for CODET: A Benchmark for Contrastive Dialectal Evaluation of Machine Translation
Viaarxiv icon

Script Normalization for Unconventional Writing of Under-Resourced Languages in Bilingual Communities

Add code
May 25, 2023
Viaarxiv icon