Picture for Sina Ahmadi

Sina Ahmadi

Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization

Add code
Aug 06, 2025
Viaarxiv icon

Conversational Lexicography: Querying Lexicographic Data on Knowledge Graphs with SPARQL through Natural Language

Add code
May 26, 2025
Viaarxiv icon

SwiLTra-Bench: The Swiss Legal Translation Benchmark

Add code
Mar 03, 2025
Viaarxiv icon

Language and Speech Technology for Central Kurdish Varieties

Add code
Mar 04, 2024
Figure 1 for Language and Speech Technology for Central Kurdish Varieties
Figure 2 for Language and Speech Technology for Central Kurdish Varieties
Figure 3 for Language and Speech Technology for Central Kurdish Varieties
Figure 4 for Language and Speech Technology for Central Kurdish Varieties
Viaarxiv icon

A Morphologically-Aware Dictionary-based Data Augmentation Technique for Machine Translation of Under-Represented Languages

Add code
Feb 02, 2024
Viaarxiv icon

CODET: A Benchmark for Contrastive Dialectal Evaluation of Machine Translation

Add code
May 26, 2023
Viaarxiv icon

Script Normalization for Unconventional Writing of Under-Resourced Languages in Bilingual Communities

Add code
May 25, 2023
Viaarxiv icon

Transfer Learning for Low-Resource Sentiment Analysis

Add code
Apr 10, 2023
Figure 1 for Transfer Learning for Low-Resource Sentiment Analysis
Figure 2 for Transfer Learning for Low-Resource Sentiment Analysis
Figure 3 for Transfer Learning for Low-Resource Sentiment Analysis
Figure 4 for Transfer Learning for Low-Resource Sentiment Analysis
Viaarxiv icon

Approaches to Corpus Creation for Low-Resource Language Technology: the Case of Southern Kurdish and Laki

Add code
Apr 03, 2023
Viaarxiv icon

PALI: A Language Identification Benchmark for Perso-Arabic Scripts

Add code
Apr 03, 2023
Viaarxiv icon