Arabic Text Diacritization


Arabic text diacritization is the process of adding diacritical marks to Arabic text to indicate vowels and pronunciation.

Thaka at KSAA-2026 Task 2: Regularized Fine-Tuning for Arabic Speech Diacritization

Add code
May 25, 2026
Viaarxiv icon

Pattern-and-root inflectional morphology: the Arabic broken plural

Add code
May 21, 2026
Viaarxiv icon

Grounding Arabic LLMs in the Doha Historical Dictionary: Retrieval-Augmented Understanding of Quran and Hadith

Add code
Mar 25, 2026
Viaarxiv icon

More Data, Fewer Diacritics: Scaling Arabic TTS

Add code
Mar 02, 2026
Viaarxiv icon

Habibi: Laying the Open-Source Foundation of Unified-Dialectal Arabic Speech Synthesis

Add code
Jan 20, 2026
Viaarxiv icon

synthocr-gen: A synthetic ocr dataset generator for low-resource languages- breaking the data barrier

Add code
Jan 22, 2026
Viaarxiv icon

Are LLMs Good Text Diacritizers? An Arabic and Yorùbá Case Study

Add code
Jun 13, 2025
Viaarxiv icon

Sadeed: Advancing Arabic Diacritization Through Small Language Model

Add code
Apr 30, 2025
Viaarxiv icon

A computational system to handle the orthographic layer of tajwid in contemporary Quranic Orthography

Add code
May 16, 2025
Viaarxiv icon

Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing

Add code
Dec 23, 2024
Viaarxiv icon