Arabic Text Diacritization


Arabic text diacritization is the process of adding diacritical marks to Arabic text to indicate vowels and pronunciation.

Grounding Arabic LLMs in the Doha Historical Dictionary: Retrieval-Augmented Understanding of Quran and Hadith

Add code
Mar 25, 2026
Viaarxiv icon

More Data, Fewer Diacritics: Scaling Arabic TTS

Add code
Mar 02, 2026
Viaarxiv icon

Habibi: Laying the Open-Source Foundation of Unified-Dialectal Arabic Speech Synthesis

Add code
Jan 20, 2026
Viaarxiv icon

synthocr-gen: A synthetic ocr dataset generator for low-resource languages- breaking the data barrier

Add code
Jan 22, 2026
Viaarxiv icon

Are LLMs Good Text Diacritizers? An Arabic and Yorùbá Case Study

Add code
Jun 13, 2025
Viaarxiv icon

Sadeed: Advancing Arabic Diacritization Through Small Language Model

Add code
Apr 30, 2025
Viaarxiv icon

A computational system to handle the orthographic layer of tajwid in contemporary Quranic Orthography

Add code
May 16, 2025
Viaarxiv icon

Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing

Add code
Dec 23, 2024
Viaarxiv icon

CATT: Character-based Arabic Tashkeel Transformer

Add code
Jul 03, 2024
Viaarxiv icon

HATFormer: Historic Handwritten Arabic Text Recognition with Transformers

Add code
Oct 03, 2024
Figure 1 for HATFormer: Historic Handwritten Arabic Text Recognition with Transformers
Figure 2 for HATFormer: Historic Handwritten Arabic Text Recognition with Transformers
Figure 3 for HATFormer: Historic Handwritten Arabic Text Recognition with Transformers
Figure 4 for HATFormer: Historic Handwritten Arabic Text Recognition with Transformers
Viaarxiv icon