Transliteration


Quran-MD: A Fine-Grained Multilingual Multimodal Dataset of the Quran

Add code
Jan 25, 2026
Viaarxiv icon

NADIR: Differential Attention Flow for Non-Autoregressive Transliteration in Indic Languages

Add code
Jan 18, 2026
Viaarxiv icon

Can Large Language Models Understand, Reason About, and Generate Code-Switched Text?

Add code
Jan 12, 2026
Viaarxiv icon

Symphonym: Universal Phonetic Embeddings for Cross-Script Toponym Matching via Teacher-Student Distillation

Add code
Jan 11, 2026
Viaarxiv icon

Pragya: An AI-Based Semantic Recommendation System for Sanskrit Subhasitas

Add code
Jan 10, 2026
Viaarxiv icon

Linear Script Representations in Speech Foundation Models Enable Zero-Shot Transliteration

Add code
Jan 06, 2026
Viaarxiv icon

The OCR-PT-CT Project: Semi-Automatic Recognition of Ancient Egyptian Hieroglyphs Based on Metric Learning

Add code
Dec 30, 2025
Viaarxiv icon

From Data Scarcity to Data Care: Reimagining Language Technologies for Serbian and other Low-Resource Languages

Add code
Dec 11, 2025
Viaarxiv icon

Irony Detection in Urdu Text: A Comparative Study Using Machine Learning Models and Large Language Models

Add code
Oct 25, 2025
Viaarxiv icon

Data Augmentation for Maltese NLP using Transliterated and Machine Translated Arabic Data

Add code
Sep 16, 2025
Viaarxiv icon