Cross Lingual Asr


BBPE16: UTF-16-based byte-level byte-pair encoding for improved multilingual speech recognition

Add code
Feb 02, 2026
Viaarxiv icon

Dynamic Multi-Expert Projectors with Stabilized Routing for Multilingual Speech Recognition

Add code
Jan 27, 2026
Viaarxiv icon

Dialect Matters: Cross-Lingual ASR Transfer for Low-Resource Indic Language Varieties

Add code
Jan 07, 2026
Viaarxiv icon

Multimodal In-context Learning for ASR of Low-resource Languages

Add code
Jan 09, 2026
Viaarxiv icon

Corpus of Cross-lingual Dialogues with Minutes and Detection of Misunderstandings

Add code
Dec 23, 2025
Viaarxiv icon

Efficient ASR for Low-Resource Languages: Leveraging Cross-Lingual Unlabeled Data

Add code
Dec 08, 2025
Viaarxiv icon

CLiFT-ASR: A Cross-Lingual Fine-Tuning Framework for Low-Resource Taiwanese Hokkien Speech Recognition

Add code
Nov 10, 2025
Viaarxiv icon

TTA: Transcribe, Translate and Alignment for Cross-lingual Speech Representation

Add code
Nov 18, 2025
Viaarxiv icon

The Limits of Data Scaling: Sub-token Utilization and Acoustic Saturation in Multilingual ASR

Add code
Oct 26, 2025
Figure 1 for The Limits of Data Scaling: Sub-token Utilization and Acoustic Saturation in Multilingual ASR
Figure 2 for The Limits of Data Scaling: Sub-token Utilization and Acoustic Saturation in Multilingual ASR
Figure 3 for The Limits of Data Scaling: Sub-token Utilization and Acoustic Saturation in Multilingual ASR
Figure 4 for The Limits of Data Scaling: Sub-token Utilization and Acoustic Saturation in Multilingual ASR
Viaarxiv icon

LRW-Persian: Lip-reading in the Wild Dataset for Persian Language

Add code
Oct 26, 2025
Viaarxiv icon