speech


TurkicNLP: An NLP Toolkit for Turkic Languages

Add code
Feb 22, 2026
Viaarxiv icon

Retrieval Augmented Enhanced Dual Co-Attention Framework for Target Aware Multimodal Bengali Hateful Meme Detection

Add code
Feb 22, 2026
Viaarxiv icon

CosyAccent: Duration-Controllable Accent Normalization Using Source-Synthesis Training Data

Add code
Feb 22, 2026
Viaarxiv icon

Pay Attention to CTC: Fast and Robust Pseudo-Labelling for Unified Speech Recognition

Add code
Feb 22, 2026
Viaarxiv icon

Whisper: Courtside Edition Enhancing ASR Performance Through LLM-Driven Context Generation

Add code
Feb 21, 2026
Viaarxiv icon

Towards Cross-lingual Values Assessment: A Consensus-Pluralism Perspective

Add code
Feb 19, 2026
Viaarxiv icon

CC-G2PnP: Streaming Grapheme-to-Phoneme and prosody with Conformer-CTC for unsegmented languages

Add code
Feb 19, 2026
Viaarxiv icon

Are LLMs Ready to Replace Bangla Annotators?

Add code
Feb 19, 2026
Viaarxiv icon

PEACE 2.0: Grounded Explanations and Counter-Speech for Combating Hate Expressions

Add code
Feb 19, 2026
Viaarxiv icon

The Cascade Equivalence Hypothesis: When Do Speech LLMs Behave Like ASR$\rightarrow$LLM Pipelines?

Add code
Feb 19, 2026
Viaarxiv icon