speech


Bandwidth-Efficient and Privacy-Preserving Edge-Cloud Many-to-Many Speech Translation

Add code
May 27, 2026
Viaarxiv icon

Building Community-Centred NLP Resources for Puno Quechua

Add code
May 27, 2026
Viaarxiv icon

Evaluating the Realism of LLM-powered Social Agents: A Case Study of Reactions to Spanish Online News

Add code
May 27, 2026
Viaarxiv icon

Breaking the Script Barrier: Enabling Automatic Alignment for PoS-based ASR Error Analysis in Non-Latin Scripts

Add code
May 27, 2026
Viaarxiv icon

Syllabic-Structure Decoder for Automatic Speech Recognition in Vietnamese

Add code
May 27, 2026
Viaarxiv icon

TARQ: Tail-Aware Reconstruction Quantization for Rare-Word Robust Automatic Speech Recognition

Add code
May 27, 2026
Viaarxiv icon

Slogans or Stance? A Label-Light Diagnostic for Entrepreneurial-Discourse Measurement on Chinese SOE Speeches

Add code
May 27, 2026
Viaarxiv icon

Comprehensive Benchmarking of Long-Form Speech Generation in Diverse Scenarios

Add code
May 27, 2026
Viaarxiv icon

Benchmarking AI for low-resource contexts: Thinking beyond leaderboards

Add code
May 27, 2026
Viaarxiv icon

Why We Need Speech to Evaluate Speech Translation

Add code
May 27, 2026
Viaarxiv icon