speech


Hard to Be Heard: Phoneme-Level ASR Analysis of Phonologically Complex, Low-Resource Endangered Languages

Apr 20, 2026
Via arXiv

MoVE: Translating Laughter and Tears via Mixture of Vocalization Experts in Speech-to-Speech Translation

Apr 19, 2026

Prosody as Supervision: Bridging the Non-Verbal--Verbal for Multilingual Speech Emotion Recognition

Apr 19, 2026

HCFD: A Benchmark for Audio Deepfake Detection in Healthcare

Apr 19, 2026

Explain the Flag: Contextualizing Hate Speech Beyond Censorship

Apr 16, 2026

VoxSafeBench: Not Just What Is Said, but Who, How, and Where

Apr 16, 2026

Pushing the Limits of On-Device Streaming ASR: A Compact, High-Accuracy English Model for Low-Latency Inference

Apr 16, 2026

The Acoustic Camouflage Phenomenon: Re-evaluating Speech Features for Financial Risk Prediction

Apr 16, 2026

Comparison of Modern Multilingual Text Embedding Techniques for Hate Speech Detection Task

Apr 16, 2026

UniPASE: A Generative Model for Universal Speech Enhancement with High Fidelity and Low Hallucinations

Apr 16, 2026