speech


Breaking Data Efficiency Dilemma: A Federated and Augmented Learning Framework For Alzheimer's Disease Detection via Speech

Feb 16, 2026
Via arXiv

SA-SSL-MOS: Self-supervised Learning MOS Prediction with Spectral Augmentation for Generalized Multi-Rate Speech Assessment

Feb 16, 2026
Via arXiv

CLAP-Based Automatic Word Naming Recognition in Post-Stroke Aphasia

Feb 16, 2026
Via arXiv

"Sorry, I Didn't Catch That": How Speech Models Miss What Matters Most

Feb 16, 2026
Via arXiv

From Scarcity to Scale: A Release-Level Analysis of the Pashto Common Voice Dataset

Feb 15, 2026
Via arXiv

Bengali-Loop: Community Benchmarks for Long-Form Bangla ASR and Speaker Diarization

Feb 15, 2026
Via arXiv

ProAct: A Dual-System Framework for Proactive Embodied Social Agents

Feb 15, 2026
Via arXiv

Eureka-Audio: Triggering Audio Intelligence in Compact Language Models

Feb 15, 2026
Via arXiv

Investigation for Relative Voice Impression Estimation

Feb 15, 2026
Via arXiv

GSRM: Generative Speech Reward Model for Speech RLHF

Feb 14, 2026
Via arXiv