speech


ELEAT-SAGA: Early & Late Integration with Evading Alternating Training for Spoof-Robust Speaker Verification

Add code
Feb 14, 2026
Viaarxiv icon

voice2mode: Phonation Mode Classification in Singing using Self-Supervised Speech Models

Add code
Feb 14, 2026
Viaarxiv icon

Enhancing spatial hearing with cochlear implants: exploring the role of AI, multimodal interaction and perceptual training

Add code
Feb 14, 2026
Viaarxiv icon

Benchmarking Video Foundation Models for Remote Parkinson's Disease Screening

Add code
Feb 13, 2026
Viaarxiv icon

Can we trust AI to detect healthy multilingual English speakers among the cognitively impaired cohort in the UK? An investigation using real-world conversational speech

Add code
Feb 13, 2026
Viaarxiv icon

ViMedCSS: A Vietnamese Medical Code-Switching Speech Dataset & Benchmark

Add code
Feb 13, 2026
Viaarxiv icon

Lamer-SSL: Layer-aware Mixture of LoRA Experts for Continual Multilingual Expansion of Self-supervised Models without Forgetting

Add code
Feb 13, 2026
Viaarxiv icon

ADEPT: RL-Aligned Agentic Decoding of Emotion via Evidence Probing Tools -- From Consensus Learning to Ambiguity-Driven Emotion Reasoning

Add code
Feb 13, 2026
Viaarxiv icon

Decoder-only Conformer with Modality-aware Sparse Mixtures of Experts for ASR

Add code
Feb 13, 2026
Viaarxiv icon

AIWizards at MULTIPRIDE: A Hierarchical Approach to Slur Reclamation Detection

Add code
Feb 13, 2026
Viaarxiv icon