speech


Zero-Shot Speech LLMs for Multi-Aspect Evaluation of L2 Speech: Challenges and Opportunities

Add code
Jan 20, 2026
Viaarxiv icon

HateXScore: A Metric Suite for Evaluating Reasoning Quality in Hate Speech Explanations

Add code
Jan 20, 2026
Viaarxiv icon

Quantifying Speaker Embedding Phonological Rule Interactions in Accented Speech Synthesis

Add code
Jan 20, 2026
Viaarxiv icon

DAME: Duration-Aware Matryoshka Embedding for Duration-Robust Speaker Verification

Add code
Jan 20, 2026
Viaarxiv icon

PRiSM: Benchmarking Phone Realization in Speech Models

Add code
Jan 20, 2026
Viaarxiv icon

VoCodec: An Efficient Lightweight Low-Bitrate Speech Codec

Add code
Jan 19, 2026
Viaarxiv icon

Content Leakage in LibriSpeech and Its Impact on the Privacy Evaluation of Speaker Anonymization

Add code
Jan 19, 2026
Viaarxiv icon

UNMIXX: Untangling Highly Correlated Singing Voices Mixtures

Add code
Jan 19, 2026
Viaarxiv icon

Typhoon ASR Real-time: FastConformer-Transducer for Thai Automatic Speech Recognition

Add code
Jan 19, 2026
Viaarxiv icon

Beyond Mapping : Domain-Invariant Representations via Spectral Embedding of Optimal Transport Plans

Add code
Jan 19, 2026
Viaarxiv icon