speech


A Dataset for Automatic Assessment of TTS Quality in Spanish

Add code
Jul 02, 2025
Viaarxiv icon

First Steps Towards Voice Anonymization for Code-Switching Speech

Add code
Jul 02, 2025
Viaarxiv icon

A Review on Sound Source Localization in Robotics: Focusing on Deep Learning Methods

Add code
Jul 01, 2025
Viaarxiv icon

MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement

Add code
Jul 01, 2025
Viaarxiv icon

GGTalker: Talking Head Systhesis with Generalizable Gaussian Priors and Identity-Specific Adaptation

Add code
Jun 26, 2025
Viaarxiv icon

Aligning Spoken Dialogue Models from User Interactions

Add code
Jun 26, 2025
Viaarxiv icon

Deception Detection in Dyadic Exchanges Using Multimodal Machine Learning: A Study on a Swedish Cohort

Add code
Jun 26, 2025
Viaarxiv icon

Hybrid Deep Learning and Signal Processing for Arabic Dialect Recognition in Low-Resource Settings

Add code
Jun 26, 2025
Viaarxiv icon

A Multi-Stage Framework for Multimodal Controllable Speech Synthesis

Add code
Jun 26, 2025
Viaarxiv icon

Inside you are many wolves: Using cognitive models to interpret value trade-offs in LLMs

Add code
Jun 25, 2025
Viaarxiv icon