speech


Beyond Mapping : Domain-Invariant Representations via Spectral Embedding of Optimal Transport Plans

Add code
Jan 19, 2026
Viaarxiv icon

UNMIXX: Untangling Highly Correlated Singing Voices Mixtures

Add code
Jan 19, 2026
Viaarxiv icon

Arab Voices: Mapping Standard and Dialectal Arabic Speech Technology

Add code
Jan 19, 2026
Viaarxiv icon

Content Leakage in LibriSpeech and Its Impact on the Privacy Evaluation of Speaker Anonymization

Add code
Jan 19, 2026
Viaarxiv icon

Typhoon ASR Real-time: FastConformer-Transducer for Thai Automatic Speech Recognition

Add code
Jan 19, 2026
Viaarxiv icon

Bi-Attention HateXplain : Taking into account the sequential aspect of data during explainability in a multi-task context

Add code
Jan 19, 2026
Viaarxiv icon

Lombard Speech Synthesis for Any Voice with Controllable Style Embeddings

Add code
Jan 19, 2026
Viaarxiv icon

Exploring Talking Head Models With Adjacent Frame Prior for Speech-Preserving Facial Expression Manipulation

Add code
Jan 19, 2026
Viaarxiv icon

Resource-Conscious RL Algorithms for Deep Brain Stimulation

Add code
Jan 19, 2026
Viaarxiv icon

Robust Online Overdetermined Independent Vector Analysis Based on Bilinear Decomposition

Add code
Jan 18, 2026
Viaarxiv icon