speech


Requirements Elicitation Follow-Up Question Generation

Jul 03, 2025
Via arXiv

Self-Steering Deep Non-Linear Spatially Selective Filters for Efficient Extraction of Moving Speakers under Weak Guidance

Jul 03, 2025
Via arXiv

DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment

Jul 03, 2025
Via arXiv

Measurement of the Granularity of Vowel Production Space By Just Producible Different (JPD) Limens

Jul 03, 2025
Via arXiv

De-AntiFake: Rethinking the Protective Perturbations Against Voice Cloning Attacks

Jul 03, 2025
Via arXiv

Multi-Utterance Speech Separation and Association Trained on Short Segments

Jul 03, 2025
Via arXiv

Open-Source System for Multilingual Translation and Cloned Speech Synthesis

Jul 03, 2025
Via arXiv

A Cookbook for Community-driven Data Collection of Impaired Speech in Low-Resource Languages

Jul 03, 2025
Via arXiv

Benchmarking Akan ASR Models Across Domain-Specific Datasets: A Comparative Evaluation of Performance, Scalability, and Adaptability

Jul 03, 2025
Via arXiv

Posterior Transition Modeling for Unsupervised Diffusion-Based Speech Enhancement

Jul 03, 2025
Via arXiv