speech


"You've got a friend in me": Co-Designing a Peer Social Robot for Young Newcomers' Language and Cultural Learning

Add code
Mar 19, 2026
Viaarxiv icon

ARTT: Augmented Reverberant-Target Training for Unsupervised Monaural Speech Dereverberation

Add code
Mar 19, 2026
Viaarxiv icon

Enhancing Multi-Corpus Training in SSL-Based Anti-Spoofing Models: Domain-Invariant Feature Extraction

Add code
Mar 19, 2026
Viaarxiv icon

DiscoPhon: Benchmarking the Unsupervised Discovery of Phoneme Inventories With Discrete Speech Units

Add code
Mar 19, 2026
Viaarxiv icon

Listen First, Then Answer: Timestamp-Grounded Speech Reasoning

Add code
Mar 19, 2026
Viaarxiv icon

ALIGN: Adversarial Learning for Generalizable Speech Neuroprosthesis

Add code
Mar 18, 2026
Viaarxiv icon

STEP: Detecting Audio Backdoor Attacks via Stability-based Trigger Exposure Profiling

Add code
Mar 18, 2026
Viaarxiv icon

MOSS-TTS Technical Report

Add code
Mar 18, 2026
Viaarxiv icon

Multi-Source Evidence Fusion for Audio Question Answering

Add code
Mar 18, 2026
Viaarxiv icon

Robust Nasality Representation Learning for Cleft Palate-Related Velopharyngeal Dysfunction Screening in Real-World Settings

Add code
Mar 18, 2026
Viaarxiv icon