speech


Modeling Overlapped Speech with Shuffles

Add code
Mar 18, 2026
Viaarxiv icon

Neuron-Level Emotion Control in Speech-Generative Large Audio-Language Models

Add code
Mar 18, 2026
Viaarxiv icon

ALIGN: Adversarial Learning for Generalizable Speech Neuroprosthesis

Add code
Mar 18, 2026
Viaarxiv icon

STEP: Detecting Audio Backdoor Attacks via Stability-based Trigger Exposure Profiling

Add code
Mar 18, 2026
Viaarxiv icon

MOSS-TTS Technical Report

Add code
Mar 18, 2026
Viaarxiv icon

Multi-Source Evidence Fusion for Audio Question Answering

Add code
Mar 18, 2026
Viaarxiv icon

Robust Nasality Representation Learning for Cleft Palate-Related Velopharyngeal Dysfunction Screening in Real-World Settings

Add code
Mar 18, 2026
Viaarxiv icon

LoGSAM: Parameter-Efficient Cross-Modal Grounding for MRI Segmentation

Add code
Mar 18, 2026
Viaarxiv icon

SpokenUS: A Spoken User Simulator for Task-Oriented Dialogue

Add code
Mar 17, 2026
Viaarxiv icon

Linearized Bregman Iterations for Sparse Spiking Neural Networks

Add code
Mar 17, 2026
Viaarxiv icon