speech


Synchronization and Turn-Taking in Full-Duplex Speech Dialogue Models

May 19, 2026
Via arXiv

Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation

May 19, 2026
Via arXiv

CAIT: A Syntactic Parsing Toolkit for Child-Adult InTeractions

May 19, 2026
Via arXiv

Can Large Language Models Reliably Correct Errors in Low-Resource ASR? A Contamination-Aware Case Study on West Frisian

May 19, 2026
Via arXiv

Cross-Talk Speech Reduction, by Separation, for Separation

May 19, 2026
Via arXiv

BCI-sift: An automated feature selection toolbox for Brain Computer Interface applications

May 19, 2026
Via arXiv

FormalASR: End-to-End Spoken Chinese to Formal Text

May 19, 2026
Via arXiv

Contextual Biasing for Streaming ASR via CTC-based Word Spotting

May 19, 2026
Via arXiv

SIREM: Speech-Informed MRI Reconstruction with Learned Sampling

May 18, 2026
Via arXiv

Heterogeneity-Aware Dataset Scheduling for Efficient Audio Large Language Model Training

May 18, 2026
Via arXiv