speech


Exploring Efficient Directional and Distance Cues for Regional Speech Separation

Add code
Aug 11, 2025
Viaarxiv icon

SASST: Leveraging Syntax-Aware Chunking and LLMs for Simultaneous Speech Translation

Add code
Aug 11, 2025
Viaarxiv icon

G-IFT: A Gated Linear Unit adapter with Iterative Fine-Tuning for Low-Resource Children's Speaker Verification

Add code
Aug 11, 2025
Viaarxiv icon

AD-AVSR: Asymmetric Dual-stream Enhancement for Robust Audio-Visual Speech Recognition

Add code
Aug 11, 2025
Viaarxiv icon

UniFlow: Unifying Speech Front-End Tasks via Continuous Generative Modeling

Add code
Aug 11, 2025
Viaarxiv icon

Exploring Disentangled Neural Speech Codecs from Self-Supervised Representations

Add code
Aug 11, 2025
Viaarxiv icon

MSU-Bench: Towards Understanding the Conversational Multi-talker Scenarios

Add code
Aug 11, 2025
Viaarxiv icon

Bridging ASR and LLMs for Dysarthric Speech Recognition: Benchmarking Self-Supervised and Generative Approaches

Add code
Aug 11, 2025
Viaarxiv icon

Touch Speaks, Sound Feels: A Multimodal Approach to Affective and Social Touch from Robots to Humans

Add code
Aug 11, 2025
Viaarxiv icon

A Small-footprint Acoustic Echo Cancellation Solution for Mobile Full-Duplex Speech Interactions

Add code
Aug 11, 2025
Viaarxiv icon