speech


Towards Energy-Efficient and Low-Latency Voice-Controlled Smart Homes: A Proposal for Offline Speech Recognition and IoT Integration

Jun 09, 2025
Via arXiv

Multilingual Hate Speech Detection in Social Media Using Translation-Based Approaches with Large Language Models

Jun 09, 2025
Via arXiv

Multi-Distillation from Speech and Music Representation Models

Jun 08, 2025
Via arXiv

E-BATS: Efficient Backpropagation-Free Test-Time Adaptation for Speech Foundation Models

Jun 08, 2025
Via arXiv

Speech Recognition on TV Series with Video-guided Post-Correction

Jun 08, 2025
Via arXiv

Streaming Endpointer for Spoken Dialogue using Neural Audio Codecs and Label-Delayed Training

Jun 08, 2025
Via arXiv

"In This Environment, As That Speaker": A Text-Driven Framework for Multi-Attribute Speech Conversion

Jun 08, 2025
Via arXiv

Technical Report: A Practical Guide to Kaldi ASR Optimization

Jun 08, 2025
Via arXiv

Towards Generalized Source Tracing for Codec-Based Deepfake Speech

Jun 08, 2025
Via arXiv

Automatic Speech Recognition of African American English: Lexical and Contextual Effects

Jun 07, 2025
Via arXiv