speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Identifying Hearing Difficulty Moments in Conversational Audio

Add code
Jul 31, 2025
Viaarxiv icon

The Interspeech 2025 Speech Accessibility Project Challenge

Add code
Jul 29, 2025
Viaarxiv icon

The TEA-ASLP System for Multilingual Conversational Speech Recognition and Speech Diarization in MLC-SLM 2025 Challenge

Add code
Jul 24, 2025
Viaarxiv icon

Triple X: A LLM-Based Multilingual Speech Recognition System for the INTERSPEECH2025 MLC-SLM Challenge

Add code
Jul 23, 2025
Viaarxiv icon

System Report for CCL25-Eval Task 10: SRAG-MAV for Fine-Grained Chinese Hate Speech Recognition

Add code
Jul 24, 2025
Viaarxiv icon

End-to-End DOA-Guided Speech Extraction in Noisy Multi-Talker Scenarios

Add code
Jul 28, 2025
Viaarxiv icon

Tiny Noise-Robust Voice Activity Detector for Voice Assistants

Add code
Jul 29, 2025
Viaarxiv icon

An approach to measuring the performance of Automatic Speech Recognition (ASR) models in the context of Large Language Model (LLM) powered applications

Add code
Jul 22, 2025
Viaarxiv icon

Self-Improvement for Audio Large Language Model using Unlabeled Speech

Add code
Jul 27, 2025
Viaarxiv icon

Touch Speaks, Sound Feels: A Multimodal Approach to Affective and Social Touch from Robots to Humans

Add code
Aug 11, 2025
Viaarxiv icon