speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

AfriVoices-KE: A Multilingual Speech Dataset for Kenyan Languages

Add code
Apr 09, 2026
Viaarxiv icon

XR-CareerAssist: An Immersive Platform for Personalised Career Guidance Leveraging Extended Reality and Multimodal AI

Add code
Apr 08, 2026
Viaarxiv icon

Closing the Speech-Text Gap with Limited Audio for Effective Domain Adaptation in LLM-Based ASR

Add code
Apr 07, 2026
Viaarxiv icon

Contextual Biasing for ASR in Speech LLM with Common Word Cues and Bias Word Position Prediction

Add code
Apr 14, 2026
Viaarxiv icon

AI-Driven Modular Services for Accessible Multilingual Education in Immersive Extended Reality Settings: Integrating Speech Processing, Translation, and Sign Language Rendering

Add code
Apr 07, 2026
Viaarxiv icon

INTERACT: An AI-Driven Extended Reality Framework for Accesible Communication Featuring Real-Time Sign Language Interpretation and Emotion Recognition

Add code
Apr 07, 2026
Viaarxiv icon

Identification and Anonymization of Named Entities in Unstructured Information Sources for Use in Social Engineering Detection

Add code
Apr 10, 2026
Viaarxiv icon

Measuring Robustness of Speech Recognition from MEG Signals Under Distribution Shift

Add code
Apr 05, 2026
Viaarxiv icon

Benchmarking Multilingual Speech Models on Pashto: Zero-Shot ASR, Script Failure, and Cross-Domain Evaluation

Add code
Apr 06, 2026
Viaarxiv icon

Development and multi-center evaluation of domain-adapted speech recognition for human-AI teaming in real-world gastrointestinal endoscopy

Add code
Apr 02, 2026
Viaarxiv icon