speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

XR-CareerAssist: An Immersive Platform for Personalised Career Guidance Leveraging Extended Reality and Multimodal AI

Add code
Apr 08, 2026
Viaarxiv icon

Closing the Speech-Text Gap with Limited Audio for Effective Domain Adaptation in LLM-Based ASR

Add code
Apr 07, 2026
Viaarxiv icon

Measuring Robustness of Speech Recognition from MEG Signals Under Distribution Shift

Add code
Apr 05, 2026
Viaarxiv icon

SEDTalker: Emotion-Aware 3D Facial Animation Using Frame-Level Speech Emotion Diarization

Add code
Apr 14, 2026
Viaarxiv icon

AI-Driven Modular Services for Accessible Multilingual Education in Immersive Extended Reality Settings: Integrating Speech Processing, Translation, and Sign Language Rendering

Add code
Apr 07, 2026
Viaarxiv icon

INTERACT: An AI-Driven Extended Reality Framework for Accesible Communication Featuring Real-Time Sign Language Interpretation and Emotion Recognition

Add code
Apr 07, 2026
Viaarxiv icon

Benchmarking Multilingual Speech Models on Pashto: Zero-Shot ASR, Script Failure, and Cross-Domain Evaluation

Add code
Apr 06, 2026
Viaarxiv icon

Development and multi-center evaluation of domain-adapted speech recognition for human-AI teaming in real-world gastrointestinal endoscopy

Add code
Apr 02, 2026
Viaarxiv icon

Human-Guided Reasoning with Large Language Models for Vietnamese Speech Emotion Recognition

Add code
Apr 02, 2026
Viaarxiv icon

Advancing LLM-based phoneme-to-grapheme for multilingual speech recognition

Add code
Mar 31, 2026
Viaarxiv icon