speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Deaf and Hard of Hearing Access to Intelligent Personal Assistants: Comparison of Voice-Based Options with an LLM-Powered Touch Interface

Add code
Jan 21, 2026
Viaarxiv icon

Typhoon ASR Real-time: FastConformer-Transducer for Thai Automatic Speech Recognition

Add code
Jan 19, 2026
Viaarxiv icon

Purification Before Fusion: Toward Mask-Free Speech Enhancement for Robust Audio-Visual Speech Recognition

Add code
Jan 18, 2026
Viaarxiv icon

SSVD-O: Parameter-Efficient Fine-Tuning with Structured SVD for Speech Recognition

Add code
Jan 18, 2026
Viaarxiv icon

SoundBreak: A Systematic Study of Audio-Only Adversarial Attacks on Trimodal Models

Add code
Jan 20, 2026
Viaarxiv icon

Lost in Transcription: How Speech-to-Text Errors Derail Code Understanding

Add code
Jan 20, 2026
Viaarxiv icon

PRiSM: Benchmarking Phone Realization in Speech Models

Add code
Jan 20, 2026
Viaarxiv icon

RIR-Mega-Speech: A Reverberant Speech Corpus with Comprehensive Acoustic Metadata and Reproducible Evaluation

Add code
Jan 25, 2026
Viaarxiv icon

Motion-to-Response Content Generation via Multi-Agent AI System with Real-Time Safety Verification

Add code
Jan 20, 2026
Viaarxiv icon

CTC-DID: CTC-Based Arabic dialect identification for streaming applications

Add code
Jan 18, 2026
Viaarxiv icon