Speech Recognition


Speech recognition is the task of identifying spoken words by analyzing the voice and language of the speaker, and transcribing those words accurately as text.

An Approach to Combining Video and Speech with Large Language Models in Human-Robot Interaction

Feb 23, 2026
Via arXiv

Pay Attention to CTC: Fast and Robust Pseudo-Labelling for Unified Speech Recognition

Feb 22, 2026
Via arXiv

Assessing the Impact of Speaker Identity in Speech Spoofing Detection

Feb 24, 2026
Via arXiv

TurkicNLP: An NLP Toolkit for Turkic Languages

Feb 22, 2026
Via arXiv

Whisper: Courtside Edition Enhancing ASR Performance Through LLM-Driven Context Generation

Feb 21, 2026
Via arXiv

Color-based Emotion Representation for Speech Emotion Recognition

Feb 18, 2026
Via arXiv

Voice-Driven Semantic Perception for UAV-Assisted Emergency Networks

Feb 19, 2026
Via arXiv

Joint Enhancement and Classification using Coupled Diffusion Models of Signals and Logits

Feb 17, 2026
Via arXiv

Enroll-on-Wakeup: A First Comparative Study of Target Speech Extraction for Seamless Interaction in Real Noisy Human-Machine Dialogue Scenarios

Feb 17, 2026
Via arXiv

"Sorry, I Didn't Catch That": How Speech Models Miss What Matters Most

Feb 16, 2026
Via arXiv