speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

ViMedCSS: A Vietnamese Medical Code-Switching Speech Dataset & Benchmark

Add code
Feb 13, 2026
Viaarxiv icon

PISHYAR: A Socially Intelligent Smart Cane for Indoor Social Navigation and Multimodal Human-Robot Interaction for Visually Impaired People

Add code
Feb 13, 2026
Viaarxiv icon

On the Sensitivity of Firing Rate-Based Federated Spiking Neural Networks to Differential Privacy

Add code
Feb 12, 2026
Viaarxiv icon

TC-BiMamba: Trans-Chunk bidirectionally within BiMamba for unified streaming and non-streaming ASR

Add code
Feb 12, 2026
Viaarxiv icon

Moonshine v2: Ergodic Streaming Encoder ASR for Latency-Critical Speech Applications

Add code
Feb 12, 2026
Viaarxiv icon

Voxtral Realtime

Add code
Feb 11, 2026
Viaarxiv icon

ViSpeechFormer: A Phonemic Approach for Vietnamese Automatic Speech Recognition

Add code
Feb 10, 2026
Viaarxiv icon

Where Are We At with Automatic Speech Recognition for the Bambara Language?

Add code
Feb 10, 2026
Viaarxiv icon

Self-Supervised Learning for Speaker Recognition: A study and review

Add code
Feb 11, 2026
Viaarxiv icon

RE-LLM: Refining Empathetic Speech-LLM Responses by Integrating Emotion Nuance

Add code
Feb 11, 2026
Viaarxiv icon