speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Reference Microphone Selection for Guided Source Separation based on the Normalized L-p Norm

Add code
Oct 31, 2025
Viaarxiv icon

Overview of the MEDIQA-OE 2025 Shared Task on Medical Order Extraction from Doctor-Patient Consultations

Add code
Oct 30, 2025
Viaarxiv icon

HMM for short independent sequences: Multiple sequence Baum-Welch application

Add code
Oct 30, 2025
Viaarxiv icon

Adapting Speech Foundation Models with Large Language Models for Unified Speech Recognition

Add code
Oct 27, 2025
Viaarxiv icon

Mitigating Attention Sinks and Massive Activations in Audio-Visual Speech Recognition with LLMS

Add code
Oct 26, 2025
Viaarxiv icon

LRW-Persian: Lip-reading in the Wild Dataset for Persian Language

Add code
Oct 26, 2025
Viaarxiv icon

The Tonogenesis Continuum in Tibetan: A Computational Investigation

Add code
Oct 26, 2025
Viaarxiv icon

A Sociophonetic Analysis of Racial Bias in Commercial ASR Systems Using the Pacific Northwest English Corpus

Add code
Oct 26, 2025
Viaarxiv icon

EchoMind: An Interrelated Multi-level Benchmark for Evaluating Empathetic Speech Language Models

Add code
Oct 26, 2025
Viaarxiv icon

Tibetan Language and AI: A Comprehensive Survey of Resources, Methods and Challenges

Add code
Oct 22, 2025
Viaarxiv icon