speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Responsible Benchmarking of Fairness for Automatic Speech Recognition

Add code
May 11, 2026
Viaarxiv icon

A Calculus-Based Framework for Determining Vocabulary Size in End-to-End ASR

Add code
May 14, 2026
Viaarxiv icon

Toward Natural Emotional Text-To-Speech System with Fine-Grained Non-Verbal Expression Control

Add code
May 25, 2026
Viaarxiv icon

REALM: Retrospective Encoder Alignment for LFP Modeling

Add code
May 14, 2026
Viaarxiv icon

Mind the Pause: Disfluency-Aware Objective Tuning for Multilingual Speech Correction with LLMs

Add code
May 12, 2026
Viaarxiv icon

Towards Fine-Grained Multi-Dimensional Speech Understanding: Data Pipeline, Benchmark, and Model

Add code
May 12, 2026
Viaarxiv icon

A Comprehensive Analysis of Tokenization and Self-Supervised Learning in End-to-End Automatic Speech Recognition applied on French Language

Add code
May 05, 2026
Viaarxiv icon

WorldSpeech: A Multilingual Speech Corpus from Around the World

Add code
May 09, 2026
Viaarxiv icon

ORICF -- Open Robotics Inference and Control Framework

Add code
May 10, 2026
Viaarxiv icon

A Paradigm for Interpreting Metrics and Identifying Critical Errors in Automatic Speech Recognition

Add code
May 05, 2026
Viaarxiv icon