Speech recognition


Speech recognition is the task of identifying words spoken aloud by analyzing the speaker's voice and language, then transcribing those words accurately as text.

CoSTA: Cognitive-State-Conditioned TTS Data Augmentation Using ASR Transcripts for Alzheimer's Disease Detection

Jun 04, 2026
Via arXiv

Read What You Hear: Reference-Free Hypotheses Evaluation with Acoustic Discrepancy

Jun 03, 2026
Via arXiv

Speech Emotion Recognition using Attention-based LSTM-Network with Residual Connection

Jun 02, 2026
Via arXiv

Test-Time Compute Scaling for ASR with Depth-Conditioned Looped Transformers

Jun 03, 2026
Via arXiv

TRADE: Transducer-Augmented Decoder for Speech LLM

Jun 07, 2026
Via arXiv

Echo: A Joint-Embedding Predictive Architecture for Speaker Diarization and Speech Recognition in a Shared Latent Space

Jun 01, 2026
Via arXiv

Spiking and Event-driven Neuromorphic Mamba Models for Efficient Speech Recognition

May 31, 2026
Via arXiv

Efficient ASR Training with Conversations that Never Happened

Jun 02, 2026
Via arXiv

Long-Term and Short-Term Transistor Aging in Deep Neural Networks: Impact and Mitigation

Jun 02, 2026
Via arXiv

LaSR: Context-Aware Speech Recognition via Latent Reasoning

May 30, 2026
Via arXiv