speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

CoSTA: Cognitive-State-Conditioned TTS Data Augmentation Using ASR Transcripts for Alzheimer's Disease Detection

Add code
Jun 04, 2026
Viaarxiv icon

TRADE: Transducer-Augmented Decoder for Speech LLM

Add code
Jun 07, 2026
Viaarxiv icon

Read What You Hear: Reference-Free Hypotheses Evaluation with Acoustic Discrepancy

Add code
Jun 03, 2026
Viaarxiv icon

Test-Time Compute Scaling for ASR with Depth-Conditioned Looped Transformers

Add code
Jun 03, 2026
Viaarxiv icon

Speech Emotion Recognition using Attention-based LSTM-Network with Residual Connection

Add code
Jun 02, 2026
Viaarxiv icon

Speaker-Invariant Representation Learning for Spoofing Detection via Gradient Reversal and A Variational Information Bottleneck

Add code
Jun 07, 2026
Viaarxiv icon

Echo: A Joint-Embedding Predictive Architecture for Speaker Diarization and Speech Recognition in a Shared Latent Space

Add code
Jun 01, 2026
Viaarxiv icon

Efficient ASR Training with Conversations that Never Happened

Add code
Jun 02, 2026
Viaarxiv icon

Long-Term and Short-Term Transistor Aging in Deep Neural Networks: Impact and Mitigation

Add code
Jun 02, 2026
Viaarxiv icon

Spiking and Event-driven Neuromorphic Mamba Models for Efficient Speech Recognition

Add code
May 31, 2026
Viaarxiv icon