speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Dynamic Quantization Error Propagation in Encoder-Decoder ASR Quantization

Add code
Jan 05, 2026
Viaarxiv icon

IO-RAE: Information-Obfuscation Reversible Adversarial Example for Audio Privacy Protection

Add code
Jan 03, 2026
Viaarxiv icon

VALLR-Pin: Uncertainty-Factorized Visual Speech Recognition for Mandarin with Pinyin Guidance

Add code
Dec 29, 2025
Viaarxiv icon

VocalBridge: Latent Diffusion-Bridge Purification for Defeating Perturbation-Based Voiceprint Defenses

Add code
Jan 05, 2026
Viaarxiv icon

Learning Speech Representations with Variational Predictive Coding

Add code
Dec 31, 2025
Viaarxiv icon

Three factor delay learning rules for spiking neural networks

Add code
Jan 02, 2026
Viaarxiv icon

Distilled HuBERT for Mobile Speech Emotion Recognition: A Cross-Corpus Validation Study

Add code
Dec 31, 2025
Viaarxiv icon

Index-ASR Technical Report

Add code
Dec 31, 2025
Viaarxiv icon

Enhancing Fully Formatted End-to-End Speech Recognition with Knowledge Distillation via Multi-Codebook Vector Quantization

Add code
Dec 22, 2025
Figure 1 for Enhancing Fully Formatted End-to-End Speech Recognition with Knowledge Distillation via Multi-Codebook Vector Quantization
Figure 2 for Enhancing Fully Formatted End-to-End Speech Recognition with Knowledge Distillation via Multi-Codebook Vector Quantization
Figure 3 for Enhancing Fully Formatted End-to-End Speech Recognition with Knowledge Distillation via Multi-Codebook Vector Quantization
Figure 4 for Enhancing Fully Formatted End-to-End Speech Recognition with Knowledge Distillation via Multi-Codebook Vector Quantization
Viaarxiv icon

VALLR-Pin: Dual-Decoding Visual Speech Recognition for Mandarin with Pinyin-Guided LLM Refinement

Add code
Dec 23, 2025
Viaarxiv icon