speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Automatic Speech Recognition for African Low-Resource Languages: Challenges and Future Directions

Add code
May 16, 2025
Viaarxiv icon

FiLLM -- A Filipino-optimized Large Language Model based on Southeast Asia Large Language Model (SEALLM)

Add code
May 25, 2025
Viaarxiv icon

Dual Precision Quantization for Efficient and Accurate Deep Neural Networks Inference

Add code
May 20, 2025
Viaarxiv icon

Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits

Add code
May 20, 2025
Viaarxiv icon

Calm-Whisper: Reduce Whisper Hallucination On Non-Speech By Calming Crazy Heads Down

Add code
May 19, 2025
Viaarxiv icon

KIT's Offline Speech Translation and Instruction Following Submission for IWSLT 2025

Add code
May 19, 2025
Viaarxiv icon

Cross-modal Knowledge Transfer Learning as Graph Matching Based on Optimal Transport for ASR

Add code
May 19, 2025
Viaarxiv icon

CAMEO: Collection of Multilingual Emotional Speech Corpora

Add code
May 16, 2025
Viaarxiv icon

Inclusivity of AI Speech in Healthcare: A Decade Look Back

Add code
May 15, 2025
Viaarxiv icon

On Multilingual Encoder Language Model Compression for Low-Resource Languages

Add code
May 22, 2025
Viaarxiv icon