speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Beyond Classification: Towards Speech Emotion Reasoning with Multitask AudioLLMs

Add code
Jun 07, 2025
Viaarxiv icon

A Survey of Retentive Network

Add code
Jun 07, 2025
Viaarxiv icon

Low-Resource Domain Adaptation for Speech LLMs via Text-Only Fine-Tuning

Add code
Jun 06, 2025
Viaarxiv icon

Lightweight Prompt Biasing for Contextualized End-to-End ASR Systems

Add code
Jun 06, 2025
Viaarxiv icon

Phonetically-Augmented Discriminative Rescoring for Voice Search Error Correction

Add code
Jun 06, 2025
Viaarxiv icon

From Flat to Feeling: A Feasibility and Impact Study on Dynamic Facial Emotions in AI-Generated Avatars

Add code
Jun 16, 2025
Viaarxiv icon

LESS: Large Language Model Enhanced Semi-Supervised Learning for Speech Foundational Models

Add code
Jun 05, 2025
Viaarxiv icon

Effects of Speaker Count, Duration, and Accent Diversity on Zero-Shot Accent Robustness in Low-Resource ASR

Add code
Jun 04, 2025
Viaarxiv icon

Structured State Space Model Dynamics and Parametrization for Spiking Neural Networks

Add code
Jun 04, 2025
Viaarxiv icon

SocialDF: Benchmark Dataset and Detection Model for Mitigating Harmful Deepfake Content on Social Media Platforms

Add code
Jun 05, 2025
Viaarxiv icon