speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

BERSting at the Screams: A Benchmark for Distanced, Emotional and Shouted Speech Recognition

Add code
Apr 30, 2025
Viaarxiv icon

Retrieval-Enhanced Few-Shot Prompting for Speech Event Extraction

Add code
Apr 30, 2025
Viaarxiv icon

A Comprehensive Part-of-Speech Tagging to Standardize Central-Kurdish Language: A Research Guide for Kurdish Natural Language Processing Tasks

Add code
Apr 28, 2025
Viaarxiv icon

Improving Pretrained YAMNet for Enhanced Speech Command Detection via Transfer Learning

Add code
Apr 26, 2025
Viaarxiv icon

DEEMO: De-identity Multimodal Emotion Recognition and Reasoning

Add code
Apr 28, 2025
Viaarxiv icon

Kimi-Audio Technical Report

Add code
Apr 25, 2025
Viaarxiv icon

TinyML for Speech Recognition

Add code
Apr 22, 2025
Viaarxiv icon

Chinese-LiPS: A Chinese audio-visual speech recognition dataset with Lip-reading and Presentation Slides

Add code
Apr 21, 2025
Viaarxiv icon

PsyCounAssist: A Full-Cycle AI-Powered Psychological Counseling Assistant System

Add code
Apr 23, 2025
Viaarxiv icon

LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale

Add code
Apr 22, 2025
Viaarxiv icon