Speech recognition


Speech recognition is the task of identifying words spoken aloud by analyzing the speaker's voice and language, then transcribing those words accurately as text.

Kimi-Audio Technical Report

Apr 25, 2025

Advancing Arabic Speech Recognition Through Large-Scale Weakly Supervised Learning

Apr 16, 2025

StableQuant: Layer Adaptive Post-Training Quantization for Speech Foundation Models

Apr 21, 2025

PsyCounAssist: A Full-Cycle AI-Powered Psychological Counseling Assistant System

Apr 23, 2025

LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale

Apr 22, 2025

Dysarthria Normalization via Local Lie Group Transformations for Robust ASR

Apr 16, 2025

DEEMO: De-identity Multimodal Emotion Recognition and Reasoning

Apr 28, 2025

Acoustic to Articulatory Inversion of Speech; Data Driven Approaches, Challenges, Applications, and Future Scope

Apr 17, 2025

SoCov: Semi-Orthogonal Parametric Pooling of Covariance Matrix for Speaker Recognition

Apr 23, 2025

Optimism, Expectation, or Sarcasm? Multi-Class Hope Speech Detection in Spanish and English

Apr 24, 2025