speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

TRIDENT: A Redundant Architecture for Caribbean-Accented Emergency Speech Triage

Add code
Dec 11, 2025
Viaarxiv icon

Robust Speech Activity Detection in the Presence of Singing Voice

Add code
Dec 10, 2025
Figure 1 for Robust Speech Activity Detection in the Presence of Singing Voice
Figure 2 for Robust Speech Activity Detection in the Presence of Singing Voice
Figure 3 for Robust Speech Activity Detection in the Presence of Singing Voice
Figure 4 for Robust Speech Activity Detection in the Presence of Singing Voice
Viaarxiv icon

GeoSense-AI: Fast Location Inference from Crisis Microblogs

Add code
Dec 20, 2025
Figure 1 for GeoSense-AI: Fast Location Inference from Crisis Microblogs
Figure 2 for GeoSense-AI: Fast Location Inference from Crisis Microblogs
Figure 3 for GeoSense-AI: Fast Location Inference from Crisis Microblogs
Figure 4 for GeoSense-AI: Fast Location Inference from Crisis Microblogs
Viaarxiv icon

Evaluation of Generative Models for Emotional 3D Animation Generation in VR

Add code
Dec 18, 2025
Viaarxiv icon

DASH: Dialogue-Aware Similarity and Handshake Recognition for Topic Segmentation in Public-Channel Conversations

Add code
Dec 17, 2025
Viaarxiv icon

Efficient ASR for Low-Resource Languages: Leveraging Cross-Lingual Unlabeled Data

Add code
Dec 08, 2025
Viaarxiv icon

A Simple Method to Enhance Pre-trained Language Models with Speech Tokens for Classification

Add code
Dec 08, 2025
Viaarxiv icon

Stronger Normalization-Free Transformers

Add code
Dec 11, 2025
Viaarxiv icon

Poster: Recognizing Hidden-in-the-Ear Private Key for Reliable Silent Speech Interface Using Multi-Task Learning

Add code
Dec 18, 2025
Viaarxiv icon

NagaNLP: Bootstrapping NLP for Low-Resource Nagamese Creole with Human-in-the-Loop Synthetic Data

Add code
Dec 14, 2025
Viaarxiv icon