Speech Recognition


Speech recognition is the task of identifying spoken words by analyzing voice and language, then transcribing them accurately as text.

Robust Speech Activity Detection in the Presence of Singing Voice

Dec 10, 2025
Via arXiv

Efficient ASR for Low-Resource Languages: Leveraging Cross-Lingual Unlabeled Data

Dec 08, 2025
Via arXiv

A Simple Method to Enhance Pre-trained Language Models with Speech Tokens for Classification

Dec 08, 2025
Via arXiv

GeoSense-AI: Fast Location Inference from Crisis Microblogs

Dec 20, 2025
Via arXiv

Evaluation of Generative Models for Emotional 3D Animation Generation in VR

Dec 18, 2025
Via arXiv

Stronger Normalization-Free Transformers

Dec 11, 2025
Via arXiv

DASH: Dialogue-Aware Similarity and Handshake Recognition for Topic Segmentation in Public-Channel Conversations

Dec 17, 2025
Via arXiv

NagaNLP: Bootstrapping NLP for Low-Resource Nagamese Creole with Human-in-the-Loop Synthetic Data

Dec 14, 2025
Via arXiv

ImageTalk: Designing a Multimodal AAC Text Generation System Driven by Image Recognition and Natural Language Generation

Dec 10, 2025
Via arXiv

Poster: Recognizing Hidden-in-the-Ear Private Key for Reliable Silent Speech Interface Using Multi-Task Learning

Dec 18, 2025
Via arXiv