Speech recognition


Speech recognition is the task of identifying words spoken aloud by analyzing the voice and language, and transcribing them accurately as text.

A stylometric analysis of speaker attribution from speech transcripts

Dec 18, 2025

EEG-to-Voice Decoding of Spoken and Imagined speech Using Non-Invasive EEG

Dec 14, 2025

All-in-One ASR: Unifying Encoder-Decoder Models of CTC, Attention, and Transducer in Dual-Mode ASR

Dec 12, 2025

TRIDENT: A Redundant Architecture for Caribbean-Accented Emergency Speech Triage

Dec 11, 2025

Robust Speech Activity Detection in the Presence of Singing Voice

Dec 10, 2025

GeoSense-AI: Fast Location Inference from Crisis Microblogs

Dec 20, 2025

Efficient ASR for Low-Resource Languages: Leveraging Cross-Lingual Unlabeled Data

Dec 08, 2025

A Simple Method to Enhance Pre-trained Language Models with Speech Tokens for Classification

Dec 08, 2025

Evaluation of Generative Models for Emotional 3D Animation Generation in VR

Dec 18, 2025

Stronger Normalization-Free Transformers

Dec 11, 2025