speech recognition


Speech recognition is the task of identifying words spoken aloud, analyzing the voice and language, and accurately transcribing the words.

Stronger Normalization-Free Transformers

Add code
Dec 11, 2025
Viaarxiv icon

NagaNLP: Bootstrapping NLP for Low-Resource Nagamese Creole with Human-in-the-Loop Synthetic Data

Add code
Dec 14, 2025
Viaarxiv icon

Efficient ASR for Low-Resource Languages: Leveraging Cross-Lingual Unlabeled Data

Add code
Dec 08, 2025
Viaarxiv icon

A Simple Method to Enhance Pre-trained Language Models with Speech Tokens for Classification

Add code
Dec 08, 2025
Viaarxiv icon

ImageTalk: Designing a Multimodal AAC Text Generation System Driven by Image Recognition and Natural Language Generation

Add code
Dec 10, 2025
Viaarxiv icon

Toward Conversational Hungarian Speech Recognition: Introducing the BEA-Large and BEA-Dialogue Datasets

Add code
Nov 17, 2025
Figure 1 for Toward Conversational Hungarian Speech Recognition: Introducing the BEA-Large and BEA-Dialogue Datasets
Figure 2 for Toward Conversational Hungarian Speech Recognition: Introducing the BEA-Large and BEA-Dialogue Datasets
Figure 3 for Toward Conversational Hungarian Speech Recognition: Introducing the BEA-Large and BEA-Dialogue Datasets
Figure 4 for Toward Conversational Hungarian Speech Recognition: Introducing the BEA-Large and BEA-Dialogue Datasets
Viaarxiv icon

TTA: Transcribe, Translate and Alignment for Cross-lingual Speech Representation

Add code
Nov 18, 2025
Viaarxiv icon

Speech-Aware Long Context Pruning and Integration for Contextualized Automatic Speech Recognition

Add code
Nov 14, 2025
Viaarxiv icon

Context-Aware Dynamic Chunking for Streaming Tibetan Speech Recognition

Add code
Nov 12, 2025
Viaarxiv icon

Spatial Blind Spot: Auditory Motion Perception Deficits in Audio LLMs

Add code
Nov 17, 2025
Viaarxiv icon