speech


A Sociolinguistic Analysis of Automatic Speech Recognition Bias in Newcastle English

Add code
Mar 25, 2026
Viaarxiv icon

Robust Multilingual Text-to-Pictogram Mapping for Scalable Reading Rehabilitation

Add code
Mar 25, 2026
Viaarxiv icon

When AI Meets Early Childhood Education: Large Language Models as Assessment Teammates in Chinese Preschools

Add code
Mar 25, 2026
Viaarxiv icon

Semantic Alignment across Ancient Egyptian Language Stages via Normalization-Aware Multitask Learning

Add code
Mar 25, 2026
Viaarxiv icon

Crab: Multi Layer Contrastive Supervision to Improve Speech Emotion Recognition Under Both Acted and Natural Speech Condition

Add code
Mar 24, 2026
Viaarxiv icon

A Multimodal Framework for Human-Multi-Agent Interaction

Add code
Mar 24, 2026
Viaarxiv icon

Who Spoke What When? Evaluating Spoken Language Models for Conversational ASR with Semantic and Overlap-Aware Metrics

Add code
Mar 24, 2026
Viaarxiv icon

Ethio-ASR: Joint Multilingual Speech Recognition and Language Identification for Ethiopian Languages

Add code
Mar 24, 2026
Viaarxiv icon

InterDyad: Interactive Dyadic Speech-to-Video Generation by Querying Intermediate Visual Guidance

Add code
Mar 24, 2026
Viaarxiv icon

Autoregressive Guidance of Deep Spatially Selective Filters using Bayesian Tracking for Efficient Extraction of Moving Speakers

Add code
Mar 24, 2026
Viaarxiv icon