speech


Infrequent Child-Directed Speech Is Bursty and May Draw Infant Vocalizations

Mar 25, 2026

Unified Diffusion Refinement for Multi-Channel Speech Enhancement and Separation

Mar 25, 2026

A Sociolinguistic Analysis of Automatic Speech Recognition Bias in Newcastle English

Mar 25, 2026

When AI Meets Early Childhood Education: Large Language Models as Assessment Teammates in Chinese Preschools

Mar 25, 2026

Crab: Multi Layer Contrastive Supervision to Improve Speech Emotion Recognition Under Both Acted and Natural Speech Condition

Mar 24, 2026

InterDyad: Interactive Dyadic Speech-to-Video Generation by Querying Intermediate Visual Guidance

Mar 24, 2026

Autoregressive Guidance of Deep Spatially Selective Filters using Bayesian Tracking for Efficient Extraction of Moving Speakers

Mar 24, 2026

Evaluating a Multi-Agent Voice-Enabled Smart Speaker for Care Homes: A Safety-Focused Framework

Mar 24, 2026

Prompt Amplification and Zero-Shot Late Fusion in Audio-Language Models for Speech Emotion Recognition

Mar 24, 2026

Beyond Hate: Differentiating Uncivil and Intolerant Speech in Multimodal Content Moderation

Mar 24, 2026