speech


VisG AV-HuBERT: Viseme-Guided AV-HuBERT

Apr 01, 2026

English to Central Kurdish Speech Translation: Corpus Creation, Evaluation, and Orthographic Standardization

Apr 01, 2026

TRACE: Training-Free Partial Audio Deepfake Detection via Embedding Trajectory Analysis of Speech Foundation Models

Apr 01, 2026

Speech LLMs are Contextual Reasoning Transcribers

Apr 01, 2026

Adapting Text LLMs to Speech via Multimodal Depth Up-Scaling

Apr 01, 2026

Frege in the Flesh: Biolinguistics and the Neural Enforcement of Syntactic Structures

Mar 31, 2026

Vocal Prognostic Digital Biomarkers in Monitoring Chronic Heart Failure: A Longitudinal Observational Study

Mar 31, 2026

MambaVoiceCloning: Efficient and Expressive Text-to-Speech via State-Space Modeling and Diffusion Control

Mar 31, 2026

Covertly improving intelligibility with data-driven adaptations of speech timing

Mar 31, 2026

Can LLM Agents Identify Spoken Dialects like a Linguist?

Mar 31, 2026