speech


From Text to Voice: A Reproducible and Verifiable Framework for Evaluating Tool Calling LLM Agents

May 14, 2026

PROCESS-2: A Benchmark Speech Corpus for Early Cognitive Impairment Detection

May 14, 2026

REALM: Retrospective Encoder Alignment for LFP Modeling

May 14, 2026

Streaming Speech-to-Text Translation with a SpeechLLM

May 14, 2026

IsoNet: Spatially-aware audio-visual target speech extraction in complex acoustic environments

May 14, 2026

UMo: Unified Sparse Motion Modeling for Real-Time Co-Speech Avatars

May 14, 2026

A Calculus-Based Framework for Determining Vocabulary Size in End-to-End ASR

May 14, 2026

Leveraging Speech to Identify Signatures of Insight and Transfer in Problem Solving

May 14, 2026

Measuring and Mitigating Toxicity in Large Language Models: A Comprehensive Replication Study

May 13, 2026

A Benchmark for Early-stage Parkinson's Disease Detection from Speech

May 13, 2026