speech


IHBench: Evaluating Post-Interruption Recovery in Voice Agents with Structured Workflows

Jun 17, 2026
Via arXiv

RIVET: Robust Idempotent Voice Attribute Editing

Jun 17, 2026
Via arXiv

Reference-Driven Multi-Speaker Audio Scene Generation from In-the-Wild Priors

Jun 17, 2026
Via arXiv

Generating Natural and Expressive Robot Gestures through Iterative Reinforcement Learning with Human Feedback using LLMs

Jun 17, 2026
Via arXiv

Augmenting Dysarthric Speech Severity Assessment with MOS Supervision

Jun 17, 2026
Via arXiv

IndicContextEval: A Benchmark for Evaluating Context Utilisation in Audio Large Language Models Across 8 Indic Languages

Jun 17, 2026
Via arXiv

DASH: Dual-View Self-Distillation with Multi-Layer Hidden Representations for Robust Speech Recognition

Jun 17, 2026
Via arXiv

Mitigating Scoring Errors and Compensating for Nonverbal Subtests in Speech-Based Dementia Assessment

Jun 17, 2026
Via arXiv

Continuous-Speech Parkinson's Disease Detection Using Acoustic and Inharmonicity Features

Jun 17, 2026
Via arXiv

Adaptive Speech-to-Spike Encoding for Spiking Neural Networks

Jun 17, 2026
Via arXiv