speech


FlowFake: Liquid Networks for Audio Deepfake Detection

Add code
Jun 17, 2026
Viaarxiv icon

Reference-Driven Multi-Speaker Audio Scene Generation from In-the-Wild Priors

Add code
Jun 17, 2026
Viaarxiv icon

Aligning Implied Statements for Implicit Hate Speech Generalizability with Context-Bounded Semi-hard Negative Mining

Add code
Jun 17, 2026
Viaarxiv icon

Generating Natural and Expressive Robot Gestures through Iterative Reinforcement Learning with Human Feedback using LLMs

Add code
Jun 17, 2026
Viaarxiv icon

Augmenting Dysarthric Speech Severity Assessment with MOS Supervision

Add code
Jun 17, 2026
Viaarxiv icon

Low-resource Language Discrimination Towards Chinese Dialects with Transfer learning and Data Augmentation

Add code
Jun 17, 2026
Viaarxiv icon

Speech-Driven End-to-End Language Discrimination towards Chinese Dialects

Add code
Jun 17, 2026
Viaarxiv icon

Fair Cognitive Impairment Detection Through Unlearning

Add code
Jun 17, 2026
Viaarxiv icon

S-JEPA : Soft Clustering Anchors for Self-Supervised Speech Representation Learning

Add code
Jun 17, 2026
Viaarxiv icon

IHBench: Evaluating Post-Interruption Recovery in Voice Agents with Structured Workflows

Add code
Jun 17, 2026
Viaarxiv icon