speech


Persona Prompting as a Lens on LLM Social Reasoning

Jan 28, 2026
Via arXiv

Audio Deepfake Detection in the Age of Advanced Text-to-Speech models

Jan 28, 2026
Via arXiv

Text-only adaptation in LLM-based ASR through text denoising

Jan 28, 2026
Via arXiv

Self Voice Conversion as an Attack against Neural Audio Watermarking

Jan 28, 2026
Via arXiv

SpeechMapper: Speech-to-text Embedding Projector for LLMs

Jan 28, 2026
Via arXiv

A Study of Data Selection Strategies for Pre-training Self-Supervised Speech Models

Jan 28, 2026
Via arXiv

ASR for Affective Speech: Investigating Impact of Emotion and Speech Generative Strategy

Jan 28, 2026
Via arXiv

SoftHateBench: Evaluating Moderation Models Against Reasoning-Driven, Policy-Compliant Hostility

Jan 28, 2026
Via arXiv

Improving X-Codec-2.0 for Multi-Lingual Speech: 25 Hz Latent Rate and 24 kHz Sampling

Jan 28, 2026
Via arXiv

SAM Audio Judge: A Unified Multimodal Framework for Perceptual Evaluation of Audio Separation

Jan 27, 2026
Via arXiv