Speech


xList-Hate: A Checklist-Based Framework for Interpretable and Generalizable Hate Speech Detection

Add code
Feb 05, 2026
Viaarxiv icon

Zero-Shot TTS With Enhanced Audio Prompts: Bsc Submission For The 2026 Wildspoof Challenge TTS Track

Add code
Feb 05, 2026
Viaarxiv icon

Wave-Trainer-Fit: Neural Vocoder with Trainable Prior and Fixed-Point Iteration towards High-Quality Speech Generation from SSL features

Add code
Feb 05, 2026
Viaarxiv icon

Enabling Automatic Disordered Speech Recognition: An Impaired Speech Dataset in the Akan Language

Add code
Feb 05, 2026
Viaarxiv icon

ARCHI-TTS: A flow-matching-based Text-to-Speech Model with Self-supervised Semantic Aligner and Accelerated Inference

Add code
Feb 05, 2026
Viaarxiv icon

Sounding Highlights: Dual-Pathway Audio Encoders for Audio-Visual Video Highlight Detection

Add code
Feb 05, 2026
Viaarxiv icon

Beyond Length: Context-Aware Expansion and Independence as Developmentally Sensitive Evaluation in Child Utterances

Add code
Feb 05, 2026
Viaarxiv icon

Bagpiper: Solving Open-Ended Audio Tasks via Rich Captions

Add code
Feb 05, 2026
Viaarxiv icon

Speech Emotion Recognition Leveraging OpenAI's Whisper Representations and Attentive Pooling Methods

Add code
Feb 05, 2026
Viaarxiv icon

LinGO: A Linguistic Graph Optimization Framework with LLMs for Interpreting Intents of Online Uncivil Discourse

Add code
Feb 04, 2026
Viaarxiv icon