Acoustic


Zero-Shot TTS With Enhanced Audio Prompts: Bsc Submission For The 2026 Wildspoof Challenge TTS Track

Add code
Feb 05, 2026
Viaarxiv icon

Sounding Highlights: Dual-Pathway Audio Encoders for Audio-Visual Video Highlight Detection

Add code
Feb 05, 2026
Viaarxiv icon

AudioSAE: Towards Understanding of Audio-Processing Models with Sparse AutoEncoders

Add code
Feb 04, 2026
Viaarxiv icon

DementiaBank-Emotion: A Multi-Rater Emotion Annotation Corpus for Alzheimer's Disease Speech (Version 1.0)

Add code
Feb 04, 2026
Viaarxiv icon

Universal Robust Speech Adaptation for Cross-Domain Speech Recognition and Enhancement

Add code
Feb 04, 2026
Viaarxiv icon

Multilingual Extraction and Recognition of Implicit Discourse Relations in Speech and Text

Add code
Feb 04, 2026
Viaarxiv icon

Decoupled Hierarchical Distillation for Multimodal Emotion Recognition

Add code
Feb 04, 2026
Viaarxiv icon

RIR-Former: Coordinate-Guided Transformer for Continuous Reconstruction of Room Impulse Responses

Add code
Feb 03, 2026
Viaarxiv icon

Adaptive Evidence Weighting for Audio-Spatiotemporal Fusion

Add code
Feb 03, 2026
Viaarxiv icon

Conditional Flow Matching for Visually-Guided Acoustic Highlighting

Add code
Feb 03, 2026
Viaarxiv icon