speech


Continual Learning for Acoustic Event Classification

Add code
Dec 10, 2025
Viaarxiv icon

VABench: A Comprehensive Benchmark for Audio-Video Generation

Add code
Dec 10, 2025
Viaarxiv icon

NeuroSketch: An Effective Framework for Neural Decoding via Systematic Architectural Optimization

Add code
Dec 10, 2025
Viaarxiv icon

UniLS: End-to-End Audio-Driven Avatars for Unified Listening and Speaking

Add code
Dec 10, 2025
Viaarxiv icon

VocSim: A Training-free Benchmark for Zero-shot Content Identity in Single-source Audio

Add code
Dec 10, 2025
Viaarxiv icon

Robust Speech Activity Detection in the Presence of Singing Voice

Add code
Dec 10, 2025
Viaarxiv icon

Can LLMs Evaluate What They Cannot Annotate? Revisiting LLM Reliability in Hate Speech Detection

Add code
Dec 10, 2025
Viaarxiv icon

A Survey of Body and Face Motion: Datasets, Performance Evaluation Metrics and Generative Techniques

Add code
Dec 09, 2025
Viaarxiv icon

SpeechQualityLLM: LLM-Based Multimodal Assessment of Speech Quality

Add code
Dec 09, 2025
Viaarxiv icon

BUT Systems for Environmental Sound Deepfake Detection in the ESDD 2026 Challenge

Add code
Dec 09, 2025
Viaarxiv icon