Audio Emotion Recognition


EmoNet-Voice: A Fine-Grained, Expert-Verified Benchmark for Speech Emotion Detection

Add code
Jun 11, 2025
Viaarxiv icon

CoMuMDR: Code-mixed Multi-modal Multi-domain corpus for Discourse paRsing in conversations

Add code
Jun 10, 2025
Viaarxiv icon

Beyond Classification: Towards Speech Emotion Reasoning with Multitask AudioLLMs

Add code
Jun 07, 2025
Viaarxiv icon

ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs

Add code
May 26, 2025
Viaarxiv icon

CosyVoice 3: Towards In-the-wild Speech Generation via Scaling-up and Post-training

Add code
May 23, 2025
Viaarxiv icon

X-ARES: A Comprehensive Framework for Assessing Audio Encoder Performance

Add code
May 22, 2025
Viaarxiv icon

Audio-to-Audio Emotion Conversion With Pitch And Duration Style Transfer

Add code
May 23, 2025
Viaarxiv icon

VAEmo: Efficient Representation Learning for Visual-Audio Emotion with Knowledge Injection

Add code
May 05, 2025
Viaarxiv icon

DEEMO: De-identity Multimodal Emotion Recognition and Reasoning

Add code
Apr 28, 2025
Viaarxiv icon

Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play

Add code
May 05, 2025
Viaarxiv icon