speech


MOSAIC-F: A Framework for Enhancing Students' Oral Presentation Skills through Personalized Feedback

Jun 10, 2025
Via arXiv

$(RSA)^2$: A Rhetorical-Strategy-Aware Rational Speech Act Framework for Figurative Language Understanding

Jun 10, 2025
Via arXiv

Employing self-supervised learning models for cross-linguistic child speech maturity classification

Jun 10, 2025
Via arXiv

SPBA: Utilizing Speech Large Language Model for Backdoor Attacks on Speech Classification Models

Jun 10, 2025
Via arXiv

Multi-Teacher Language-Aware Knowledge Distillation for Multilingual Speech Emotion Recognition

Jun 10, 2025
Via arXiv

FROST-EMA: Finnish and Russian Oral Speech Dataset of Electromagnetic Articulography Measurements with L1, L2 and Imitated L2 Accents

Jun 10, 2025
Via arXiv

SimClass: A Classroom Speech Dataset Generated via Game Engine Simulation For Automatic Speech Recognition Research

Jun 10, 2025
Via arXiv

Seeing Voices: Generating A-Roll Video from Audio with Mirage

Jun 09, 2025
Via arXiv

Uncovering the Functional Roles of Nonlinearity in Memory

Jun 09, 2025
Via arXiv

Swiss Parliaments Corpus Re-Imagined (SPC_R): Enhanced Transcription with RAG-based Correction and Predicted BLEU

Jun 09, 2025
Via arXiv