speech


mSTEB: Massively Multilingual Evaluation of LLMs on Speech and Text Tasks

Jun 10, 2025
Via arXiv

Pureformer-VC: Non-parallel Voice Conversion with Pure Stylized Transformer Blocks and Triplet Discriminative Training

Jun 10, 2025
Via arXiv

Neighbors and relatives: How do speech embeddings reflect linguistic connections across the world?

Jun 10, 2025
Via arXiv

MOSAIC-F: A Framework for Enhancing Students' Oral Presentation Skills through Personalized Feedback

Jun 10, 2025
Via arXiv

$(RSA)^2$: A Rhetorical-Strategy-Aware Rational Speech Act Framework for Figurative Language Understanding

Jun 10, 2025
Via arXiv

Employing self-supervised learning models for cross-linguistic child speech maturity classification

Jun 10, 2025
Via arXiv

SPBA: Utilizing Speech Large Language Model for Backdoor Attacks on Speech Classification Models

Jun 10, 2025
Via arXiv

Multi-Teacher Language-Aware Knowledge Distillation for Multilingual Speech Emotion Recognition

Jun 10, 2025
Via arXiv

FROST-EMA: Finnish and Russian Oral Speech Dataset of Electromagnetic Articulography Measurements with L1, L2 and Imitated L2 Accents

Jun 10, 2025
Via arXiv

SimClass: A Classroom Speech Dataset Generated via Game Engine Simulation For Automatic Speech Recognition Research

Jun 10, 2025
Via arXiv