speech


Improving Named Entity Transcription with Contextual LLM-based Revision

Add code
Jun 12, 2025
Viaarxiv icon

FairASR: Fair Audio Contrastive Learning for Automatic Speech Recognition

Add code
Jun 12, 2025
Viaarxiv icon

Robust Unsupervised Adaptation of a Speech Recogniser Using Entropy Minimisation and Speaker Codes

Add code
Jun 12, 2025
Viaarxiv icon

Beyond Single-User Dialogue: Assessing Multi-User Dialogue State Tracking Capabilities of Large Language Models

Add code
Jun 12, 2025
Viaarxiv icon

Joint ASR and Speaker Role Tagging with Serialized Output Training

Add code
Jun 12, 2025
Viaarxiv icon

RT-VC: Real-Time Zero-Shot Voice Conversion with Speech Articulatory Coding

Add code
Jun 12, 2025
Viaarxiv icon

Scheduled Interleaved Speech-Text Training for Speech-to-Speech Translation with LLMs

Add code
Jun 12, 2025
Viaarxiv icon

FASCIST-O-METER: Classifier for Neo-fascist Discourse Online

Add code
Jun 12, 2025
Viaarxiv icon

Developing a High-performance Framework for Speech Emotion Recognition in Naturalistic Conditions Challenge for Emotional Attribute Prediction

Add code
Jun 12, 2025
Viaarxiv icon

GLAP: General contrastive audio-text pretraining across domains and languages

Add code
Jun 12, 2025
Viaarxiv icon