Spoken Language Understanding


DiffuSpeech: Silent Thought, Spoken Answer via Unified Speech-Text Diffusion

Add code
Jan 30, 2026
Viaarxiv icon

Lost in Transcription: How Speech-to-Text Errors Derail Code Understanding

Add code
Jan 20, 2026
Viaarxiv icon

Deaf and Hard of Hearing Access to Intelligent Personal Assistants: Comparison of Voice-Based Options with an LLM-Powered Touch Interface

Add code
Jan 21, 2026
Viaarxiv icon

Social Caption: Evaluating Social Understanding in Multimodal Models

Add code
Jan 21, 2026
Viaarxiv icon

Credit C-GPT: A Domain-Specialized Large Language Model for Conversational Understanding in Vietnamese Debt Collection

Add code
Jan 15, 2026
Viaarxiv icon

MCGA: A Multi-task Classical Chinese Literary Genre Audio Corpus

Add code
Jan 14, 2026
Viaarxiv icon

MoST: Mixing Speech and Text with Modality-Aware Mixture of Experts

Add code
Jan 15, 2026
Viaarxiv icon

PALM-Bench: A Comprehensive Benchmark for Personalized Audio-Language Models

Add code
Jan 07, 2026
Viaarxiv icon

Multi-Intent Spoken Language Understanding: Methods, Trends, and Challenges

Add code
Dec 12, 2025
Viaarxiv icon

MiMo-Audio: Audio Language Models are Few-Shot Learners

Add code
Dec 29, 2025
Viaarxiv icon