speech


Exploration of Perceptual Speech Features for Clinical Decision-Support in Mental Health Care

May 27, 2026
Via arXiv

LoSATok: Low-dimensional Semantic-Acoustic Tokenizer for Cross-Domain Audio Understanding and Generation

May 27, 2026
Via arXiv

Diffusion Large Language Models for Visual Speech Recognition

May 27, 2026
Via arXiv

EchoAvatar: Real-time Generative Avatar Animation from Audio Streams

May 27, 2026
Via arXiv

PilotTTS: A Disciplined Modular Recipe for Competitive Speech Synthesis

May 27, 2026
Via arXiv

A Dataset of Robot-Patient and Doctor-Patient Medical Dialogues for Spoken Language Processing Tasks

May 26, 2026
Via arXiv

Debate Helps Weak Judges Reward Stronger Models

May 26, 2026
Via arXiv

Beyond Binary: Speech Representations Across the Cognitive Score Hierarchy

May 26, 2026
Via arXiv

UNIQUE: Universal Top-k Sparse Attention for Training-free Inference and Sparsity-aware Training

May 26, 2026
Via arXiv

Cyberbullying Governance on Social Media: A Unified Framework from Content Identification to Intervention

May 26, 2026
Via arXiv