speech


Lyapunov Spectral Analysis of Speech Embedding Trajectories in Psychosis

Feb 18, 2026
Via arXiv

Color-based Emotion Representation for Speech Emotion Recognition

Feb 18, 2026
Via arXiv

LLM-to-Speech: A Synthetic Data Pipeline for Training Dialectal Text-to-Speech Models

Feb 17, 2026
Via arXiv

Enroll-on-Wakeup: A First Comparative Study of Target Speech Extraction for Seamless Interaction in Real Noisy Human-Machine Dialogue Scenarios

Feb 17, 2026
Via arXiv

The Equalizer: Introducing Shape-Gain Decomposition in Neural Audio Codecs

Feb 17, 2026
Via arXiv

Bottleneck Transformer-Based Approach for Improved Automatic STOI Score Prediction

Feb 17, 2026
Via arXiv

Joint Enhancement and Classification using Coupled Diffusion Models of Signals and Logits

Feb 17, 2026
Via arXiv

What Do Neurons Listen To? A Neuron-level Dissection of a General-purpose Audio Model

Feb 17, 2026
Via arXiv

Clinically Inspired Symptom-Guided Depression Detection from Emotion-Aware Speech Representations

Feb 17, 2026
Via arXiv

Under-resourced studies of under-resourced languages: lemmatization and POS-tagging with LLM annotators for historical Armenian, Georgian, Greek and Syriac

Feb 17, 2026
Via arXiv