speech


UniWhisper: Efficient Continual Multi-task Training for Robust Universal Audio Representation

Add code
Feb 25, 2026
Viaarxiv icon

The Design Space of Tri-Modal Masked Diffusion Models

Add code
Feb 25, 2026
Viaarxiv icon

mmWave Radar Aware Dual-Conditioned GAN for Speech Reconstruction of Signals With Low SNR

Add code
Feb 25, 2026
Viaarxiv icon

Absorbing Discrete Diffusion for Speech Enhancement

Add code
Feb 25, 2026
Viaarxiv icon

Robust Long-Form Bangla Speech Processing: Automatic Speech Recognition and Speaker Diarization

Add code
Feb 25, 2026
Viaarxiv icon

WaveSSM: Multiscale State-Space Models for Non-stationary Signal Attention

Add code
Feb 25, 2026
Viaarxiv icon

iMiGUE-Speech: A Spontaneous Speech Dataset for Affective Analysis

Add code
Feb 25, 2026
Viaarxiv icon

Continuous Telemonitoring of Heart Failure using Personalised Speech Dynamics

Add code
Feb 25, 2026
Viaarxiv icon

A Fusion of context-aware based BanglaBERT and Two-Layer Stacked LSTM Framework for Multi-Label Cyberbullying Detection

Add code
Feb 25, 2026
Viaarxiv icon

Detecting Hate and Inflammatory Content in Bengali Memes: A New Multimodal Dataset and Co-Attention Framework

Add code
Feb 25, 2026
Viaarxiv icon