speech


Discriminating real and synthetic super-resolved audio samples using embedding-based classifiers

Add code
Jan 06, 2026
Viaarxiv icon

Tigrinya Number Verbalization: Rules, Algorithm, and Implementation

Add code
Jan 06, 2026
Viaarxiv icon

LTX-2: Efficient Joint Audio-Visual Foundation Model

Add code
Jan 06, 2026
Viaarxiv icon

X-MuTeST: A Multilingual Benchmark for Explainable Hate Speech Detection and A Novel LLM-consulted Explanation Framework

Add code
Jan 06, 2026
Viaarxiv icon

ToxiGAN: Toxic Data Augmentation via LLM-Guided Directional Adversarial Generation

Add code
Jan 06, 2026
Viaarxiv icon

Towards Fine-Grained and Multi-Granular Contrastive Language-Speech Pre-training

Add code
Jan 06, 2026
Viaarxiv icon

Interpretable All-Type Audio Deepfake Detection with Audio LLMs via Frequency-Time Reinforcement Learning

Add code
Jan 06, 2026
Viaarxiv icon

XLSR-MamBo: Scaling the Hybrid Mamba-Attention Backbone for Audio Deepfake Detection

Add code
Jan 06, 2026
Viaarxiv icon

Linear Script Representations in Speech Foundation Models Enable Zero-Shot Transliteration

Add code
Jan 06, 2026
Viaarxiv icon

Advancing Assistive Robotics: Multi-Modal Navigation and Biophysical Monitoring for Next-Generation Wheelchairs

Add code
Jan 06, 2026
Viaarxiv icon