speech


Advancing Assistive Robotics: Multi-Modal Navigation and Biophysical Monitoring for Next-Generation Wheelchairs

Jan 06, 2026
Via arXiv

Boosting Accuracy and Interpretability in Multilingual Hate Speech Detection Through Layer Freezing and Explainable AI

Jan 06, 2026
Via arXiv

Multi-channel multi-speaker transformer for speech recognition

Jan 06, 2026
Via arXiv

Tigrinya Number Verbalization: Rules, Algorithm, and Implementation

Jan 06, 2026
Via arXiv

X-MuTeST: A Multilingual Benchmark for Explainable Hate Speech Detection and A Novel LLM-consulted Explanation Framework

Jan 06, 2026
Via arXiv

Vclip: Face-based Speaker Generation by Face-voice Association Learning

Jan 06, 2026
Via arXiv

Interpretable All-Type Audio Deepfake Detection with Audio LLMs via Frequency-Time Reinforcement Learning

Jan 06, 2026
Via arXiv

ToxiGAN: Toxic Data Augmentation via LLM-Guided Directional Adversarial Generation

Jan 06, 2026
Via arXiv

Linear Script Representations in Speech Foundation Models Enable Zero-Shot Transliteration

Jan 06, 2026
Via arXiv

Discriminating real and synthetic super-resolved audio samples using embedding-based classifiers

Jan 06, 2026
Via arXiv