speech


CLARITY: Contextual Linguistic Adaptation and Accent Retrieval for Dual-Bias Mitigation in Text-to-Speech Generation

Add code
Nov 14, 2025
Viaarxiv icon

Language-Aided State Estimation

Add code
Nov 14, 2025
Viaarxiv icon

CAT-Net: A Cross-Attention Tone Network for Cross-Subject EEG-EMG Fusion Tone Decoding

Add code
Nov 14, 2025
Viaarxiv icon

Proactive Hearing Assistants that Isolate Egocentric Conversations

Add code
Nov 14, 2025
Viaarxiv icon

Real-Time Speech Enhancement via a Hybrid ViT: A Dual-Input Acoustic-Image Feature Fusion

Add code
Nov 14, 2025
Viaarxiv icon

Speech-Aware Long Context Pruning and Integration for Contextualized Automatic Speech Recognition

Add code
Nov 14, 2025
Viaarxiv icon

Analysing Personal Attacks in U.S. Presidential Debates

Add code
Nov 14, 2025
Viaarxiv icon

Speech-Audio Compositional Attacks on Multimodal LLMs and Their Mitigation with SALMONN-Guard

Add code
Nov 14, 2025
Figure 1 for Speech-Audio Compositional Attacks on Multimodal LLMs and Their Mitigation with SALMONN-Guard
Figure 2 for Speech-Audio Compositional Attacks on Multimodal LLMs and Their Mitigation with SALMONN-Guard
Figure 3 for Speech-Audio Compositional Attacks on Multimodal LLMs and Their Mitigation with SALMONN-Guard
Figure 4 for Speech-Audio Compositional Attacks on Multimodal LLMs and Their Mitigation with SALMONN-Guard
Viaarxiv icon

HI-TransPA: Hearing Impairments Translation Personal Assistant

Add code
Nov 14, 2025
Viaarxiv icon

SpikCommander: A High-performance Spiking Transformer with Multi-view Learning for Efficient Speech Command Recognition

Add code
Nov 13, 2025
Viaarxiv icon