speech


Enhancing time-frequency resolution with optimal transport and barycentric fusion of multiple spectrogram

Add code
Apr 16, 2026
Viaarxiv icon

Explain the Flag: Contextualizing Hate Speech Beyond Censorship

Add code
Apr 16, 2026
Viaarxiv icon

AIPC: Agent-Based Automation for AI Model Deployment with Qualcomm AI Runtime

Add code
Apr 16, 2026
Viaarxiv icon

VoxSafeBench: Not Just What Is Said, but Who, How, and Where

Add code
Apr 16, 2026
Viaarxiv icon

Pushing the Limits of On-Device Streaming ASR: A Compact, High-Accuracy English Model for Low-Latency Inference

Add code
Apr 16, 2026
Viaarxiv icon

The Acoustic Camouflage Phenomenon: Re-evaluating Speech Features for Financial Risk Prediction

Add code
Apr 16, 2026
Viaarxiv icon

Comparison of Modern Multilingual Text Embedding Techniques for Hate Speech Detection Task

Add code
Apr 16, 2026
Viaarxiv icon

Giving Faces Their Feelings Back: Explicit Emotion Control for Feedforward Single-Image 3D Head Avatars

Add code
Apr 16, 2026
Viaarxiv icon

UniPASE: A Generative Model for Universal Speech Enhancement with High Fidelity and Low Hallucinations

Add code
Apr 16, 2026
Viaarxiv icon

WavAlign: Enhancing Intelligence and Expressiveness in Spoken Dialogue Models via Adaptive Hybrid Post-Training

Add code
Apr 16, 2026
Viaarxiv icon