speech


SimulU: Training-free Policy for Long-form Simultaneous Speech-to-Speech Translation

Mar 11, 2026
Via arXiv

Beyond Deep Learning: Speech Segmentation and Phone Classification with Neural Assemblies

Mar 11, 2026
Via arXiv

Synthetic Data Domain Adaptation for ASR via LLM-based Text and Phonetic Respelling Augmentation

Mar 11, 2026
Via arXiv

EPIC-EuroParl-UdS: Information-Theoretic Perspectives on Translation and Interpreting

Mar 10, 2026
Via arXiv

The Patrologia Graeca Corpus: OCR, Annotation, and Open Release of Noisy Nineteenth-Century Polytonic Greek Editions

Mar 10, 2026
Via arXiv

Reading the Mood Behind Words: Integrating Prosody-Derived Emotional Context into Socially Responsive VR Agents

Mar 10, 2026
Via arXiv

SPAR-K: Scheduled Periodic Alternating Early Exit for Spoken Language Models

Mar 10, 2026
Via arXiv

DuplexCascade: Full-Duplex Speech-to-Speech Dialogue with VAD-Free Cascaded ASR-LLM-TTS Pipeline and Micro-Turn Optimization

Mar 10, 2026
Via arXiv

Speech-Omni-Lite: Portable Speech Interfaces for Vision-Language Models

Mar 10, 2026
Via arXiv

Acoustic and Semantic Modeling of Emotion in Spoken Language

Mar 10, 2026
Via arXiv